Knowledge Discovery from Numbers in Text
Project/Area Number |
22700137
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Single-year Grants |
Research Field |
Intelligent informatics
|
Research Institution | The University of Tokyo |
Principal Investigator |
YOSHIDA Minoru The University of Tokyo, Information Technology Center, Assistant Professor (40361688)
|
Project Period (FY) |
2010 – 2011
|
Project Status |
Completed (Fiscal Year 2011)
|
Budget Amount |
¥2,860,000 (Direct Cost: ¥2,200,000, Indirect Cost: ¥660,000)
Fiscal Year 2011: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2010: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000)
|
Keywords | Natural language processing / Numerical information / Text mining / Suffix array / Clustering / Numerical search / Dirichlet process mixture model |
Research Abstract |
We studied methods for processing numbers that appear in text in order to discover relations between words and numbers. We indexed texts with suffix arrays augmented with functions that search digit strings as numeric values, so that queries can include numeric ranges. The search runs in reasonable time on large texts, which enabled us to obtain relations between words and numbers interactively. We also studied methods for mining texts that contain many numbers.
|
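The idea in the abstract, a suffix array whose queries can combine a word with a numeric range, can be illustrated with a minimal sketch. This is not the project's implementation: the function names (`build_suffix_array`, `find_prefix`, `search_number_range`) are hypothetical, the construction is deliberately naive, and the range test is applied as a filter after an ordinary prefix lookup, which is an assumption made for brevity. The record does not describe how the project's index handles digits internally.

```python
import re


def build_suffix_array(text):
    # Naive construction by sorting all suffixes; adequate for a small
    # illustration, far too slow for the large texts the abstract mentions.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_prefix(text, sa, prefix):
    """Return the start positions of every occurrence of `prefix`."""
    k = len(prefix)
    # First suffix whose leading k characters are >= prefix.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + k] < prefix:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # First suffix whose leading k characters are > prefix.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + k] <= prefix:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


_NUMBER = re.compile(r"\d+(?:\.\d+)?")


def search_number_range(text, sa, prefix, low, high):
    """Find `prefix` followed directly by a number within [low, high].

    Assumption: the digits are read as a value and filtered after the
    prefix lookup, rather than being searched inside the index itself.
    """
    hits = []
    for pos in find_prefix(text, sa, prefix):
        m = _NUMBER.match(text, pos + len(prefix))
        if m and low <= float(m.group()) <= high:
            hits.append((pos, float(m.group())))
    return hits


text = "height 170 cm, weight 65 kg, height 182 cm, height 95 cm"
sa = build_suffix_array(text)
print(search_number_range(text, sa, "height ", 150, 200))
```

Comparing the digits as a value rather than as a string is what lets a query such as "height between 150 and 200" skip `95`, even though the string `"95"` sorts after `"182"`.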
Report
(3 results)
Research Products
(20 results)
[Presentation] Web People Search 2010
Author(s)
Minoru Yoshida, Hiroshi Nakagawa
Organizer
Person Name Disambiguation and Other Problems (Tutorial), the 2nd Asian Conference on Machine Learning (ACML 2010)
Year and Date
2010-11-08
[Presentation] ITC-UT 2010
Author(s)
Minoru Yoshida, Shin Matsushima, Shingo Ono, Hiroshi Nakagawa
Organizer
Tweet Categorization by Query Categorization for On-line Reputation Management. WePS-3, CLEF 2010 Labs
Year and Date
2010-09-23