2014 Fiscal Year Final Research Report
Dual Bootstrap Mining with Feature Words and Contents Words
Project/Area Number |
24500176
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Multi-year Fund |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | Kyushu University |
Principal Investigator |
HIROKAWA Sachio 九州大学, 学内共同利用施設等, 教授 (40126785)
|
Co-Investigator(Kenkyū-buntansha) |
NAKATOH Tetsuya 九州大学, 情報基盤研究開発センター, 助教 (20253502)
|
Project Period (FY) |
2012-04-01 – 2015-03-31
|
Keywords | SVM / 属性選択 / ブートストラップ / 可視化 / 特徴語 / 機械学習 |
Outline of Final Research Achievements |
We developed a text mining method to extract feature words of search result using SVM (support vector machine). We succeeded to find small number of feature words that characterize the documents. We extended the bootstrap method for a measurement of generality and specificity of feature words. We confirmed the effectiveness of the methods by applying them to analyze the real data of scientific articles, students' free comments, English writing errors, security reports, medical records and Web documents.
|
Free Research Field |
テキストマイニング
|