2010 Fiscal Year Final Research Report
Research of Automatic Annotation of Written Language Corpora
Project Area | Compilation of a balanced corpus of written Japanese: Infrastructure for the coming Japanese linguistics |
Project/Area Number |
18061005
|
Research Category |
Grant-in-Aid for Scientific Research on Priority Areas
|
Allocation Type | Single-year Grants |
Review Section |
Humanities and Social Sciences
|
Research Institution | Nara Institute of Science and Technology |
Principal Investigator |
MATSUMOTO Yuji Nara Institute of Science and Technology, 情報科学研究科, 教授 (10211575)
|
Co-Investigator(Kenkyū-buntansha) |
TOKUNAGA Takenobu 東京工業大学, 大学院・情報理工学研究科, 教授 (20197875)
INUI Kentaro 東北大学, 大学院・情報科学研究科, 教授 (60272689)
HASIDA Koiti 独立行政法人産業技術総合研究所, サービス工学研究センター, 次長 (00357766)
ASAHARA Masayuki 奈良先端科学技術大学院大学, 情報科学研究科, 助教 (80379528)
HASHIMOTO Taiichi 東京工業大学, 総合プロジェクト支援センター, 特任准教授 (10345382)
KOMACHI Mamoru 奈良先端科学技術大学院大学, 情報科学研究科, 助教 (60581329)
|
Project Period (FY) |
2006 – 2010
|
Keywords | コーパス / 形態素解析 / 統語解析 / 述語項構造解析 / アノテーション / 自然言語解析 / 照応解析 / 固有表現認識 / 機械学習 |
Research Abstract |
We developed various automatic annotation systems for Japanese corpora, as well as corpus annotation assistance tools for error correction of annotation and for flexible use of annotated corpora. The automatic annotation systems we developed range over morphological analysis, syntactic dependency analysis, coordination structure analysis, Named Entity recognizer, predicate argument structure analysis, anaphora and co-reference analysis, temporal relation analysis of events, and so on. We also developed annotated corpora with those information.
|
Research Products
(37 results)