Project/Area Number |
16200009
|
Research Category |
Grant-in-Aid for Scientific Research (A)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | Nagoya University (2005-2007) Kyoto University (2004) |
Principal Investigator |
SATO Satoshi Nagoya University, Graduate School of Engineering, Professor (30205918)
|
Co-Investigator(Kenkyū-buntansha) |
UTSURO Takahito Tsukuba Univesity, Graduate School of Systems and Information Engineering, Associate Professor (90263433)
YAMAMOTO Kazuhide Nagaoka Univesity of Technology, Department of Electrical Engineering, Associate Professor (40359708)
INUI Kentaro Nara Institute of Science and Technology, Graduate School of Information Science, Associate Professor (60272689)
FUJITA Atsushi Nagoya University, Graduate School of Engineering, Assistant Professor (10402801)
|
Project Period (FY) |
2004 – 2007
|
Project Status |
Completed (Fiscal Year 2007)
|
Budget Amount *help |
¥48,620,000 (Direct Cost: ¥37,400,000、Indirect Cost: ¥11,220,000)
Fiscal Year 2007: ¥12,350,000 (Direct Cost: ¥9,500,000、Indirect Cost: ¥2,850,000)
Fiscal Year 2006: ¥13,000,000 (Direct Cost: ¥10,000,000、Indirect Cost: ¥3,000,000)
Fiscal Year 2005: ¥13,000,000 (Direct Cost: ¥10,000,000、Indirect Cost: ¥3,000,000)
Fiscal Year 2004: ¥10,270,000 (Direct Cost: ¥7,900,000、Indirect Cost: ¥2,370,000)
|
Keywords | Basic Vocabulary / Functional Expressions / Idioms / Paraphrasing / Readability / Dictionaries / テキストの難易度 / 電子化辞書 / 基本語 / 複合辞 / 用例データベース / 機能表現用例コーパス / 語構成 / 境界認定 / 文末整形 |
Research Abstract |
1. We have compiled JC2.2, a basin vocabulary of Japanese, which contains 5,814 basic words and terms including 211 functional expressions and 177 idioms 2. We have compiled a dictionary of Japanese functional expressions It contains 341 functional expressions and their 16,711 surface forms, which are hierarchically organized. By using this dictionary, we have implemented a paraphraser of Japanese functional expressions. 3. We have compiled MUST1, an example database of Japanese compound functional expressions. Combining this database and a machine lemming method, we have implemented a detector of Japanese fimctional exressions 4. We have compiled a textbook corpus which contains 1478 sample passages (about 1M characters) extracted from 127 textbooks of elementary school, junior high school, high school, and university. By using this textbook corpus as a criterion, we have implemented a readability analyzer of Japanese texts. For a given Japanese text, this analyzer produces one of 13 grades based on character unigram models constructed from the textbook corpus. The performance of this analyzer, which is measured by the correlation coefficient, is considerably high (R>0.9) . 5. We studied a dynamic phrasal thesaurus, which generates phrasal synonyms for a given ph ram. We also studied application-oriented paraphrasing methods': paraphrasing of verbal noun phrases into compound nouns paraphrasing of verb phrases for easy understanding, transforming sentence ends into news headline style. We have proposed a new summarization method, summarization by analogy. 6.We have released JC22, MUST1, and the readability analyzer to the public.
|