Project/Area Number |
09480067
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | Kobe Shoin Women's University (1999) Osaka University (1997-1998) |
Principal Investigator |
GUNJI Takao Kobe Shoin Women's University and College, Graduate School of Letters, Professor, 文学部, 教授 (10158892)
|
Co-Investigator(Kenkyū-buntansha) |
MIYATA Takashi Nara Institute of Science Technology, Graduate School of Information Science, Assistant Professor, 情報科学研究科, 助手 (00283929)
MATSUMOTO Yuji Nara Institute of Science Technology, Graduate School of Information Science, Professor, 情報科学研究科, 教授 (10211575)
MATSUI Michinao Kobe Shoin Women's University and College, Graduate School of Letters, Assistant Professor, 文学部, 講師 (00273714)
HASIDA Koiti Electrotechnical Laboratory, Department of Information Sciences, Director, 情報科学部, 部長
SIRAI Hidetosi Chukyo University, Faculty of Information Sciences, Associate Professor, 情報科学部, 助教授 (10134462)
|
Project Period (FY) |
1997 – 1999
|
Project Status |
Completed (Fiscal Year 1999)
|
Budget Amount *help |
¥14,400,000 (Direct Cost: ¥14,400,000)
Fiscal Year 1999: ¥4,200,000 (Direct Cost: ¥4,200,000)
Fiscal Year 1998: ¥5,200,000 (Direct Cost: ¥5,200,000)
Fiscal Year 1997: ¥5,000,000 (Direct Cost: ¥5,000,000)
|
Keywords | constraint-based grammar / continuous quantity / optimality / lexicon / electronic corpus / tag / unification / Japanese language |
Research Abstract |
This study attempted to set up a basis for designing grammatical framework for a comprehensive and systematic description of the Japanese language and developing a solid Japanese grammar system based on the constraint-based grammar formalism, in which there are no derivational concepts like transformations. We have achieved the following results : 1. Automatic extraction of lexical items from corpora. Given electronic corpora, e.g., the Mainichi Shimbun on CD-ROM, we have developed a system that automatically extracts lexical items from the corpora using Chasen with an entirely new set of grammar rules and dictionary. 2. Lexical description of major words and tag attachment based on it. Based on the feature structure commonly assumed in constraint-based Japanese grammar, we have comprehensive lexical description for major nouns and verbs. A set of tags for the given corpora was then selected and a tagging system for general use was designed. 3. Development of tool system for grammar development. We have developed a GUI editing system for feature structures and incorporated it into an integrated grammar development system. By using such a system in a parsing system, a number of problems in the grammar system were clarified. 4. Parsing as constraint transformation. We have developed a parsing system using general constraint processing procedures that don't depend on a particular kind of information (phonological, syntactic, or semantic) or the actual operation on it. Such a system was utilized in tagging and the design of dialog system. 5. Study of language acquisition process for infants. Using data from infants in the process of acquiring verbs, their morphological properties as well as the selection of subjects and word order were analyzed. We have also attempted to utilize tagged data from infants in the analysis of continuous acquisition process.
|