• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1996 Fiscal Year Final Research Report Summary

Building a Japanese Parsed Corpus

Research Project

Project/Area Number 07558046
Research Category

Grant-in-Aid for Scientific Research (A)

Allocation TypeSingle-year Grants
Section試験
Research Field Intelligent informatics
Research InstitutionKYOTO UNIVERSITY

Principal Investigator

NAGAO Makoto  Kyoto University, Department of Electronics and Communication, Professor, 工学研究科, 教授 (30025960)

Co-Investigator(Kenkyū-buntansha) TSUNODA Tatsuhiko  Kyoto University, Department of Electronics and Communication, Instructor, 工学研究科, 助手 (10273468)
MARUYAMA Hiroshi  IBM Japan, Ltd., Tokyo Research Laboratory, Researcher, 東京基礎研究所, 研究員
KUROHASHI Sadao  Kyoto University, Department of Electronics and Communication, Instructor, 工学研究科, 助手 (50263108)
Project Period (FY) 1995 – 1996
KeywordsNatural Language Processing / Text Corpus / Morphological Analysis / Parsing
Research Abstract

The goal of the project was to construct a Japnese parsed corpus and to simultaneously improve a morphological analyzer and a parser. In the period of two years' project, we have achieved the following results :
(a) We enhanced our morphological analyzer JUMAN to handle a word string as a whole, and to find and enter problematic fixed expressions which were analyzed incorrectly by the normal morphological analysis. We released the enhanced version of JUMAN,JUMAN3.0 in October 1996.
(b) We enhanced the treatment of coordination structures and subordinate structures in our parser, KNP.KNP was also enhanced to handle several types of phrases with exceptional sentential functions. We released the enhanced version of KNP,KNP2.0 In March 1997.
(c) We made a mouse-based interface to help and to speed up the human correction of tags assigned by JUMAN and KNP.The interface also provides the retrieval functions for the corpus.
(d) As of March 1997, we have constructed about 20,000 sentences of parsed and manually-corrected corpus. Out of them, we opened about 10,000 sentences in March 1997.

  • Research Products

    (10 results)

All Other

All Publications (10 results)

  • [Publications] 山地 治: "連語登録による形態素解析システムJUMANの精度向上" 言語処理学会第2回年次大会. 73-76 (1996)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 黒橋 禎夫: "京都大学におけるテキストコーパスの作成" 情報処理学会「大規模テキストコーパスの作成及び共有の問題点」シンポジウム論文集. 19-26 (1996)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 黒橋 禎夫: "京都大学テキストコーパス・プロジェクト" 言語処理学会第3回年次大会. 115-118 (1997)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Sadao Kurohashi: "Building a Japanese Parsed Corpus while Improving the Parsing System" ACL/EACL'97 Workshop on Computational Environments for Grammar Development and Linguistic Engineering. (投稿中). (1997)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 黒橋 禎夫: "京都大学テキストコーパス・プロジェクト" 人工知能学会第11回全国大会. (発表予定). (1997)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Osamu Yamaji: "Improvements of Japanese Morphological Analyzer JUMAN by Handling Fixed Expressions" Proceedings of the Second Annual Conference of ANLP. 73-76 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Sadao Kurohashi: "Building a Taxt Corpus at Kyoto University" Proceedings of the ISPS Symposium on Building and Sharing Very Large Corpora. 19-26 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Sadao Kurohashi: "Kyoto University Taxt Corpus Project" Proceedings of the Third Annual Conference of ANLP. 115-118 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Sadao Kurohashi: "Building a Japanese Parsed Corpus while Improving the Parsing System" ACL EACL'97 Workshop on Computational Environments for Grammar Development and Linguistic Engineering. (submitted). (1997)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Sadao Kurohashi: "Kyoto University Text Corpus Project" Proceedings of the 11th Annual Conference of JSAI. (to be published). (1997)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 1999-03-09  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi