2002 Fiscal Year Final Research Report Summary

Grammatical Analysis of Japanese KANA-Sentences in Old Style by Computer

Research Project

Project/Area Number	13680492
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	情報システム学(含情報図書館学)
Research Institution	Musashi Institute of Technology
Principal Investigator	UEHARA Tetsuzou MUSASHI INST. OF TECHNOLOGY, FACULTY OF ENGINEERING, PROF., 工学部, 教授 (60257102)
Co-Investigator(Kenkyū-buntansha)	SHIMIZU Yumiko MUSASHI INST. OF TECHNOLOGY, FACULTY OF ENGINEERING, ASST. PROF., 環境情報学部, 助教授 (30298020) ARAI Shuuichi MUSASHI INST. OF TECHNOLOGY, FACULTY OF ENGINEERING, ASST. PROF., 工学部, 助教授 (20212590)
Project Period (FY)	2001 – 2002
Keywords	OLD JAPANESE TEXT / CORPUS / GRAMMAR / CONCEPT / NATURAL LANGUAGE PROCESSING / JAPANESE LANGUAGE PROC. / LEXICAL PROCESSING
Research Abstract	Dictionaries with concept data and corpora with part-of-speech tags are important tools in natural language processing. However, there is not any such data tool for Japanese old style language. Therefore we studied a method for preparing such corpora and realized an old-Japanese corpus with 150,000 words. We performed an experiment of lexical analysis on old Japanese by statistically learning of the part-of-speech tagged corpus. In addition, we studied a method of acquiring a common concept by finding a portion where concepts concerning the words concentrated on a thesaurus. In order to estimate the method, we applied it to English-Japanese Dictionary, which is a sequence of records each having an English headword and Japanese equivalent words. The result shows some usefulness. However, it is not sufficient to get the concepts of old-Japanese words using old Japanese dictionary. This is a future study.

Research Products
(8 results)

All Other

All Publications (8 results)

[Publications] 伴祐介: "EDR日本語コーパスにおけるキーワード検索のためのデータ構造の検討"情報処理学会第64回(平成14年)全国大会講演論文集. 2. 2-65-2-66 (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 池田直: "複数の概念を有する単語のセットからの共通概念の抽出"情報処理学会第64回(平成14年)全国大会講演論文集. 2. 2-75-2-76 (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 上原徹三: "古典の総索引からの品詞タグ付きコーパスの作成"言語処理学会論文誌「自然言語処理」. 10・2. 59-78 (2003)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 池田直: "多義の単語のセットからの共通概念の抽出"電子情報通信学会論文誌DII. J86DII・9. 6 (2003)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Ban Y.: "Data Structure For Retrieving By Keyword in EDR Japanese Corpus"Proc. 64-th Meeting of IPSJ. 2-65-2-66 (2002)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Ikeda S.: "Exftracting A Common Concept From A Set Of Words Each Of Which Has Several Concepts"Proc. 64-th Meeting of IPSJ. 2-75-2-76 (2002)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Uehara T.: "A Transformation of Concordance Data on Japanese Classics to Corpus Tagged with Part-of -Speech"Natural Language Processing. Vol.10, No.2. 59-78 (2003)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Ikeda S.: "Extracting A Common Concept From A Set Of Words Having Several Meanings, The IEICE Trans. on Information and Systems"6 (2003)
- Description
  「研究成果報告書概要(欧文)」より

2002 Fiscal Year Final Research Report Summary

Grammatical Analysis of Japanese KANA-Sentences in Old Style by Computer

Principal Investigator

UEHARA Tetsuzou MUSASHI INST. OF TECHNOLOGY, FACULTY OF ENGINEERING, PROF., 工学部, 教授 (60257102)

Research Products

[Publications] 伴 祐介: "EDR日本語コーパスにおけるキーワード検索のためのデータ構造の検討"情報処理学会第64回(平成14年)全国大会講演論文集. 2. 2-65-2-66 (2002)

Description

[Publications] 池田 直: "複数の概念を有する単語のセットからの共通概念の抽出"情報処理学会第64回(平成14年)全国大会講演論文集. 2. 2-75-2-76 (2002)

Description

[Publications] 上原 徹三: "古典の総索引からの品詞タグ付きコーパスの作成"言語処理学会論文誌「自然言語処理」. 10・2. 59-78 (2003)

Description

[Publications] 池田 直: "多義の単語のセットからの共通概念の抽出"電子情報通信学会論文誌DII. J86DII・9. 6 (2003)

Description

[Publications] Ban Y.: "Data Structure For Retrieving By Keyword in EDR Japanese Corpus"Proc. 64-th Meeting of IPSJ. 2-65-2-66 (2002)

Description

[Publications] Ikeda S.: "Exftracting A Common Concept From A Set Of Words Each Of Which Has Several Concepts"Proc. 64-th Meeting of IPSJ. 2-75-2-76 (2002)

Description

[Publications] Uehara T.: "A Transformation of Concordance Data on Japanese Classics to Corpus Tagged with Part-of -Speech"Natural Language Processing. Vol.10, No.2. 59-78 (2003)

Description

[Publications] Ikeda S.: "Extracting A Common Concept From A Set Of Words Having Several Meanings, The IEICE Trans. on Information and Systems"6 (2003)

Description

[Publications] 伴祐介: "EDR日本語コーパスにおけるキーワード検索のためのデータ構造の検討"情報処理学会第64回(平成14年)全国大会講演論文集. 2. 2-65-2-66 (2002)

[Publications] 池田直: "複数の概念を有する単語のセットからの共通概念の抽出"情報処理学会第64回(平成14年)全国大会講演論文集. 2. 2-75-2-76 (2002)

[Publications] 上原徹三: "古典の総索引からの品詞タグ付きコーパスの作成"言語処理学会論文誌「自然言語処理」. 10・2. 59-78 (2003)

[Publications] 池田直: "多義の単語のセットからの共通概念の抽出"電子情報通信学会論文誌DII. J86DII・9. 6 (2003)