1998 Fiscal Year Final Research Report Summary

Development of an Integrated Tool System for Technical Term Auto-Extraction and Knowledge Acquisition from Corpora

Research Project

Project/Area Number	08558027
Research Category	Grant-in-Aid for Scientific Research (A)
Allocation Type	Single-year Grants
Section	展開研究
Research Field	Intelligent informatics
Research Institution	The University of Tokyo
Principal Investigator	TSUJI Junichi The University of Tokyo, Graduate School of Science, Professor, 大学院・理学系研究科, 教授 (20026313)
Co-Investigator(Kenkyū-buntansha)	IKEHARA Satoru The University of Tottori, Faculty of Engineering, Professor, 工学部, 教授 (70283968) KAGEURA Kyo National Center for Science Information Systems, Associate Professor, 助教授 (00211152) KOYAMA Teruo National Center for Science Information Systems, Professor, 教授 (80124410) KIYONO Masaki Matsushita Electric Industrial company, Research institute of Tokyo, research worker, 東京研究所, 研究員
Project Period (FY)	1996 – 1998
Keywords	knowledge acquisition / semantic classification / database / technical term extraction
Research Abstract	The goal of this project was to provide the systems that can acquire knowledge on terminology from texts in a semi-automatic manner. In order to accomplish the goal, we have developed the following three systems. 1. Central Database for Terminology : We have created a database system for terminology by integrating the text/lexicon database developed by EDR and the programming language LiLFeS, which was developed at University of Tokyo for easy and flexible treatment linguistic entities By this system, we can perform a systematic maintenance of the knowledge acquired by the following two systems. 2. Systems for term recognition : The research group in the NACSIS introduced a statistical metric to identify technical terminology in texts, and built the programs that can recognize terms using this metric. The group in University of Tokyo attacked the same problem in a different perspective, and succeeded in providing a term recognition method based on character n-grams. Those programs are integrated so that they can work as a front end of the database system described in 1. 3. Systems for acquiring ontological knowledge on terms : The research group in University of Tokyo developed the programs for obtaining semantic classifications of words according to surface clues appearing in texts. The Matsushita research group developed a similar technique using deeper syntactic structures of texts. Those systems were applied to the documents in Genome texts, the news articles about stock markets and so on.

Research Products
(12 results)

All Other

All Publications (12 results)

[Publications] T.Koyama: "Research on Natural Low Database"Proc.JCKBSE'96. 242-245 (1996)
- Description
  「研究成果報告書概要(和文)」より
[Publications] K.Kageura: "Some Statistical Characterizations of Terminological and Non-Terminological Elements Evaluation and Examination in Tepanese Technical Abstiacts"TKE'96. 131-138 (1996)
- Description
  「研究成果報告書概要(和文)」より
[Publications] J.Tsujii: "Analysis of Word Structure of Medical Synonyms"TKE'96. 190-196 (1996)
- Description
  「研究成果報告書概要(和文)」より
[Publications] K.Kageura: "A Statistical Analysis of Morphemes in Japanese Terminorogy"COLING-ACL'98. 638-645 (1998)
- Description
  「研究成果報告書概要(和文)」より
[Publications] T.Makino,K.Torisawa,J.Tsujii: "LiLFeS-Practical Programming Language for Typed Feature Structures"Proc.NLPRS'97. 239-244 (1997)
- Description
  「研究成果報告書概要(和文)」より
[Publications] T.Seki*,H.S.Park,J.Tsujii: "Identifying the Interaction between Genes and Gene Products Based on Frequently Seen Verbs in Medline Abstracts"Genome Informatics. 9. 62-71 (1998)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Teruo Koyama: "Research on Natural Law Database"Proceedings of JCKBSE'96. 242-245 (1996)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kyo Kageura: "Some Statistical Characterizations of Terminological and Non-Terminological Elements : Evaluation and Examination in Japanese Technical Abstracts"Proceedings of TKE'96. 131-138 (1996)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Junichi Tsujii: "Analysis of World Structure of Medical Synonyms"Proceedings of TKE'96. 190-196 (1996)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Kyo Kageura: "A Statistical Analysis of Morphemes In Japanese Terminology"Proceedings of COLING'98. 638-645 (1998)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Takaki Makino, Kentaro Torisawa, Junichi Tsujii: "LiLFeS-Practical Programming Language for Typed Feature Structures"Proceedings of NLPRS'97. 239-244 (1997)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Tsuyoshi Sekimizu, H. S. Park, Junichi Tsujii: "Identifying the Interaction between Genes and Gene Products Based on Frequently Seen Verbs In Medline Abstracts"Proceedings of Genome Informatics. Vol.9. 62-71 (1998)
- Description
  「研究成果報告書概要(欧文)」より

1998 Fiscal Year Final Research Report Summary

Development of an Integrated Tool System for Technical Term Auto-Extraction and Knowledge Acquisition from Corpora

Principal Investigator

TSUJI Junichi The University of Tokyo, Graduate School of Science, Professor, 大学院・理学系研究科, 教授 (20026313)

Research Products

[Publications] T.Koyama: "Research on Natural Low Database"Proc.JCKBSE'96. 242-245 (1996)

Description

[Publications] K.Kageura: "Some Statistical Characterizations of Terminological and Non-Terminological Elements Evaluation and Examination in Tepanese Technical Abstiacts"TKE'96. 131-138 (1996)

Description

[Publications] J.Tsujii: "Analysis of Word Structure of Medical Synonyms"TKE'96. 190-196 (1996)

Description

[Publications] K.Kageura: "A Statistical Analysis of Morphemes in Japanese Terminorogy"COLING-ACL'98. 638-645 (1998)

Description

[Publications] T.Makino,K.Torisawa,J.Tsujii: "LiLFeS-Practical Programming Language for Typed Feature Structures"Proc.NLPRS'97. 239-244 (1997)

Description

[Publications] T.Seki*,H.S.Park,J.Tsujii: "Identifying the Interaction between Genes and Gene Products Based on Frequently Seen Verbs in Medline Abstracts"Genome Informatics. 9. 62-71 (1998)

Description

[Publications] Teruo Koyama: "Research on Natural Law Database"Proceedings of JCKBSE'96. 242-245 (1996)

Description

[Publications] Kyo Kageura: "Some Statistical Characterizations of Terminological and Non-Terminological Elements : Evaluation and Examination in Japanese Technical Abstracts"Proceedings of TKE'96. 131-138 (1996)

Description

[Publications] Junichi Tsujii: "Analysis of World Structure of Medical Synonyms"Proceedings of TKE'96. 190-196 (1996)

Description

[Publications] Kyo Kageura: "A Statistical Analysis of Morphemes In Japanese Terminology"Proceedings of COLING'98. 638-645 (1998)

Description

[Publications] Takaki Makino, Kentaro Torisawa, Junichi Tsujii: "LiLFeS-Practical Programming Language for Typed Feature Structures"Proceedings of NLPRS'97. 239-244 (1997)

Description

[Publications] Tsuyoshi Sekimizu, H. S. Park, Junichi Tsujii: "Identifying the Interaction between Genes and Gene Products Based on Frequently Seen Verbs In Medline Abstracts"Proceedings of Genome Informatics. Vol.9. 62-71 (1998)

Description