• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Production of a Prototype Lexical Database Featuring High-speed, High-accuracy Access and Lexical Knowledge Acquisition

Research Project

Project/Area Number 05558038
Research Category

Grant-in-Aid for Developmental Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field 情報システム学(含情報図書館学)
Research InstitutionThe Science University of Tokyo

Principal Investigator

FUJISAKI Hiroya  Science University of Tokyo, Faculty of Professor Industrial Science and Technology, 基礎工学部, 教授 (80010776)

Co-Investigator(Kenkyū-buntansha) KURASHIMA Tokihisa  Sanseido Publishing Company, Publishing Division Managing Director, 出版局, 局長
OHNO Sumio  Science University of Tokyo, Faculty of Industrial Science and Technology Resear, 基礎工学部, 助手 (80256677)
HIROSE Keikichi  University of Tokyo, Faculty of Engineering Professor, 工学部, 教授 (50111472)
KAMEDA Hiroyuki  Tokyo Engineering University, Faculty of Engineering Associate Professor, 工学部, 助教授 (00194994)
Project Period (FY) 1993 – 1994
Project Status Completed (Fiscal Year 1994)
Budget Amount *help
¥9,100,000 (Direct Cost: ¥9,100,000)
Fiscal Year 1994: ¥2,900,000 (Direct Cost: ¥2,900,000)
Fiscal Year 1993: ¥6,200,000 (Direct Cost: ¥6,200,000)
KeywordsHigh-speed, High-accuracy Lexical Access / Acquisition of Lexical Knowledge / Lexical Database / A Model of Hierarchical Structure of Information / Database Management System / Unknown Word / 語 知識の獲得 / 大規模テキストデータ / データ管理システム
Research Abstract

(1) Generation of lexical data : The data from the "Shin-Meikai Kokugo Jiten" (by Sanseido Publishing Co.) have been modified and expanded to generate the lexical data(approx. 170,000 words).
(2) Construction of a word lexicon : A partial lexicon containing only the words used in a target domain has been constructed based on a model of the hierarchical structure of information.
(3) Basic design of a database management system : The database management system consists of modules for the access, modification, expansion, acquisition, information structure management, and man-machine interface.
(4) Implementation of the database management system : The database management system has been implemented on a workstation using the C language.
(5) Design and implementation of detailed specifications of lexical data : Detailed specifications of the lexical data have been designed and the above-mentioned lexical data have been processed and implemented on a workstation.
(6) Implementation and evaluation of the access module and the knowledge acquisition module : The access module and the knowledge acquisition module of the database management system have been implemented on a workstation using the C language and prolog.
(7) Generation of the lexical database : The results of (5) and (6) have been integrated into a lexical database and its performance has been compared with that of a database using conventional access system both in the speed and the accuracy of access, confirming the advantages of the proposed system.
(8) Test of usefulness of the lexical database : The proposed lexical database has been tested in the morpheme analysis of newspaper articles and weather reports, and the results have confirmed that the system has achieved the expected speed and accuracy of access as well as the capability of unknown word acquisition, demonstrating the validity of the proposed lexical database.

Report

(3 results)
  • 1994 Annual Research Report   Final Research Report Summary
  • 1993 Annual Research Report
  • Research Products

    (18 results)

All Other

All Publications (18 results)

  • [Publications] 横田 和章: "認知単位を基本とする文解析手法の検討" 情報処理学会第48回(平成6年前期)全国大会講演論文集. 3. 69-70 (1994)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 亀田 弘之: "用例からの類推にもとづく知識の獲得と一般化について -未知複合語の獲得を中心にして-" 電子情報通信学会『言語・知識の獲得と運用』研究会資料LK93-2. 1-8 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 亀田 弘之: "日本語文章理解における未知語とその処理" 「知識科学の最前線」シンポジウム論文集. 17-27 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 横田 和章: "コーパスに基づく構文木の自動生成法" 電子情報通信学会総合大会情報・システム部門講演論文集. 1. 120 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 亀田 弘之: "べた書き日本語文からの未知語獲得システムの試作" 電子情報通信学会技術研究報告「思考と言語」TL94-11. 94. 17-24 (1994)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 亀田 弘之: "統語解析処理にもとづく未知語獲得システムの試作" 電子情報通信学会総合大会基礎・境界部門講演論文集. 474 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] YOKOTA,Kazuaki and FUJISAKI,Hiroya: "A Study on a Method of Text Analysis Based on Cognitive Units" Proceedings of 48th National Convention, the Information Processing Society of Japan. Vol.3. 69-70 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] KAMEDA,Hiroyuki: "A Note on Acquisition and Generalization of Knowledge Based on Analogy Reasoning from Examples" Report of the Technical Committee on Acquisition and Utilization of Language and Knowledge, the Institute of Electronics, Information and Communication Engineers. No.LK93-2. 1-6 (1993)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] KAMEDA,Hiroyuki: "Unknown Words and Their Processing in Machine Understanding of Japanese Text" Proceedings of Symposium on Frontiers of Knowledge Science. 17-27 (1993)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] YOKOTA,Kazuaki and FUJISAKI Hiroya: "A Method of Automatic Generation of Phrase Structures Based on a Corpus" 1995 National Convention Record of the Institute of Electronics, Information and Communication Engineers, Information and Systems Division. Vol.1. 120 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] KAMEDA,Hiroyuki and SAKURAI Tomoko: "Implementation of an Unknown Word Acquisition System from Japanese Texts" Technical Report, the Institute of Electronics, Information and Communication Enginerrs. TL94-11. 17-24 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] KAMEDA Hiroyuki and SAKURAI Tomoko: "Unknown Word Acquisition System Based on Syntactic Analysis" 1995 National Convention Record of the Institute of Electronics, Information and Communication Engineers, Engineering Sciences Division. 474 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 横田和章: "コーパスに基づく構文木の自動生成法" 電子情報通信学会総合大会情報・システム部門講演論文集. 1. 120 (1995)

    • Related Report
      1994 Annual Research Report
  • [Publications] 亀田弘之: "べた書き日本語文からの未知語獲得システムの試作" 電子情報通信学会技術研究報告「思考と言語」TL94-11. 17-24 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] 亀田弘之: "統語解析処理にもとづく未知語獲得システムの試作" 電子情報通信学会総合大会基礎・境界部門講演論文集. 474 (1995)

    • Related Report
      1994 Annual Research Report
  • [Publications] 横田和幸: "認知単位を基本とする文解析手法の検討" 情報処理学会第48回(平成6年前期)全国大会講演論文集. 3. 60-70 (1994)

    • Related Report
      1993 Annual Research Report
  • [Publications] 亀田弘之: "用例からの類推にもとづく知識の獲得と一般化について-未知複合語の獲得を中心にして-" 電子情報通信学会「言語・知識の獲得と運用」研究会資料. 1-8 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 亀田弘之: "日本語文章理解における未知語とその処理" 「知識科学の最前線」シンポジウム論文集. 17-27 (1993)

    • Related Report
      1993 Annual Research Report

URL: 

Published: 1993-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi