• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Study on Information Retrieval based on Similarity Calculation of Intra-Document Structure

Research Project

Project/Area Number 11680383
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionYokohama National University

Principal Investigator

MORI Tatsunori  Yokohama National University, Faculty of Engineering, Associate, 工学部, 助教授 (70212264)

Co-Investigator(Kenkyū-buntansha) NAKAGAWA Hiroshi  University of Tokyo, Information Technology Center, Professor, 情報基盤センター, 教授 (20134893)
Project Period (FY) 1999 – 2000
Project Status Completed (Fiscal Year 2000)
Budget Amount *help
¥3,600,000 (Direct Cost: ¥3,600,000)
Fiscal Year 2000: ¥1,100,000 (Direct Cost: ¥1,100,000)
Fiscal Year 1999: ¥2,500,000 (Direct Cost: ¥2,500,000)
KeywordsRetrieval of similar documents / Extraction of Numerical Expressions / Extraction of Named Entity / Question Answering / Information Retrieval / Information Extraction
Research Abstract

The purpose of this research is establishment of the method for "content"-based information retrieval. In our research, the "content" is regarded as the combination of the following items : a) Logical structure of document annotated by tags, b) Text, and c) Information extracted by the technology of Information Extraction.
Through the two year research, we obtained the following results :
1. Extraction of structure of intra-documents based on similarity among passages :
By using not only intra-document information but also inter-document information, we improve the effectiveness of retrieving relevant portions of document.
2. Multi-strategic named entity recognizer based on machine learning and extraction patterns :
By combining those two types of strategies for named entity task, we improve the accuracy of recognition of named entities.
3. Extraction of numerical information and its application to Question Answering :
We consider "Question Answering" is the one of the ideal context retrieval system. Named entities correspond to the answer for the 4W-type questions. On the other hand, it is numerical expressions what corresponds to H-type questions. Therefore, we proposed a method to extract numerical expressions with its context as a part of a QA system.

Report

(3 results)
  • 2000 Annual Research Report   Final Research Report Summary
  • 1999 Annual Research Report
  • Research Products

    (15 results)

All Other

All Publications (15 results)

  • [Publications] 大森信行,岡村潤,森辰則,中川裕志: "情報検索手法を利用した関連マニュアル群のハイパーテキスト化"情報処理学会論文誌. 40・6. 2776-2784 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori,S.Tanaka,H.Nakagawa: "Similarity Calculation of Segment Retrieval for Aid in reading Related Documents"Proceedings of Natural Language Processing Pacific Rim Symposium '99. 178-183 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori,M.Matsuo,H.Nakagawa: "Zero pronoun resolution by Linguistic Constraints and Defaults"The Machine Translation Journal. 14・2-3. (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori,T.Kokubu,T.Tanaka: "Cross-Lingual Information Retrieval based on LSI with Multiple Word Spaces"Proceedings of NTCIR Workshop 2 Meeting. (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori,M.Kikuchi,K.Yoshida: "Term Weighting Method based on Information Gain Ratio for Summarizing Documents retrieved by IR systems"Proceedings of NTCIR Workshop 2 Meeting. (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] N.Ohmori, J.Okamura, T.Mori and H.Nakagawa: "Hypertextualization for Related Instruction Manuals Using the Techniques of Information Retrieval"Journal of Information Processing Society of Japan. Vol.40, No.6. 2776-2784 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori, S.Tanaka and H.Nakagawa: "Similarity Calculation of Segment Retrieval for Aid in reading Related Documents"Proceedings of Natural Language Processing Pacific Rim Symposium '99. 178-183 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori, M.Matsuo and H.Nakagawa: "Zero pronoun resolution by Linguistic Constraints and Defaults"The Machine Translation Journal. Vol.14, No.2-3. (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori, T.Kokubu and T.Tanaka: "Cross-Lingual Information Retrieval based on LSI with Multiple Word Spaces"Proceedings of NTCIR Workshop 2 Meeting. (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori, M.Kikuchi and K.Yoshida: "Term Weighting Method based on Information Gain Ratio for Summarizing Documents retrieved by IR systems"Proceedings of NTCIR Workshop 2 Meeting. (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] T.Mori,M.Matsuo,H.Nakagawa: "Zero pronoun resolution by Linguistic Constraints and Defaults"The Machine Translation Journal. 14・2-3. (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] T.Mori,T.Kokubu,T.Tanaka: "Cross-Lingual Information Retrieval based on LSI with Multiple Word Spaces"Proceedings of NTCIR Workshop 2 Meeting. (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] T.Mori,M.Kikuchi,K.Yoshida: "Term Weighting Method based on Information Gain Ratio for Summarizing Documents retrieved by IR systems"Proceedings of NTCIR Workshop 2 Meeting. (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 大森 信行,岡村 潤,森 辰則,中川 裕志: "情報検索手法を利用した関連マニュアル群のハイパーテキスト化"情報処理学会論文誌. 40・6. 2776-2784 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] T.Mori,S.Tanaka,H.Nakagawa: "Similarity Calculation of Segment Retrieval for Aid in reading Related Documents"Proceedings of Natural Language Processing Pacific Rim Symposium '99. 178-183 (1999)

    • Related Report
      1999 Annual Research Report

URL: 

Published: 1999-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi