• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2003 Fiscal Year Final Research Report Summary

Concept Chemical Knowledge Representation

Research Project

Project/Area Number 13680433
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionThe University of Tokyo

Principal Investigator

ISHIZUKA Mitsuru  The University of Tokyo, Graduate School of Information Science and Technology, Professor, 大学院・情報理工学系研究科, 教授 (50114369)

Project Period (FY) 2001 – 2003
KeywordsNatural Language Expression / Knowledge Representation / Inference / Key word extraction / Important Sentence Extraction / Knowledge Sharing
Research Abstract

A huge quantity of electric documents mostly consisting of natural language texts are stored and circulated recently through the World Wide Web (WWW), etc. in the society. Although they include a variety of useful knowledge, it has been difficult for us and computers to fully utilize them as knowledge, such as deriving an answer by combining multiple pieces of knowledge. Toward one solution to this problem, we have proposed and developed a knowledge representation/inference scheme called "Concept Chemical Representation (CCR)", which representation is close to natural language expression and thus convenient for the conversion from natural language sentences.
When converting natural language texts into CCR efficiently, it is required to focus meaningful sentences and ignore other parts for avoiding the inclusion of useless components into the CCR knowledge base. Accordingly, we have developed the following extraction methods of keywords and important sentences from a document.
1) keyword extraction using the deviation statistics of word co-occurrence.
2) keyword extraction using the small world structure of word co-occurrence.
3) keyword extraction based on term activities measured as the human cognitive process for term recognition.
While most existing methods are based on the use of TF*IDF (term frequency * inverse document frequency), our above original methods are different from the existing ones. Our methods can be applied to English sentences as well as Japanese sentences.

  • Research Products

    (10 results)

All Other

All Publications (10 results)

  • [Publications] 松尾 豊, 石塚 満: "語の共起の統計的情報に基づく文書からのキーワード抽出アルゴリズム"人工知能学会論文誌. 17. 217-223 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 松尾 豊, 大澤幸生, 石塚 満: "Small World構造に基づく文書からのキーワード抽出"情報処理学会論文誌. 43. 1825-1833 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 松村真宏, 大澤幸生, 石塚 満: "語の活性度に基づくキーワード抽出法"人工知能学会論文誌. 17. 398-406 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] N.Matsumura, Y.Ohsawa, M.Ishizuka: "Automatic Indexing for Asserted Keywords from a Document"Jour.of New Generation Computing. 21. 37-48 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] N.Okazaki, Y.Matsuo, N.Matumura, M.Ishizuka: "Sentence Extraction by Spreading Activation through Sentence Similarity"IEICE Transactions of Information and Systems. E86-D. 1686-1694 (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Y.Matsuo, M.Ishizuka: "Keyword Extraction from A Document using Word Co-occurrence Statistical Information"Transactions of the Japanese Society for Artificial Intelligence. 17-3. 217-223 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Y.Matsuo, Y.Ohsawa, M.Ishizuka: "Keyword Extraction using Small World Structure in a Document"Transactions of the Information Processing Society of Japan. 43-6. 1825-1833 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] N.Matsumura, Y.Ohsawa, M.Ishizuka: "Automatic Indexing Based on Term Activity"Transactions of the Japanese Society for Artificial Intelligence. 17-4. 398-406 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] N.Matsumura, Y.Matsuo, Y.Ohsawa, M.Ishizuka: "Automatic Indexing for Extracting Asserted Keywords from a Document"Journal of New Generation Computing. 21-1. 37-47 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] N.Okazaki, Y.Matsuo, N.Matsumura, M.Ishizuka: "Sentence Extraction by Spreading Activation through Sentence Similarity"IEICE Transactions of Information and Systems. E86-D-9. 1686-1694 (2003)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2005-04-19  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi