• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Construction of Information Retrieval Infrastructure Based on Structural Natural Language Processing

Planned

Project AreaCyber Infrastructure for the Information-explosion Era
Project/Area Number 19024040
Research Category

Grant-in-Aid for Scientific Research on Priority Areas

Allocation TypeSingle-year Grants
Review Section Science and Engineering
Research InstitutionKyoto University

Principal Investigator

KUROHASHI Sadao  Kyoto University, 大学院・情報学研究科, 教授 (50263108)

Co-Investigator(Kenkyū-buntansha) KAWAHARA Daisuke  京都大学, 大学院・情報学研究科, 准教授 (10450694)
SHIBATA Tomohide  京都大学, 大学院・情報学研究科, 助教 (70452315)
Research Collaborator SHINZATO Keiji  京都大学, 大学院・情報学研究科, 特定研究員
SASANO Ryohei  東京工業大学, 精密工学研究所, 助教 (70603918)
Project Period (FY) 2007 – 2010
Project Status Completed (Fiscal Year 2010)
Budget Amount *help
¥35,400,000 (Direct Cost: ¥35,400,000)
Fiscal Year 2010: ¥9,000,000 (Direct Cost: ¥9,000,000)
Fiscal Year 2009: ¥9,000,000 (Direct Cost: ¥9,000,000)
Fiscal Year 2008: ¥8,700,000 (Direct Cost: ¥8,700,000)
Fiscal Year 2007: ¥8,700,000 (Direct Cost: ¥8,700,000)
Keywords自然言語処理 / 情報検索 / 述語項構造 / 柔軟マッチング / クラスタリング / 同義関係 / 意味サーチ
Research Abstract

The essential purpose of Information Retrieval is not to get relevant documents, but to obtain relevant information and knowledge. In order to achieve this, we believe that text understanding by machine, or Natural Language Processing is the most important aspect. This research project constructed IR infrastructure based on structural NLP, analyzing predicate argument structures in texts, handling expressive diversity in natural language, and providing a bird's-eye view towards a given topic by organizing and relating information.

Report

(6 results)
  • 2010 Annual Research Report   Final Research Report ( PDF )
  • 2009 Annual Research Report   Self-evaluation Report ( PDF )
  • 2008 Annual Research Report
  • 2007 Annual Research Report

Research Products

(41 results)

All 2010 2009 2008 2007 Other

All Journal Article (15 results) (of which Peer Reviewed: 15 results) Presentation (21 results) Remarks (5 results)

  • [Journal Article] The Effect of Corpus Size on Case Frame Acquisition for Predicate-Argument Structure Analysis2010

    • Author(s)
      Ryohei Sasano, Daisuke Kawahara, Sadao Kurohashi
    • Journal Title

      IEICE TRANSACTIONS on Information and Systems Vol.E93-D, No.6

      Pages: 1361-1368

    • NAID

      10027987438

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Journal Article] 形態論的制約を用いたオンライン未知語獲得2010

    • Author(s)
      村脇有吾, 黒橋禎夫
    • Journal Title

      自然言語処理 Vol.17, No.1

      Pages: 55-75

    • NAID

      10027015968

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Journal Article] The Effect of Corpus Size on Case Frame Acquisition for Predicate-Argument Structure Analysis2010

    • Author(s)
      Ryohei Sasano, Daisuke Kawahara, Sadao Kurohashi
    • Journal Title

      IEICE TRANSACTIONS on Information and Systems

      Volume: Vol.E93-D Pages: 1361-1368

    • NAID

      10027987438

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 形態論的制約を用いたオンライン未知語獲得2010

    • Author(s)
      村脇有吾, 黒橋禎夫
    • Journal Title

      自然言語処理 Vol.17,No.1

      Pages: 55-75

    • NAID

      10027015968

    • Related Report
      2009 Self-evaluation Report
    • Peer Reviewed
  • [Journal Article] 同一文抽出に基づく類似ページの検出と分類2010

    • Author(s)
      柴田知秀, 姜ナウン, 黒橋禎夫
    • Journal Title

      人工知能学会論文誌 Vol.25,No.1

      Pages: 224-232

    • NAID

      130000151253

    • Related Report
      2009 Self-evaluation Report
    • Peer Reviewed
  • [Journal Article] 形態論的制約を用いたオンライン未知語獲得2010

    • Author(s)
      村脇有吾, 黒橋禎夫
    • Journal Title

      自然言語処理 17

      Pages: 55-75

    • NAID

      10027015968

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 同一文抽出に基づく類似ページの検出と分類2010

    • Author(s)
      柴田知秀, 姜ナウン, 黒橋禎夫
    • Journal Title

      人工知能学会論文誌 25

      Pages: 224-232

    • NAID

      130000151253

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] キーワード蒸留型クラスタリングによる大規模ウェブ情報の俯瞰2009

    • Author(s)
      馬場康夫, 新里圭司, 柴田知秀, 黒橋禎夫
    • Journal Title

      情報処理学会論文誌 Vol.50, No.4

      Pages: 1399-1409

    • NAID

      110007970430

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Journal Article] キーワード蒸留型クラスタリングによる大規模ウェブ情報の俯瞰2009

    • Author(s)
      馬場康夫, 新里圭司, 柴田知秀, 黒橋禎夫
    • Journal Title

      情報処理学会論文誌 Vol.50,No.4

      Pages: 1399-1409

    • NAID

      110007970430

    • Related Report
      2009 Self-evaluation Report
    • Peer Reviewed
  • [Journal Article] キーワード蒸留型クラスタリングによる大規模ウェブ情報の俯瞰2009

    • Author(s)
      馬場康夫, 新里圭司, 柴田知秀, 黒橋禎夫
    • Journal Title

      情報処理学会論文誌 50

      Pages: 1399-1409

    • NAID

      110007970430

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 二段階の機械学習を用いたボトムアップ型の固有表現認識2009

    • Author(s)
      船山弘孝, 柴田知秀, 黒橋禎夫
    • Journal Title

      第8回情報科学技術フォーラム 第2分冊

      Pages: 19-26

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 大域的情報を用いた日本語固有表現認識2008

    • Author(s)
      笹野遼平, 黒橋禎夫
    • Journal Title

      情報処理学会論文誌 Vol.49, No.11

      Pages: 3765-3776

    • NAID

      40019554496

    • Related Report
      2008 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 自動獲得した名詞関係辞書に基づく共参照解析の高度化2008

    • Author(s)
      笹野遼平, 黒橋禎夫
    • Journal Title

      自然言語処理 vol.15, No.5

      Pages: 99-118

    • NAID

      10024449000

    • Related Report
      2008 Annual Research Report
    • Peer Reviewed
  • [Journal Article] SYNGRAPH: A flexible matching method based on synonymous expression extraction from an ordinary dictionary and a web corpus2008

    • Author(s)
      Tomohide Shibata, Michitaka Odani, Jun Harashima, Takashi Oonishi, Sadao Kurohashi
    • Journal Title

      Third International Joint Conference on Natural Language Processing

      Pages: 787-792

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 自動構築した大規模格フレームに基づく構文・格解析の統合的確率モデル2007

    • Author(s)
      河原大輔, 黒橋禎夫
    • Journal Title

      自然言語処理 14-4

      Pages: 67-81

    • NAID

      10019567959

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Presentation] Exploiting Term Importance Categories and Dependency Relations for Natural Language Search2010

    • Author(s)
      Keiji Shinzato, Sadao Kurohashi
    • Organizer
      The Second Workshop on NLPIX
    • Place of Presentation
      Beijing, China
    • Year and Date
      2010-08-28
    • Related Report
      2010 Annual Research Report
  • [Presentation] Summarizing Search Results using PLSI2010

    • Author(s)
      Jun Harashima, Sadao Kurohashi
    • Organizer
      The Second Workshop on NLPIX 2010
    • Place of Presentation
      Beijing, China
    • Year and Date
      2010-08-28
    • Related Report
      2010 Annual Research Report
  • [Presentation] Identifying Contradictory and Contrastive Relations between Statements to Outline Web Information on a Given Topic2010

    • Author(s)
      Daisuke Kawahara, Kentaro Inui, Sadao Kurohashi
    • Organizer
      The 23rd International Conference on Computational Linguistics
    • Place of Presentation
      Beijing, China
    • Year and Date
      2010-08-27
    • Related Report
      2010 Annual Research Report
  • [Presentation] Semantic Classification of Automatically Acquired Nouns using Lexico-Syntactic Clues2010

    • Author(s)
      Yugo Murawaki, Sadao Kurohashi
    • Organizer
      23rd International Conference on Computational Linguistics
    • Place of Presentation
      Beijing, China
    • Year and Date
      2010-08-26
    • Related Report
      2010 Annual Research Report
  • [Presentation] Using Smaller Constituents Rather Than Sentences in Active Learning for Japanese Dependency Parsing2010

    • Author(s)
      Manabu Sassano, Sadao Kurohashi
    • Organizer
      The 48th Annual Meeting of the Association for Computational Linguistics
    • Place of Presentation
      Uppsala, Sweden
    • Year and Date
      2010-07-12
    • Related Report
      2010 Annual Research Report
  • [Presentation] Dependency Tree-based Sentiment Classification using CRFs with Hidden Variables2010

    • Author(s)
      Tetsuji Nakagawa, Kentaro Inui, Sadao Kurohashi
    • Organizer
      Human Language Technologies : The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics
    • Place of Presentation
      Los Angeles, U.S.A.
    • Year and Date
      2010-06-04
    • Related Report
      2010 Annual Research Report
  • [Presentation] Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation2010

    • Author(s)
      Daisuke Kawahara, Sadao Kurohashi
    • Organizer
      In Proceedings of the 7th International Conference on Language Resources and Evaluation(LREC10)pp.1389-1393
    • Place of Presentation
      Malta
    • Year and Date
      2010-05-20
    • Related Report
      2010 Final Research Report
  • [Presentation] Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation2010

    • Author(s)
      Daisuke Kawahara, Sadao Kurohashi
    • Organizer
      7th International Conference on Language Resources and Evaluation
    • Place of Presentation
      Malta
    • Year and Date
      2010-05-20
    • Related Report
      2010 Annual Research Report
  • [Presentation] Online Japanese Unknown Morpheme Detection using Orthographic Variation2010

    • Author(s)
      Yugo Murawaki, Sadao Kurohashi
    • Organizer
      The 7th International Conference on Language Resources and Evaluation
    • Place of Presentation
      Malta
    • Year and Date
      2010-05-19
    • Related Report
      2010 Annual Research Report
  • [Presentation] A Probabilistic Model for Associative Anaphora Resolution2009

    • Author(s)
      Ryohei Sasano, Sadao Kurohashi
    • Organizer
      Conference on Empirical Methods in Natural Language Processing
    • Place of Presentation
      Singapore, Singapore
    • Year and Date
      2009-08-06
    • Related Report
      2009 Annual Research Report
  • [Presentation] The Effect of Corpus Size on Case Frame Acquisition for Discourse Analysis2009

    • Author(s)
      Ryohei Sasano, Daisuke Kawahara, Sadao Kurohashi
    • Organizer
      North American Chapter of the Association for Computational Linguistics - Human Language Technologies
    • Place of Presentation
      Boulder, Colorado
    • Year and Date
      2009-06-03
    • Related Report
      2009 Self-evaluation Report
  • [Presentation] The Effect of Corpus Size on Case Frame Acquisition for Discourse Analysis2009

    • Author(s)
      Ryohei Sasano, Daisuke Kawahara.Sadao Kurohashi
    • Organizer
      North American Chapter of the Association for Computational Linguistics-Human Language Technologies
    • Place of Presentation
      Boulder, Colorado U.S.A.
    • Year and Date
      2009-06-03
    • Related Report
      2009 Annual Research Report
  • [Presentation] Online Acquisition of Japanese Unknown Morphemes using Morphological Constraints2008

    • Author(s)
      Yugo Murawaki and Sadao Kurohashi
    • Organizer
      EMNLP 2008 : Conference on Empirical Methods in Natural Language Processing
    • Place of Presentation
      Waikiki, Hawaii
    • Year and Date
      2008-10-25
    • Related Report
      2008 Annual Research Report
  • [Presentation] Coordination Disambiguation without Any Similarities2008

    • Author(s)
      Daisuke Kawahara and Sadao Kurohashi
    • Organizer
      22nd International Conference on Computational Linguistics
    • Place of Presentation
      Manchester, UK
    • Year and Date
      2008-08-19
    • Related Report
      2008 Annual Research Report
  • [Presentation] A Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution2008

    • Author(s)
      Ryohei Sasano, Daisuke Kawahara and Sadao Kurohashi
    • Organizer
      22nd International Conference on Computational Linguistics
    • Place of Presentation
      Manchester, UK
    • Year and Date
      2008-08-18
    • Related Report
      2008 Annual Research Report
  • [Presentation] A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure2008

    • Author(s)
      Keiji Shinzato, Daisuke Kawahara, Chikara Hashimoto and Sadao Kurohashi
    • Organizer
      6th International Conference on Language Resources and Evaluation
    • Place of Presentation
      Marrakech, Morocco
    • Year and Date
      2008-05-29
    • Related Report
      2008 Annual Research Report
  • [Presentation] 日本語Textual Entailmentのデータ構築と自動獲得した類義表現に基づく推論関係の認識2008

    • Author(s)
      小谷 通隆, 柴田 知秀, 中田 貴之, 黒橋 禎夫
    • Organizer
      言語処理学会 第14回年次大会
    • Place of Presentation
      東京大学
    • Year and Date
      2008-03-20
    • Related Report
      2007 Annual Research Report
  • [Presentation] 検索エンジン基盤TSUBAKIを用いた大規模ウェブ情報クラスタリングシステムの構築2008

    • Author(s)
      馬場 康夫, 新里 圭司, 黒橋 禎夫
    • Organizer
      情報処理学会自然言語処理研究会
    • Place of Presentation
      国立情報学研究所
    • Year and Date
      2008-01-22
    • Related Report
      2007 Annual Research Report
  • [Presentation] SYNGRAPH: A Flexible Matching Method based on Synonymous Expression Extraction from an Ordinary Dictionary and a Web Corpus2008

    • Author(s)
      Tomohide Shibata, Michitaka Odani, Jun Harashima, Takashi Oonishi, Sadao Kurohashi
    • Organizer
      In Proceedings of Third International Joint Conference on Natural Language Processing(IJCNLP2008, poster) pp.787-792
    • Place of Presentation
      Hyderabad, India
    • Year and Date
      2008-01-09
    • Related Report
      2010 Final Research Report
  • [Presentation] SYNGRAPH: A Flexible Matching Method based on Synonymous Expression Extraction from an Ordinary Dictionary and a Web Corpus2008

    • Author(s)
      Tomohide Shibata, Michitaka Odani, Jun Harashima, Takashi Oonishi, Sadao Kurohashi
    • Organizer
      Third International Joint Conference on Natural Language Processing
    • Place of Presentation
      Hyderabad, India
    • Year and Date
      2008-01-09
    • Related Report
      2009 Self-evaluation Report
  • [Presentation] TSUBAKI: An Open Search Engine Infrastructure for Developing New Information Access Methodology2008

    • Author(s)
      Keiji Shinzato, Tomohide Shibata, Daisuke Kawahara, Chikara Hashimoto, Sadao Kurohashi
    • Organizer
      In Proceedings of Third International Joint Conference on Natural Language Processing(IJCNLP2008) pp.189-196
    • Place of Presentation
      Hyderabad, India
    • Year and Date
      2008-01-08
    • Related Report
      2010 Final Research Report
  • [Remarks] 検索エンジン基盤TSUBAKI

    • URL

      http://tsubaki.ixnlp.nii.ac.jp/

    • Related Report
      2010 Final Research Report
  • [Remarks] 報道関連・検索は「キーワード」から「文章」へ, 日経産業新聞(2007年8月21日10面)

    • Related Report
      2009 Self-evaluation Report
  • [Remarks] 「情報大爆発」どうさばく, 朝日新聞be (2008年7月5日b3面)

    • Related Report
      2009 Self-evaluation Report
  • [Remarks] 情報爆発に立ち向かう, Newton 2009年8月号(2009年6月26日発売)ホームページ情報

    • Related Report
      2009 Self-evaluation Report
  • [Remarks] 検索エンジン基盤TSUBAKI URL

    • URL

      http://tsubaki.ixnlp.nii.ac.jp/

    • Related Report
      2009 Self-evaluation Report

URL: 

Published: 2007-03-31   Modified: 2018-03-28  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi