2009 Fiscal Year Annual Research Report

自然言語意味概念の共通的記述に基づく次世代Web基盤環境構築とその知能化機能

Research Project

Project/Area Number	19200010
Research Institution	The University of Tokyo
Principal Investigator	石塚満 The University of Tokyo, 大学院・情報理工学系研究科, 教授 (50114369)
Keywords	Web知能化 / テキスト処理 / 概念表現 / 次世代Web基盤 / 意味計算 / 関係抽出
Research Abstract	情報流通と情報蓄積・共有,更には共創のグローバルな基盤に成長したWebを,コンピュータがその情報コンテンツの意味内容を把握できるようにして,検索・マイニング・編集・組織化等の意味に立ち入る操作を可能にする次世代知能化Web基盤に向けた研究開発を進めた.具体的には,メタデータ記述を中心とするSemantic Webとは異なり,我が国発の技術として我々が中心でW3Cで国際標準化を行っている,自然言語テキストが表す概念意味の共通的記述言語であるCDL(Concept Description Language)に関する研究を進めた.CDLに関する第一の技術課題は,テキストからCDLへの変換である.全自動機械翻訳が困難であるのと同様に,テキストからCDLへの全自動変換も困難であるので,人の負担の少ない半自動変換を目指し,語義(word sense)の選択と決定を介して,CDLへ変換するアプローチについて研究した.出来るだけの語義曖昧性解消をコンピュータにより行うことにより,人手介在の部分を減らすインタフェースの開発を行った.また,語句の単なるマッチッグでなく,意味的なマッチングを可能とし,CDLデータの意味的検索法の研究開発を行った. CDLは単語エンティティ間を関係識別子で結んで表すことをベースとしているが,関連研究として分布仮説に基づき,単語エンティティ対間の関係類似性をWeb検索エンジンの結果を利用して計算する手法を考案,開発した.この原理を活用し,(Tokyo, Japan), (?, France)のようなエンティティ対を入力として,?=Parisを答えるような潜在的関係検索エンジン(Latent Relational Search Engine)を開発した.これは特許出願も行った. 更に,Webから抽出した多数の任意エンティティ対から意味のある関係を有するエンティティ対を抽出する,新しい共クラスタリング法によるOpen Information Retrieval手法を創案,開発した.この研究成果は,2010年4月のWWW(Web技術関係で最も権威ある国際会議)にフルペーパーとして採録され,国際的に高い評価を得た.

Research Products
(6 results)

All 2010 2009 Other

All Journal Article (3 results) (of which Peer Reviewed: 3 results) Presentation (1 results) Remarks (1 results) Patent(Industrial Property Rights) (1 results)

[Journal Article] A Botton-up Approach to Sentence Ordering for Multi-document Summarization2010
- Author(s)
  Danushka Bollegala, N.Okazaki, 石塚満
- Journal Title
  
  Information Processing and Management 46
  
  Pages: 89-109
- Peer Reviewed
[Journal Article] ブログ上での話題伝播に注目した重要語判別2009
- Author(s)
  古川忠延, 松尾豊, 大向一輝, 内山幸樹, 石塚満
- Journal Title
  
  知能と情報(日本知能情報ファジィ学会誌) 21
  
  Pages: 557-566
- Peer Reviewed
[Journal Article] ISRST : Generating Interesting Multimedia Stories on the Web2009
- Author(s)
  Arturo Nakasone, H.Prendinger, 石塚満
- Journal Title
  
  Int'l Journal of Applied Artificial Intelligence 23
  
  Pages: 633-679
- Peer Reviewed
[Presentation] Measuring the Similarity between Implicit Semantic Relations from the Web2009
- Author(s)
  Danushka Bollegala, Y.Matsuo, M.Ishizuka
- Organizer
  18th Int'l World Wide Web Conf.(WWW2009)
- Place of Presentation
  マドリッド,スペイン
- Year and Date
  2009-04-24
[Remarks]
- URL
  http://www.miv.t.u-tokyo.ac.jp/
[Patent(Industrial Property Rights)] 検索方法及びシステム2009
- Inventor(s)
  ボッレーガラ・ダヌシカ,グェン・トアン・ドウク,石塚満
- Industrial Property Rights Holder
  東京大学
- Industrial Property Number
  特許出願2009-275762
- Filing Date
  2009-12-03

2009 Fiscal Year Annual Research Report

自然言語意味概念の共通的記述に基づく次世代Web基盤環境構築とその知能化機能

Principal Investigator

石塚 満 The University of Tokyo, 大学院・情報理工学系研究科, 教授 (50114369)

Research Products

[Journal Article] A Botton-up Approach to Sentence Ordering for Multi-document Summarization2010

Author(s)

Journal Title

[Journal Article] ブログ上での話題伝播に注目した重要語判別2009

Author(s)

Journal Title

[Journal Article] ISRST : Generating Interesting Multimedia Stories on the Web2009

Author(s)

Journal Title

[Presentation] Measuring the Similarity between Implicit Semantic Relations from the Web2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Remarks]

URL

[Patent(Industrial Property Rights)] 検索方法及びシステム2009

Inventor(s)

Industrial Property Rights Holder

Industrial Property Number

Filing Date

石塚満 The University of Tokyo, 大学院・情報理工学系研究科, 教授 (50114369)