Producing and Evaluating Encyclopedic Content by Reorganizing Heterogeneous Information

Research Project

Project/Area Number	17300028
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Research Field	Media informatics/Database
Research Institution	University of Tsukuba
Principal Investigator	FUJII Atsushi University of Tsukuba, Graduate School of Library, Information and Media Studies, Associate Professor (30302433)
Co-Investigator(Kenkyū-buntansha)	ISHIKAWA Tetsuya University of Tokyo, Historiographical Institute, Professor (20041808); ITOU Katunobu Hosei University, Faculty of Computer and Information Sciences, Professor (30356472); AKIBA Tomoyosi Toyohashi University of Technology, Department of Information and Computer Sciences, Associate Professor (00356346)
Project Period (FY)	2005 – 2007
Project Status	Completed (Fiscal Year 2007)
Budget Amount	¥10,150,000 (Direct Cost: ¥9,100,000, Indirect Cost: ¥1,050,000); Fiscal Year 2007: ¥4,550,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥1,050,000); Fiscal Year 2006: ¥3,100,000 (Direct Cost: ¥3,100,000); Fiscal Year 2005: ¥2,500,000 (Direct Cost: ¥2,500,000)
Keywords	World Wide Web / Encyclopedias / Multimedia / Natural language processing / Information retrieval / Speech recognition / User interfaces / Content production / Content construction
Research Abstract	We proposed an automatic method to extract term descriptions from the World Wide Web and have built a Web search site called "Cyclone" (http://cyclone.slis.tsukuba.ac.jp), where users can efficiently obtain encyclopedic term descriptions for specific word senses. Approximately 750,000 Japanese terms have been indexed as headwords. However, to explain certain headwords, specifically those related to entities such as devices and animals, it is useful to present a picture of the entity in addition to a textual description. Hand-crafted multimedia encyclopedias, such as Encarta, integrate text, sound, image, and video data to describe a single headword from different perspectives. However, due to the limitations of manual compilation, existing encyclopedias often lack new terms and new definitions for existing terms. In view of this problem, the objective of this research was to produce encyclopedic content by reorganizing heterogeneous information from the World Wide Web and TV broadcasting. We proposed a method for integrating images on the Web with textual descriptions in Cyclone. Our method resolves ambiguity in the meaning of an image by text analysis, so that images for a polysemous word, such as "hub" (network device or center of a wheel), are classified by word sense. We also proposed a method to associate text and video information, for which we integrated information retrieval and speech recognition technologies. In addition, to associate information across languages, we proposed lemmatization and transliteration methods. Our research is a step toward the automatic compilation of multimedia encyclopedias.

Report

(4 results)

2007 Annual Research Report, Final Research Report Summary
2006 Annual Research Report
2005 Annual Research Report

Research Products
(23 results)

Journal Article (19 results, of which Peer Reviewed: 5 results), Presentation (4 results)

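[Journal Article] OpinionReader: A System for Summarizing and Visualizing Subjective Information Towards Supporting Decision Making (2008)
- Author(s)
  Atsushi Fujii
- Journal Title

  IEICE Transactions on Information and Systems (Japanese Edition) J91-D No.2

  Pages: 459-470
- NAID
  110007385928
- Description
  From the Final Research Report Summary (Japanese version)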
- Related Report
  2007 Final Research Report Summary
- Peer Reviewed
[Journal Article] OpinionReader: A System for Summarizing and Visualizing Subjective Information Towards Supporting Decision Making (2008)
- Author(s)
  Atsushi Fujii
- Journal Title
  
  The Transactions of D-II Vol.J91-D, No.2
  
  Pages: 459-470
- Description
  From the Final Research Report Summary (English version)
- Related Report
  2007 Final Research Report Summary
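[Journal Article] OpinionReader: A System for Summarizing and Visualizing Subjective Information Towards Supporting Decision Making (2008)
- Author(s)
  Atsushi Fujii
- Journal Title

  IEICE Transactions on Information and Systems (Japanese Edition) J91-D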
  
  Pages: 459-470
- Related Report
  2007 Annual Research Report
- Peer Reviewed
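[Journal Article] A Probabilistic Chinese Character Selection Method for Transliteration into Chinese (2007)
- Author(s)
  HaiXiang Huang, Atsushi Fujii, Tetsuya Ishikawa
- Journal Title

  IEICE Transactions on Information and Systems (Japanese Edition) J90-D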
  
  Pages: 2914-2923
- NAID
  110007380598
- Related Report
  2007 Annual Research Report
- Peer Reviewed
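[Journal Article] Extraction and Organization of Character Information from Novel Texts (2007)
- Author(s)
  Kozue Baba
- Journal Title

  Proceedings of the 13th Annual Meeting of the Association for Natural Language Processing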
  
  Pages: 574-577
- Related Report
  2006 Annual Research Report
[Journal Article] LODEM: A system for on-demand video lectures (2006)
- Author(s)
  Atsushi Fujii, Katunobu Itou, Tetsuya Ishikawa
- Journal Title

  Speech Communication 48 No.5

  Pages: 516-531
- Description
  From the Final Research Report Summary (Japanese version)
- Related Report
  2007 Final Research Report Summary
- Peer Reviewed
[Journal Article] A system for on-demand video lectures (2006)
- Author(s)
  Atsushi Fujii, Katunobu Itou, and Tetsuya Ishikawa
- Journal Title
  
  Speech Communication Vol.48, No.5
  
  Pages: 516-531
- Description
  From the Final Research Report Summary (English version)
- Related Report
  2007 Final Research Report Summary
[Journal Article] LODEM: A system for on-demand video lectures (2006)
- Author(s)
  Atsushi Fujii
- Journal Title

  Speech Communication 48(5)
  
  Pages: 516-531
- Related Report
  2006 Annual Research Report
[Journal Article] A Bidirectional Transliteration Method for Traditional Mongolian and Modern Mongolian (2006)
- Author(s)
  Mandula
- Journal Title

  IPSJ Journal 47(8)
  
  Pages: 2733-2745
- Related Report
  2006 Annual Research Report
[Journal Article] A System for Summarizing and Visualizing Arguments in Subjective Documents: Toward Supporting Decision Making (2006)
- Author(s)
  Atsushi Fujii
- Journal Title
  
  Proceedings of COLING-ACL Workshop on Sentiment and Subjectivity in Text
  
  Pages: 15-22
- Related Report
  2006 Annual Research Report
[Journal Article] Modeling Impression in Probabilistic Transliteration into Chinese (2006)
- Author(s)
  LiLi Xu
- Journal Title
  
  Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
  
  Pages: 242-249
- Related Report
  2006 Annual Research Report
[Journal Article] Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue (2006)
- Author(s)
  Tomoyoshi Akiba
- Journal Title

  Proceedings of the 5th International Conference on Language Resources and Evaluation
  
  Pages: 1530-1535
- Related Report
  2006 Annual Research Report
[Journal Article] Extraction and Organization of Encyclopedic Knowledge Information Using the World Wide Web (2005)
- Author(s)
  Atsushi Fujii, Tetsuya Ishikawa
- Journal Title

  Systems and Computers in Japan 36 No.14

  Pages: 81-90
- NAID
  110006246740
- Description
  From the Final Research Report Summary (Japanese version)
- Related Report
  2007 Final Research Report Summary
- Peer Reviewed
[Journal Article] Extraction and Organization of Encyclopedic Knowledge Information Using the World Wide Web (2005)
- Author(s)
  Atsushi Fujii and Tetsuya Ishikawa
- Journal Title
  
  Systems and Computers in Japan Vol.36, No.14
  
  Pages: 81-90
- NAID
  110006246740
- Description
  From the Final Research Report Summary (English version)
- Related Report
  2007 Final Research Report Summary
[Journal Article] Extraction and Organization of Encyclopedic Knowledge Information Using the World Wide Web (2005)
- Author(s)
  Atsushi Fujii, Tetsuya Ishikawa
- Journal Title

  Systems and Computers in Japan Vol.36, No.14
  
  Pages: 81-90
- NAID
  110006246740
- Related Report
  2005 Annual Research Report
[Journal Article] Toward the Automatic Compilation of Multimedia Encyclopedias: Associating Images with Term Descriptions on the Web (2005)
- Author(s)
  Atsushi Fujii, Tetsuya Ishikawa
- Journal Title
  
  Proceedings of the 2005 IEEE/WIC/ACM International Joint Conference on Web Intelligence
  
  Pages: 536-542
- Related Report
  2005 Annual Research Report
[Journal Article] Image Retrieval and Disambiguation for Encyclopedic Web Search (2005)
- Author(s)
  Atsushi Fujii, Tetsuya Ishikawa
- Journal Title
  
  Proceedings of the 19th International Joint Conference on Artificial Intelligence
  
  Pages: 1598-1599
- Related Report
  2005 Annual Research Report
[Journal Article] Exploiting Passage Retrieval for N-Best Rescoring of Spoken Questions (2005)
- Author(s)
  Tomoyosi Akiba, Hiroyuki Abe
- Journal Title
  
  Proceedings of International Conference on Speech Communication and Technology
  
  Pages: 65-68
- Related Report
  2005 Annual Research Report
[Journal Article] Bi-directional Cross Language Question Answering using a Single Monolingual QA System (2005)
- Author(s)
  Kei Shimizu, Tomoyosi Akiba, Atsushi Fujii, Katunobu Itou
- Journal Title
  
  Proceedings of the Fifth NTCIR Workshop
  
  Pages: 236-241
- Related Report
  2005 Annual Research Report
[Presentation] Effects of Related Term Extraction in Transliteration into Chinese (2008)
- Author(s)
  HaiXiang Huang and Atsushi Fujii
- Organizer
  Proceedings of the Third International Joint Conference on Natural Language Processing
- Place of Presentation
  Hyderabad, India
- Year and Date
  2008-01-09
- Related Report
  2007 Annual Research Report
[Presentation] A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval (2008)
- Author(s)
  Badam-Osor Khaltar, Atsushi Fujii
- Organizer
  Proceedings of the Third International Joint Conference on Natural Language Processing
- Place of Presentation
  Hyderabad, India
- Year and Date
  2008-01-08
- Description
  From the Final Research Report Summary (Japanese version)
- Related Report
  2007 Final Research Report Summary
[Presentation] A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval (2008)
- Author(s)
  Badam-Osor Khaltar and Atsushi Fujii
- Organizer
  The Third International Joint Conference on Natural Language Processing
- Place of Presentation
  Hyderabad, India
- Year and Date
  2008-01-08
- Description
  From the Final Research Report Summary (English version)
- Related Report
  2007 Final Research Report Summary
[Presentation] A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval (2008)
- Author(s)
  Badam-Osor Khaltar and Atsushi Fujii
- Organizer
  Proceedings of the Third International Joint Conference on Natural Language Processing
- Place of Presentation
  Hyderabad, India
- Year and Date
  2008-01-08
- Related Report
  2007 Annual Research Report
