2005 Fiscal Year Final Research Report Summary

Study on Information Utilization System for Heterogeneous Contents

Research Project

Project/Area Number	13224087
Research Category	Grant-in-Aid for Scientific Research on Priority Areas
Allocation Type	Single-year Grants
Review Section	Science and Engineering
Research Institution	National Institute of Informatics
Principal Investigator	ADACHI Jun National Institute of Informatics, Software Research Division, Professor, ソフトウェア研究系, 教授 (80143551)
Co-Investigator(Kenkyū-buntansha)	AIZAWA Akiko National Institute of Informatics, Research Center for Information Resources, Professor, 情報学資源研究センター, 教授 (90222447) KANDO Noriko National Institute of Informatics, Software Research Division, Professor, ソフトウェア研究系, 教授 (80270445) KAGEURA Kyo Tokyo University, Graduate school of Education, Associate Professor, 教育学研究科, 助教授 (00211152) TAKASU Atsuhiro National Institute of Informatics, Research Center for Testbeds and Prototyping, Professor, 実証研究センター, 教授 (90216648) AIHARA Kenro National Institute of Informatics, Software Research Division, Associate Professor, ソフトウェア研究系, 助教授 (90300706)
Project Period (FY)	2001 – 2005
Keywords	Informatics / Information Retrieval / Text Processing / Text Mining / Multimedia Processing / Data Engineering
Research Abstract	This project aims at developing technology for utilizing the heterogeneous contents. We studied link and structural analysis of Webs, cross-media processing technology, epistemological framework of the Web and developed corpora for evaluating information utilization methods for the Web. 1) We developed an information extraction and organization methods using the textual and graphical structure of the Web -Web page clustering methods based on the link structure -Topic tracking using non-linear time-content analysis 2) We proposed some advanced methods for processing and utilizing multimedia as follows, focusing on media heterogeneity: -topic detection from multilingual text collection -user adaptive text summarization based on content types -crossmedia search by enhancing annotation-based image retrieval model with content-based features -JuNii+: user interface for image retrieval -utilizing interview video archives for learning 3) We organized a series of evaluation workshops "NTCIR", in which a number of researchers participated to develop new testbeds, each of which consists of a common test data for research on heterogeneous digital content. As the results, for instance, we built up a terabyte-scale dataset by crawling the -jp domain, and established evaluation methodologies to meet the practical situation. These contributed to the progress of the research in this area 4) We analyzed the epistemological framework within which engineers process and model the Web information sources, contrasting it with the modern system of printed books. On the basis of the analysis, we concluded that it is hard to directly apply the model defined by the quintessentially modern concept of information accumulation as represented in the ideal of libraries, and showed that "information editing" would be necessary to explore fully the potential of web information sources.

Research Products
(34 results)

All 2006 2005 2004 2003 2002 2001 Other

All Journal Article (32 results) Book (2 results)

[Journal Article] Korean-Japanese Story Link Detection based on Event Term Weighting on Timelines and Multilingual Spaces2006
- Author(s)
  Lee, K-S., Kageura, K.
- Journal Title
  
  Information Processing and Management 42・2
  
  Pages: 935-946
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] An empirical study on retrieval models for different document genres : Patents and newspaper articles2006
- Author(s)
  Makoto Iwayama, Atsuhi Fujii, Noriko Kando, Yuzo Marukawa
- Journal Title
  
  Information Processing and Management 42・1
  
  Pages: 207-221
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Korean-Japanese Story Link Detection based on Event Term Weighting on Timelines and Multilingual Spaces2006
- Author(s)
  Lee, K-S., Kageura, K.
- Journal Title
  
  Information Processing and Management Vol.42-No.2
  
  Pages: 935-946
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] An empirical study on retrieval models for different document genres : Patents and newspaper articles2006
- Author(s)
  Makoto Iwayama, Atsuhi Fujii, Noriko Kando, Yuzo Marukawa
- Journal Title
  
  Information Processing and Management Vol.42, No.1
  
  Pages: 207-221
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] レコード同定問題に関する研究の課題と現状2005
- Author(s)
  相澤彰子, 大山敬三, 高須淳宏, 安達淳
- Journal Title
  
  電子情報通信学会論文誌,Dl J88-D1・3
  
  Pages: 576-589
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] リンク情報の利用によるWeb検索性能の改善2005
- Author(s)
  正田備也, 高須淳宏, 安達淳
- Journal Title
  
  情報処理学会論文誌「データベース」 SIG8(TOD26)
  
  Pages: 48-59
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Cross-language Information Retrieval : the Roard Ahead2005
- Author(s)
  Frederic C.Gey, Noriko Kando, Carol Peters
- Journal Title
  
  Information Processing and Managemen 41・3
  
  Pages: 415-431
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Are Open-domain Question Answering Technologies Useful for Information Access Dialogues? - An Empirical Study and a Proposal of a Novel Challenge2005
- Author(s)
  Tsuneaki Kato, Jun'ichi Fukumoto, Fumito Masui, Noriko Kando
- Journal Title
  
  ACM Transactions of Asian Language Information Processing 4・3
  
  Pages: 243-262
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] 利用者の情報要求を考慮した観点に基づく複数文書要約とその評価2005
- Author(s)
  関洋平, 江口浩二, 神門典子
- Journal Title
  
  情報処理学会論文誌データベース SIG8 (TOD26)・46
  
  Pages: 106-119
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Techniques and Research Trends in Record Linkage Studies2005
- Author(s)
  Akiko Aizawa, Atsuhiro Takasu, Keizo Oyama, Jun Adachi
- Journal Title
  
  Journal of IEICE D1,VOL.J88-D1-N0.3
  
  Pages: 576-589
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Improving Web search Performance with Hyperlink Information2005
- Author(s)
  Tomonari Masada, Atsuhiro Takasu, Jun Adachi
- Journal Title
  
  IPSJ Transactions on Databases Vol.46-STG8 (TOD 26)
  
  Pages: 48-59
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Cross-language Information Retrieval : the Roard Ahead2005
- Author(s)
  Frederic C.Gey, Noriko Kando, Carol Peters
- Journal Title
  
  Information Processing and Management Vol.41-No.3
  
  Pages: 415-431
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Multi-Document Viewpoint Summarization Based on Users' Information Needs and its Evaluation2005
- Author(s)
  Yohei Seki, Koji Eguchi, Noriko Kando
- Journal Title
  
  PSJ Transactions on Databases Vol.43, SIG8 (TOD26)
  
  Pages: 106-119
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Implicit Ambiguity Resolution Based on Cluster Analysis in Cross-Language Information Retrieval2004
- Author(s)
  Kyung-Soon Lee, Kyo Kageura, Key-Sun Choi.
- Journal Title
  
  Information Processing ＆ Management 40・1
  
  Pages: 145-159
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] テキスト認識エラーモデルによる引用文献文字列からの書誌要素の抽出2004
- Author(s)
  高須淳宏, 相原健郎
- Journal Title
  
  電子情報通信学会論文誌 J87-D-II・6
  
  Pages: 1298-1308
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Recent trends in computational terminology2004
- Author(s)
  Kageura, K., Daille, B., Nakagawa, H., Chien, L-F.
- Journal Title
  
  Terminology 10・1
  
  Pages: 1-21
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Decomposing the Web Graph into Parametarized Connected Components2004
- Author(s)
  Tomonari Masada, Atsuhiro Takasu, Jun Adachi
- Journal Title
  
  IEICE Transactions on Information and Systems E87-D, 2
  
  Pages: 380-388
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Implicit Ambiguity Resolution Based on Cluster Analysis in Cross-Language Information Retrieval2004
- Author(s)
  Kyung-Soon Lee, Kyo Kageura, Key-Sun Choi
- Journal Title
  
  Information Processing and Management Vol.40-No.1
  
  Pages: 145-159
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Bibliographic Attribute Extraction from References Based on Text Recognition Error Model2004
- Author(s)
  Atsuhiro Takasu, Kenro Aihara
- Journal Title
  
  Journal of IEICE, D-II J87-D-II-No.6
  
  Pages: 1298-1308
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] L-F. Recent trends in computational terminology2004
- Author(s)
  Kageura, K., Daille, B., Nakagawa, H., Chien
- Journal Title
  
  Terminology Vol.10-No.1
  
  Pages: 1-21
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Decomposing the Web Graph into Parametarized Connected Components2004
- Author(s)
  Tomonari Masada, Atsuhiro Takasu, Jun Adachi
- Journal Title
  
  IEICE Transactions on Information and Systems E87-D-No.2
  
  Pages: 380-388
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] An Information-Theoretic Perspective of Tf-idf Measuress2003
- Author(s)
  Akiko Aizawa
- Journal Title
  
  Information Processing and Management 39・1
  
  Pages: 45-65
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Evaluation Methods for Web Retrieval Tasks Considering Hyperlink Structure2003
- Author(s)
  Koji Eguchi, Keizo Oyama, Emi Ishida, Noriko Kando, Kazuko Kuriyama
- Journal Title
  
  IEICE Transactions on Information and Systems E86-D・9
  
  Pages: 1804-1813
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] 低頻度語の利用によるテキストの分類性能の改善と評価2003
- Author(s)
  相澤彰子
- Journal Title
  
  情報処理学会論文誌 44・7
  
  Pages: 1720-1730
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] An Information-Theoretic Perspective of Tf-idf Measuress2003
- Author(s)
  Akiko Aizawa
- Journal Title
  
  Information Processing and Management Vol.39-No.1
  
  Pages: 45-65
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Evaluation Methods for Web Retrieval Tasks Considering Hyperlink Structure2003
- Author(s)
  Koji Eguchi, Keizo Oyama, Emi Ishida, Noriko Kando, Kazuko Kuriyama
- Journal Title
  
  IEICE Transactions on Information and Systems E86-D-No.9
  
  Pages: 1804-1813
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Improving the Performance of Text Categorization Using Low Frequency Terms2003
- Author(s)
  Akiko Aizawa
- Journal Title
  
  Journal of Information Processing Society of Japan Vol.44-No.7
  
  Pages: 1720-1730
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] 日英言語横断検索における関連性の重ね合わせモデルの効果2002
- Author(s)
  金沢輝一, 相澤彰子, 高須淳宏, 安達淳
- Journal Title
  
  情報処理学会論文誌「データペース」 43・SIG 2(TOD 13)
  
  Pages: 1-10
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Effectiveness of the Relevance-based Superimposition Model for Cross-language Information Retrieval2002
- Author(s)
  Teruhito Kanazawa, Akiko Aizawa, Atsuhiro Takasu, Jun Adachi
- Journal Title
  
  IPSJ Transactions on Databases Vol.43-STG(TOD 13)
  
  Pages: 1-10
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Chapter 3 : Digital Library - Its Extension and Intention Observed in System Implementations2001
- Author(s)
  Jun Aadchi
- Journal Title
  
  Digital Libraries---Flow of digital information and the future of libraries---(Series : Frontiers in Library and Information Science) Scientific Committee of the Japan Society of Library and Information Science ed. Bensei Publisher
  
  Pages: 71-86
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Chapter 2 : Information media in the electronic age as seen from the point of view of information management2001
- Author(s)
  Kyo Kageura
- Journal Title
  
  Digital Libraries---Flow of digital information and the future of libraries---(Series : Frontiers in Library and Information Science) Scientific Committee of the Japan Society of Library and Information Science ed. Bensei Publisher
  
  Pages: 47-62
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Are Open-domain Question Answering Technologies Useful for Information Access Dialogues? -An Empirical Study and a Proposal of a Novel Challenge
- Author(s)
  Tsuneaki Kato, Jun'ichi Fukumoto, Fumito Masui, Noriko Kando
- Journal Title
  
  ACM Transactions of Asian Language, Information Processing Vol.4-No.3
  
  Pages: 243-262
- Description
  「研究成果報告書概要(欧文)」より
[Book] 『電子図書館-デジタル情報の流通と図書館の未来-』「第三章電子図書館-システムの構築に見るその外延と内包-」(シリーズ図書館情報学のフロンティア)(日本図書館情報学会研究委員会編)2001
- Author(s)
  安逹淳
- Total Pages
  204
- Publisher
  勉誠出版
- Description
  「研究成果報告書概要(和文)」より
[Book] 『電子図書館-デジタル情報の流通と図書館の未来-』「第二章情報管理の前提からみた電子技術時代の資料」(シリーズ図書館情報学のフロンティア)(日本図書館情報学会研究委員会編)2001
- Author(s)
  影浦峡
- Total Pages
  204
- Publisher
  勉誠出版
- Description
  「研究成果報告書概要(和文)」より

2005 Fiscal Year Final Research Report Summary

Study on Information Utilization System for Heterogeneous Contents

Principal Investigator

ADACHI Jun National Institute of Informatics, Software Research Division, Professor, ソフトウェア研究系, 教授 (80143551)

Research Products

[Journal Article] Korean-Japanese Story Link Detection based on Event Term Weighting on Timelines and Multilingual Spaces2006

Author(s)

Journal Title

Description

[Journal Article] An empirical study on retrieval models for different document genres : Patents and newspaper articles2006

Author(s)

Journal Title

Description

[Journal Article] Korean-Japanese Story Link Detection based on Event Term Weighting on Timelines and Multilingual Spaces2006

Author(s)

Journal Title

Description

[Journal Article] An empirical study on retrieval models for different document genres : Patents and newspaper articles2006

Author(s)

Journal Title

Description

[Journal Article] レコード同定問題に関する研究の課題と現状2005

Author(s)

Journal Title

Description

[Journal Article] リンク情報の利用によるWeb検索性能の改善2005

Author(s)

Journal Title

Description

[Journal Article] Cross-language Information Retrieval : the Roard Ahead2005

Author(s)

Journal Title

Description

[Journal Article] Are Open-domain Question Answering Technologies Useful for Information Access Dialogues? - An Empirical Study and a Proposal of a Novel Challenge2005

Author(s)

Journal Title

Description

[Journal Article] 利用者の情報要求を考慮した観点に基づく複数文書要約とその評価2005

Author(s)

Journal Title

Description

[Journal Article] Techniques and Research Trends in Record Linkage Studies2005

Author(s)

Journal Title

Description

[Journal Article] Improving Web search Performance with Hyperlink Information2005

Author(s)

Journal Title

Description

[Journal Article] Cross-language Information Retrieval : the Roard Ahead2005

Author(s)

Journal Title

Description

[Journal Article] Multi-Document Viewpoint Summarization Based on Users' Information Needs and its Evaluation2005

Author(s)

Journal Title

Description

[Journal Article] Implicit Ambiguity Resolution Based on Cluster Analysis in Cross-Language Information Retrieval2004

Author(s)

Journal Title

Description

[Journal Article] テキスト認識エラーモデルによる引用文献文字列からの書誌要素の抽出2004

Author(s)

Journal Title

Description

[Journal Article] Recent trends in computational terminology2004

Author(s)

Journal Title

Description

[Journal Article] Decomposing the Web Graph into Parametarized Connected Components2004

Author(s)

Journal Title

Description

[Journal Article] Implicit Ambiguity Resolution Based on Cluster Analysis in Cross-Language Information Retrieval2004

Author(s)

Journal Title

Description

[Journal Article] Bibliographic Attribute Extraction from References Based on Text Recognition Error Model2004

Author(s)

Journal Title

[Book] 『電子図書館-デジタル情報の流通と図書館の未来-』「第三章電子図書館-システムの構築に見るその外延と内包-」(シリーズ図書館情報学のフロンティア)(日本図書館情報学会研究委員会編)2001

[Book] 『電子図書館-デジタル情報の流通と図書館の未来-』「第二章情報管理の前提からみた電子技術時代の資料」(シリーズ図書館情報学のフロンティア)(日本図書館情報学会研究委員会編)2001