A study on multi-lingual information retrieval with structured index

Research Project

Project/Area Number	11680432
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Information systems (including information and library science)
Research Institution	National Institute of Informatics
Principal Investigator	ADACHI Jun National Institute of Informatics, Research Center for Information Resources, Director (80143551)
Co-Investigator(Kenkyū-buntansha)	TAKASU Atsuhiro National Institute of Informatics, Software Research Division, Associate Professor (90216648)
Project Period (FY)	1999 – 2001
Project Status	Completed (Fiscal Year 2001)
Budget Amount	¥3,800,000 (Direct Cost: ¥3,800,000) Fiscal Year 2001: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2000: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 1999: ¥2,200,000 (Direct Cost: ¥2,200,000)
Keywords	information retrieval / structured index / morphological analysis / test collection / cross-lingual retrieval / binary tree / dependency relationship between words / dependency relation (kakari-uke)
Research Abstract	In this research, we investigated the potential of a novel information retrieval method called "Structured Index." In this method, we generate an index represented as a binary tree, which is created through dependency analysis between the words that compose the titles of scientific papers. This method is expected to outperform conventional keyword-based information retrieval methods because such an index should match the intention embedded in users' queries more closely. Furthermore, the method may be better suited to cross-lingual information retrieval, since the index is more concept-oriented. First, we built a basic software system that performs Japanese morphological analysis of paper titles and dependency analysis between words. Based on these analyses, we investigated (1) an appropriate method for index structuring and (2) a general algorithm for retrieval processing. After this preliminary work, we built a practical retrieval system, which was applied to NTCIR, one of the largest Japanese test collections. We built a structured index for the title and abstract fields, and the evaluation showed that our new method outperforms conventional methods. We also designed an approach for applying our method to English information retrieval. For cross-lingual information retrieval, another method that we proposed concurrently, the "Relevance Superimposition (RS) Model," showed better performance, so we chose the RS model for our cross-lingual information retrieval. Experiments on the test collections showed that our system achieved better retrieval performance on both the Japanese and English collections. The software we implemented is composed of two parts, i.e., a language-independent part and a language-dependent part. This structure facilitates further development to extend the functionality to other languages.
We also built a web-based user interface for information retrieval to demonstrate our research achievements.
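
The idea of a binary-tree index built from word dependencies can be illustrated with a minimal sketch. This is not the project's implementation: the names `Node`, `build_tree`, `index_keys`, and `score` are hypothetical, the dependency heads are supplied by hand rather than produced by a morphological and dependency analyzer, and the scoring (a plain count of shared subtrees) is an assumption chosen only to show how word order and dependency structure can affect matching.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass(frozen=True)
class Node:
    """Binary-tree node: a leaf holds a word; an inner node joins modifier -> head."""
    word: Optional[str] = None
    modifier: Optional["Node"] = None
    head: Optional["Node"] = None

    def key(self) -> str:
        """Canonical string for this subtree, used as an index key."""
        if self.word is not None:
            return self.word
        return f"({self.modifier.key()} {self.head.key()})"


def build_tree(words: List[str], heads: List[int]) -> Node:
    """Build a binary tree from dependency links.

    heads[i] is the index of the word that words[i] modifies, or -1 for the root.
    (Hypothetical input format; a real system would obtain it from a parser.)
    """
    def subtree(i: int) -> Node:
        node = Node(word=words[i])
        # Fold each dependent of word i into the growing subtree, in word order.
        for j, h in enumerate(heads):
            if h == i:
                node = Node(modifier=subtree(j), head=node)
        return node

    return subtree(heads.index(-1))


def index_keys(tree: Node) -> Set[str]:
    """Collect the keys of every subtree (single words and dependency pairs)."""
    keys = {tree.key()}
    if tree.word is None:
        keys |= index_keys(tree.modifier) | index_keys(tree.head)
    return keys


def score(query: Node, doc: Node) -> int:
    """Assumed scoring: number of subtrees shared by the query and the document."""
    return len(index_keys(query) & index_keys(doc))


doc_a = build_tree(["information", "retrieval", "method"], [1, 2, -1])
doc_b = build_tree(["retrieval", "information"], [1, -1])
query = build_tree(["information", "retrieval"], [1, -1])
print(doc_a.key(), score(query, doc_a), score(query, doc_b))
```

In this sketch both documents contain the words of the query, but only `doc_a` also contains the dependency "information modifies retrieval", so it receives the higher score. A keyword-only match would not distinguish the two.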

Report

(4 results)

2001 Annual Research Report Final Research Report Summary
2000 Annual Research Report
1999 Annual Research Report

Research Products
(21 results)

All Publications (21 results)

[Publications] Matsumura, Atsushi: "The Effect of Information Retrieval Method Using Dependency Relationship Between Words"Conference Proceedings of RIAO2000, Paris, France. 1043-1058 (2000)
- Description
  From the Summary of Research Results Report (Japanese)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Effect of Dependency Relationship and Ordered Co-occurrence of Words on Japanese Information Retrieval"The Proceedings of Fifth International Workshop on Information Retrieval with Asian Languages IRAL2000, Hong Kong. 199-200 (2000)
- Description
  From the Summary of Research Results Report (Japanese)
- Related Report
  2001 Final Research Report Summary
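[Publications] Matsumura, Atsushi: "Performance evaluation of structured index in full-text retrieval (in Japanese)"IPSJ SIG Notes. 2000-DBS-1 22. 353-360 (2000)
- Description
  From the Summary of Research Results Report (Japanese)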
- Related Report
  2001 Final Research Report Summary
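[Publications] Matsumura, Atsushi: "Retrieval of abstracts using ordered co-occurrence and dependency relationships between words (in Japanese)"60th IPSJ National Convention. 3U-7. (2000)
- Description
  From the Summary of Research Results Report (Japanese)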
- Related Report
  2001 Final Research Report Summary
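[Publications] Matsumura, Atsushi: "Effect of relationships between words in information retrieval (in Japanese)"IPSJ SIG Notes on Database Systems. 125-34. 683-691 (2001)
- Description
  From the Summary of Research Results Report (Japanese)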
- Related Report
  2001 Final Research Report Summary
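[Publications] Adachi, Jun: "Digital library system of document images focusing on metadata (in Japanese)"Trans. of IEICE D-I. Vol.J84-D-I, No.6. 257-264 (2001)
- Description
  From the Summary of Research Results Report (Japanese)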
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "The effect of information retrieval method using dependency relationship between words"The conference proceedings of RIAO2000, Paris, France. 1043-1058 (2000)
- Description
  From the Summary of Research Results Report (English)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Effect of dependency relationship and ordered co-occurrence of words on Japanese information retrieval"The proceedings of 5th International workshop on information retrieval with Asian languages IRAL2000, Hong Kong. 199-200 (2000)
- Description
  From the Summary of Research Results Report (English)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Performance evaluation of structured index in full-text retrieval (in Japanese)"IPSJ SIG Notes. Vol. 2000-DBS-1, No. 22. 353-360 (2000)
- Description
  From the Summary of Research Results Report (English)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Retrieval of abstracts using ordered co-occurrence and dependency relationship between words (in Japanese)"Proceedings of the 60th IPSJ Annual Conference. 3U-7 (2000)
- Description
  From the Summary of Research Results Report (English)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Effect of dependency relationship between words in information retrieval (in Japanese)"IPSJ SIG Notes. Vol. 125, No. 34. 683-691 (2000)
- Description
  From the Summary of Research Results Report (English)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Digital library system of document images focusing on metadata (in Japanese)"Trans. of IEICE. Vol. J84-D-I, No.6. 768-776 (2001)
- Description
  From the Summary of Research Results Report (English)
- Related Report
  2001 Final Research Report Summary
[Publications] Matsumura, Atsushi: "Effect of relationships between words in information retrieval (in Japanese)"IPSJ SIG Notes on Database Systems. 125-34. 257-264 (2001)
- Related Report
  2001 Annual Research Report
[Publications] Adachi, Jun: "Digital library system of document images focusing on metadata (in Japanese)"Trans. of IEICE D-I. Vol.J84-D-I, No.6. 768-776 (2001)
- Related Report
  2001 Annual Research Report
[Publications] Matsumura, Atsushi: "The Effect of Information Retrieval Method Using Dependency Relationship Between Words"Conference Proceedings of RIAO2000, Paris, France. 1043-1058 (2000)
- Related Report
  2000 Annual Research Report
[Publications] Matsumura, Atsushi: "Effect of Dependency Relationship and Ordered Co-occurrence of Words on Japanese Information Retrieval"The Proceedings of Fifth International Workshop on Information Retrieval with Asian Languages IRAL2000, Hong Kong. 199-200 (2000)
- Related Report
  2000 Annual Research Report
[Publications] Matsumura, Atsushi: "Performance evaluation of structured index in full-text retrieval (in Japanese)"IPSJ SIG Notes. 2000-DBS-122. 353-360 (2000)
- Related Report
  2000 Annual Research Report
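[Publications] Matsumura, Atsushi: "Retrieval of abstracts using ordered co-occurrence and dependency relationships between words (in Japanese)"60th IPSJ National Convention. 3U-7. (2000)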
- Related Report
  2000 Annual Research Report
[Publications] A. Matsumura, J. Adachi, A. Takasu: "Structured Index System at NTCIR1"Proceedings of the 1st NTCIR Workshop. 117-122 (1999)
- Related Report
  1999 Annual Research Report
[Publications] A. Matsumura, A. Takasu, J. Adachi: "Structured Index at IREX"Proceedings of the IREX Workshop. 57-60 (1999)
- Related Report
  1999 Annual Research Report
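[Publications] Matsumura, Atsushi; Takasu, Atsuhiro; Adachi, Jun: "Evaluation of an information retrieval method using dependency relationships between words (in Japanese)"IPSJ Transactions on Databases. 41,SIG1(TOD5). 22-30 (2000)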
- Related Report
  1999 Annual Research Report
