大規模知識処理のための技術統合に関する研究開発

研究課題

研究課題/領域番号	12680373
研究種目	基盤研究(C)
配分区分	補助金
応募区分	一般
研究分野	知能情報学
研究機関	北陸先端科学技術大学院大学
研究代表者	佐藤賢二北陸先端科学技術大学院大学, 知識科学研究科, 助教授 (10215783)
研究期間 (年度)	2000 – 2001
研究課題ステータス	完了 (2001年度)
配分額 *注記	3,500千円 (直接経費: 3,500千円) 2001年度: 1,000千円 (直接経費: 1,000千円) 2000年度: 2,500千円 (直接経費: 2,500千円)
キーワード	ゲノムデータベース / サーチエンジン / データマイニング / 相関ルール発見 / 専門用語 / オントロジー
研究概要	本研究では、ゲノムデータベースに大量に蓄積されているテキスト情報を対象に、サーチエンジンとデータマイニングの技術を組み合わせ、統合することにより、利用者の知識発見を支援するシステムを構築することを試みた。まず、一般的なサーチエンジンの運用中に構築されるインデックス情報を知職発見の源泉とみなし、これを用いて類似文書のクラスタリングを行うことを試みた。その結果、ゲノムデータベースのように多様な専門用語を含む文書に対しては、類似性の判定に用いるキーワードを専門用語に限定する必要があることが分かった。しかし、一般に専門用語は複数のワードから成るため、その存在は直接的にはインデックスに表れない。これを解決するために、ゲノムデータベースから専門用語らしき部分を抽出し、その出現情報とターム間の包含関係を解析して、一種のオントロジーを構築した。さらに、ゲノムデータベース中に存在するリンク情報と、オントロジーが提供する言語情報を用いて、サーチエンジンの検索結晶集合の意味を高速かつ容易に提示するデータマイニングシステムを構築した。システム構築に際しては、ゲノムデータベースからの全文検索システムに相関ルール発見機能を導入することにより、利用者が着目している検索結果集合に共通かつ特有なリンク情報や言語情報を提示することができた。相関ルール発見については、当該の集合に関係するリンク情報や言語情報だけを高速に切り出し、冗長な情報をコンパクトにまとめた上でマイニングを行うことにより、大規模なゲノムデータベースに対し、Web上でも十分な応答速度で知識発見サービスを行うことが可能になった。さらに、マイニングの結果を単にリスト表示するのではなく、二次元の表の形で表示することにより、ユーザが着目するエントリ集合の意味を把握しやすくした。具体的には、マイニングの結果として得られるルールの重要度に従って各アイテム)とエントリ集合をソートすることにより、着目するエントリ集合がいくつかのグループやサブグループに分かれることを視覚的に表現した。

報告書

(3件)

2001 実績報告書研究成果報告書概要
2000 実績報告書

研究成果
(13件)

すべてその他

すべて文献書誌 (13件)

[文献書誌] K.Satou et al.: "A Framework for Quick-and-Pinpoint Data Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies (KES'2001). Part1. 773-777 (2001)
- 説明
  「研究成果報告書概要(和文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T.Naitou et al.: "A System for Finding Association Rules from Microarray Data Public Databases"Genome Informatics 2000. 356-357 (2000)
- 説明
  「研究成果報告書概要(和文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T.Oyama et al.: "Mining Association Rules Related to Protein-Protein Interactions"Genome Informatics 2000. 358-359 (2000)
- 説明
  「研究成果報告書概要(和文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T.Yagyuu et al.: "Toward Automatic Construction of Extensional Ontology from Genome Databases"Genome Informatics 2000. 442-443 (2000)
- 説明
  「研究成果報告書概要(和文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T.Sakai, et al.: "Toward Automatic Recognition of Field Description Syntax and Parser Generation for Genome Databases"Genome Informatics 2000. 444-445 (2000)
- 説明
  「研究成果報告書概要(和文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] K. Satou, Y. Fuseda, A. Konagaya, and T. Takagi.: "A Framework for Quick-and-Pinpoint Data Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies (KES'2001). Part 1. 773-777 (2001)
- 説明
  「研究成果報告書概要(欧文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T. Naitou and K. Satou: "A System for Finding Association Rules from Microarray Data and Public Databases"Genome Informatics. 2000. 356-357 (2000)
- 説明
  「研究成果報告書概要(欧文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T. Oyama, K. Kitano, K. Satou, and T. Ito: "Mining Association Rules Related to Protein-Protein Interactions"Genome Informatics. 2000. 358-359 (2000)
- 説明
  「研究成果報告書概要(欧文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T. Yagyuu and K. Satou: "Toward Automatic Construction of Extensional Ontology from Genome Databases"Genome Informatics 2000. 442-443 (2000)
- 説明
  「研究成果報告書概要(欧文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] T. Sakai and K. Satou: "Toward Automatic Recognition of Field Description Syntax and Parser Generation for Genome Databases"Genome Informatics. 2003. 444-445 (2000)
- 説明
  「研究成果報告書概要(欧文)」より
- 関連する報告書
  2001 研究成果報告書概要
[文献書誌] 佐藤賢二: "A Framework for Quick-and Pinpoint Date Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies. 1. 773-777 (2001)
- 関連する報告書
  2001 実績報告書
[文献書誌] 柳生拓也: "Toward Automatic Construction of Extensional Ontology from Genome Databases"Genome Informatics Workshop 2000. 442-443 (2000)
- 関連する報告書
  2000 実績報告書
[文献書誌] 坂井武夫: "Toward Automatic Recognition of Field Description Syntax and Parser Generation for Genome Databases"Genome Informatics Workshop 2000. 444-445 (2000)
- 関連する報告書
  2000 実績報告書

大規模知識処理のための技術統合に関する研究開発

研究代表者

佐藤 賢二 北陸先端科学技術大学院大学, 知識科学研究科, 助教授 (10215783)

3,500千円 (直接経費: 3,500千円)

報告書

研究成果

[文献書誌] K.Satou et al.: "A Framework for Quick-and-Pinpoint Data Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies (KES'2001). Part1. 773-777 (2001)

説明

関連する報告書

[文献書誌] T.Naitou et al.: "A System for Finding Association Rules from Microarray Data Public Databases"Genome Informatics 2000. 356-357 (2000)

説明

関連する報告書

[文献書誌] T.Oyama et al.: "Mining Association Rules Related to Protein-Protein Interactions"Genome Informatics 2000. 358-359 (2000)

説明

関連する報告書

[文献書誌] T.Yagyuu et al.: "Toward Automatic Construction of Extensional Ontology from Genome Databases"Genome Informatics 2000. 442-443 (2000)

説明

関連する報告書

[文献書誌] T.Sakai, et al.: "Toward Automatic Recognition of Field Description Syntax and Parser Generation for Genome Databases"Genome Informatics 2000. 444-445 (2000)

説明

関連する報告書

[文献書誌] K. Satou, Y. Fuseda, A. Konagaya, and T. Takagi.: "A Framework for Quick-and-Pinpoint Data Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies (KES'2001). Part 1. 773-777 (2001)

説明

関連する報告書

[文献書誌] T. Naitou and K. Satou: "A System for Finding Association Rules from Microarray Data and Public Databases"Genome Informatics. 2000. 356-357 (2000)

説明

関連する報告書

[文献書誌] T. Oyama, K. Kitano, K. Satou, and T. Ito: "Mining Association Rules Related to Protein-Protein Interactions"Genome Informatics. 2000. 358-359 (2000)

説明

関連する報告書

[文献書誌] T. Yagyuu and K. Satou: "Toward Automatic Construction of Extensional Ontology from Genome Databases"Genome Informatics 2000. 442-443 (2000)

説明

関連する報告書

[文献書誌] T. Sakai and K. Satou: "Toward Automatic Recognition of Field Description Syntax and Parser Generation for Genome Databases"Genome Informatics. 2003. 444-445 (2000)

説明

関連する報告書

[文献書誌] 佐藤 賢二: "A Framework for Quick-and Pinpoint Date Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies. 1. 773-777 (2001)

関連する報告書

[文献書誌] 柳生拓也: "Toward Automatic Construction of Extensional Ontology from Genome Databases"Genome Informatics Workshop 2000. 442-443 (2000)

関連する報告書

[文献書誌] 坂井武夫: "Toward Automatic Recognition of Field Description Syntax and Parser Generation for Genome Databases"Genome Informatics Workshop 2000. 444-445 (2000)

関連する報告書

佐藤賢二北陸先端科学技術大学院大学, 知識科学研究科, 助教授 (10215783)

[文献書誌] 佐藤賢二: "A Framework for Quick-and Pinpoint Date Mining and its Application to Heterogeneous Genome Databases"Knowledge-Based Intelligent Information Engineering Systems & Allied Technologies. 1. 773-777 (2001)