Knowledge Discovery from Databases using Machine Learning and Data Envelopment Analysis and Its Application to Decision Support Systems

Research Project

Project/Area Number	13680460
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Intelligent informatics
Research Institution	Aoyama Gakuin University
Principal Investigator	INAZUMI Hiroshige Aoyama Gakuin University, College of Science and Engineering, Professor, 理工学部, 教授 (00168402)
Project Period (FY)	2001 – 2004
Project Status	Completed (Fiscal Year 2004)
Budget Amount *help	¥3,200,000 (Direct Cost: ¥3,200,000) Fiscal Year 2004: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2003: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2002: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2001: ¥800,000 (Direct Cost: ¥800,000)
Keywords	Data envelop analysis / Machine Learning / Decision Tree / Knowledge discovery / クラスタリング
Research Abstract	Gene expression data is one of the genome data which became available due to mapping. This data is collected by using DNA microarray. DNA microarray technology has now made it possible to monitor the expression levels of thousands of genes simultaneously. Elucidating the patterns hidden in gene expression data offers a tremendous opportunity for an enhanced understanding of functional genornics. However, the large number of genes and the complexity of biological networks greatly increase the challenges of comprehending and interpreting the resulting mass of data. A first step toward addressing this challenge is the use of clustering techniques, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. Gene expression data is meaningful to cluster both genes and samples. The goal of clustering samples is to find the phenotype structures or substructures of the samples. The phenotypes of samples can be discriminat … More ed through only a small subset of genes whose expression levels strongly correlated with the class distinction. These genes are called "informative genes". Before clustering samples, it is essential to select these informative genes from the entire monitored genes. We propose a new clustering method using DEA (Data Envelopment Analysis). DEA solves optimization problems with multiple input/output models, which is commonly used to evaluate the efficiency of a number of Decision Making Units, DMUs, by comparing against a peer directly. We applied DEA to gene expression data using genes as DMUs. Selection of informative genes using DEA collects genes which have different expression patterns to each other. Then applied DEA using samples as DMUs, and clustered them according to DEA results. For example, we tested with the well known Leukemia data, 47 ALL samples and 25 AML samples. Selected informative genes had higher classification accuracy than the genes with high gain ratio, and discovered subclusters of given classes. Sample clustering can identify each cluster's representative sample and their characteristic points, which can be helpful to explain the clusters. We can conclude that DEA clustering has a high explanation capability. Our future work is to consider about combining DEA clustering with other clustering algorithms, and the application to time-series data Less

Report

(5 results)

2004 Annual Research Report Final Research Report Summary
2003 Annual Research Report
2002 Annual Research Report
2001 Annual Research Report

Research Products
(43 results)

All 2005 2004 2003 2002 2001 Other

All Journal Article (29 results) Publications (14 results)

[Journal Article] A Step towards GUI-based Graph Mining using Multiple GBI-for Chemical Database-2005
- Author(s)
  Eitaro Tanaka, Hiroshige Inazumi
- Journal Title
  
  The Japanese Society of Artificial Intelligence, Special Interest Group on Knowledge Based Systems SIG-KBS-A405
  
  Pages: 1-6
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards Graph Clustering based on Inclusion Relations among a set of Subgraphs2005
- Author(s)
  Akiko Hayami, Hiroshige Inazumi
- Journal Title
  
  The Japanese Society of Artificial Intelligence, Special Interest Group on Knowledge Based Systems SIG-KBS-A405
  
  Pages: 81-86
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards New Clustering Algorithm using DEA- for Gene Expression Data-2005
- Author(s)
  Masako Hoshino, Hiroyuki Oono, Hiroshige Inauzmi
- Journal Title
  
  The 19^<th> Annual Conference of Japan Society of Artificial Intelligence 3A1-01
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards Visual Graph Clustering based on Inclusion Measure among a Set of Subgraphs2005
- Author(s)
  Yuichiro Ishii, Eitaro Tanaka, Akiko Hayami, Hiroyuki Oono, Hiroshige Inazumi
- Journal Title
  
  The 20^<th> Annual Conference of Japan Society of Artificial Intelligence 3C1-03
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] GBI(Graph-Based Induction)法の拡張とGUIによるグラフマイニング支援環境の構築-化学物質を対象として-2005
- Author(s)
  田中栄太郎, 稲積宏誠
- Journal Title
  
  人工知能学会知識ベースシステム研究会合同研究会
  
  Pages: 1-6
- Related Report
  2004 Annual Research Report
[Journal Article] 部分構造の包含関係を指標とするグラフクラスタリングの提案-化学物質を対象として-2005
- Author(s)
  速水亜希子, 稲積宏誠
- Journal Title
  
  人工知能学会知識ベースシステム研究会合同研究会
  
  Pages: 81-86
- Related Report
  2004 Annual Research Report
[Journal Article] The substructure extraction form Molecules by Extension of the GBI method2004
- Author(s)
  Eitaro Tanaka, Akiko Hayami, Hiroshige Inauzmi
- Journal Title
  
  The 66^<th> National Convention of Information Processing Society of Japan vol.3
  
  Pages: 171-172
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Decision Tree for the Example Consisting of Attributes with Partial Order Structures and Application to Molecule Database2004
- Author(s)
  Akiko Hayami, Eitaro Tanaka, Hiroshige Inazumi
- Journal Title
  
  The 66^<th> National Convention of Information Processing Society of Japan vol.3
  
  Pages: 175-176
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards Gene Expression Data Analysis Using DEA2004
- Author(s)
  Masako Hoshino, Hiroshige Inazumi
- Journal Title
  
  The 66^<th> National Convention of Information Processing Society of Japan vol.3
  
  Pages: 177-178
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards Feature Extraction from Nucleotide Sequence Database using Series-Compacting Method2004
- Author(s)
  Yasunari Kurokawa, Eitaro Tanaka, Hiroshige Inazumi
- Journal Title
  
  The Japanese Society of Artificial Intelligence, Special Interest Group on Knowledge Based Systems SIG-KBS-A304
  
  Pages: 159-164
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] The Substructure extraction form Molecules by Extension of the GBI Method2004
- Author(s)
  Eitaro Tanaka, Hiroshige Inazumi
- Journal Title
  
  The Japanese Society of Artificial Intelligence, Special Interest Group on Knowledge Based Systems SIG-KBS-A304
  
  Pages: 171-176
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards Gene Expression Data Analysis using the Method of Data Envelopment Analysis(DEA)2004
- Author(s)
  Masako Hoshino, Hiroshige Inauzmi
- Journal Title
  
  The Japanese Society of Artificial Intelligence, Special Interest Group on Knowledge Based Systems SIG-KBS-A304
  
  Pages: 151-157
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Step towards Substructure Exploration from Gene Expression Patterns2004
- Author(s)
  Masako Hoshino, Hiroshige Inauzmi
- Journal Title
  
  The 15^<th> International Conference on Genome Informatics GIW2004
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] マウス肝発ガン初期過程における遺伝子発現解析用Oligonucleotide Microarrayの開発2004
- Author(s)
  戸田香織, 原田基裕, 仲地豊, 近藤恭光, 中島圓, 浜田修一, 鈴木孝昌, 兵庫淳志, 星埜雅子, 田代英夫, 榊佳之, 伊藤尚, 稲積宏誠, 降旗千恵
- Journal Title
  
  第27回日本分子生物学会年会
  
  Pages: 269-269
- Related Report
  2004 Annual Research Report
[Journal Article] A Step Towards Substructure Exploration from Gene Expression Patterns,2004
- Author(s)
  Masako Hoshino, Hiroshige Inazumi
- Journal Title
  
  GIW2004
- Related Report
  2004 Annual Research Report
[Journal Article] 部分構造情報を用いた新規化合物生成支援ツールの開発2004
- Author(s)
  石井雄一郎, 田中栄太郎, 速水亜希子, 穂積宏誠
- Journal Title
  
  コンピュータ化学会2004春年会
  
  Pages: 1003-1003
- Related Report
  2004 Annual Research Report
[Journal Article] 化学構造データベースからの有効な部分構造抽出法に関する考察2003
- Author(s)
  田中栄太朗
- Journal Title
  
  情報処理学会第65回全国大会講演論文集 3
  
  Pages: 137-138
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 化学物質の部分構造とその包含関係からの知識発見2003
- Author(s)
  橋本桂
- Journal Title
  
  情報処理学会第65回全国大会講演論文集 3
  
  Pages: 139-140
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 抗菌活性ジテルペンのデータマイニング2003
- Author(s)
  稲積宏誠
- Journal Title
  
  日本化学会第82回春季大会 3
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 化学構造情報を用いた知識発見と知識表現に関する考察2003
- Author(s)
  速水亜希子
- Journal Title
  
  情報処理学会第65回全国大会講演論文集 3
  
  Pages: 141-142
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 化学物質の性質を決定する特徴的な部分的特徴発見の試み2003
- Author(s)
  田中栄太朗
- Journal Title
  
  コンピュータ化学会2003春年会 1O02
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Finding Effective Substructures of Molecules from Chemical Database2003
- Author(s)
  Eitaro Tanaka, Tetsuo Tsuda, Hiroshige Inazumi
- Journal Title
  
  The 65^<th> National Convention of Information Processing Society of Japan vol.3
  
  Pages: 137-138
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Knowledge Discovery from Substructures and the Inclusive Relations of Molecules2003
- Author(s)
  Katsura Hashimoto, Tetsuo Tsuda, Hiroshige Inazumi
- Journal Title
  
  The 65^<th> National Convention of Information Processing Society of Japan vol.3
  
  Pages: 139-140
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Knowledg Disocovery and the Knowledge Representation from Substructures of Molecules2003
- Author(s)
  Akiko Hayami, Eitaro Tanaka, Hiroshige Inazumi
- Journal Title
  
  The 65^<th> National Convention of Information Processing Society of Japan vo1.3
  
  Pages: 141-142
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 複合属性による領域分割を用いた決定木DTMACC2002
- Author(s)
  稲積宏誠
- Journal Title
  
  人工知能学会論文誌第17巻,第1号
  
  Pages: 44-52
- NAID
  10015770640
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] DTMACC : Decision Trees with Multiple Attributes Concept Clustering2002
- Author(s)
  Yusuke Kushi, Hiroshige Inazumi
- Journal Title
  
  Transactions of The Japanese Society of Artificial Intelligence vol.17,no.1
  
  Pages: 44-52
- NAID
  10015770640
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 包絡分析法と遺伝的アルゴリズムによる事例ベース意思決定支援モデル2001
- Author(s)
  稲積宏誠
- Journal Title
  
  情報処理学会研究会論文誌 : 数理モデル化と応用 Vol.42,No.SIG5(TOM4)
  
  Pages: 89-98
- NAID
  110002725841
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 複合属性を含む決定木生成アルゴリズムによる確率分布からの命題抽出2001
- Author(s)
  稲積宏誠
- Journal Title
  
  人工知能学会全国大会(第13回)論文集 2D2-01
- NAID
  130005022972
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A New Scheme of Case-based Decision Support Systems by Using DEA and GA Techniwues2001
- Author(s)
  Hiroshige Inzumi, Ken-ichiro Suzuki, Kazuya Kusumoto
- Journal Title
  
  The Information Processing Society of Japan, Transactions on Mathematical Modeling and its Applications Vol.42,No.SIG5(TOM4)
  
  Pages: 89-98
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Publications] 田中栄太朗, 速水亜希子, 稲積宏誠: "化学物質の性質を決定する特徴的な部分的特徴発見の試み"コンピュータ化学会2003春年会. 1O02(CD-ROM). (2003)
- Related Report
  2003 Annual Research Report
[Publications] 田中栄太朗, 稲積宏誠: "GBI(Graph-Based Induction)法の拡張による化学物質からの部分構造抽出方法の検討"庵報処理学会第66回全国大会講演論文集. 2V-2. 171-172 (2004)
- Related Report
  2003 Annual Research Report
[Publications] 速水亜希子, 田中栄太朗, 稲積宏誠: "半順序構造をもつ属性からなる事例への決定木適用と化学物質分析への応用"情報処理学会第66回全国大会講演論文集. 2V-4. 175-176 (2004)
- Related Report
  2003 Annual Research Report
[Publications] 星埜雅子, 稲積宏誠: "包絡分析法を用いた遺伝子発現データ解析の試み"情報処理学会第66回全国大会講演論文集. 2V-3. 173-174 (2004)
- Related Report
  2003 Annual Research Report
[Publications] 田中栄太朗, 稲積宏誠: "GBI(Graph-Based Induction))法の拡張による化学物質からの部分構造抽出方法の検討"人工知能学会人工知能基礎論研究会・知識ベースシステム研究会合同研究会. 171-176 (2004)
- Related Report
  2003 Annual Research Report
[Publications] 星埜雅子, 稲積宏誠: "遺伝子発現データに対する包絡分析法の適用"人工知能学会人工知能基礎論研究会・知識ベースシステム研究会合・同研究会. 151-157 (2004)
- Related Report
  2003 Annual Research Report
[Publications] 黒川泰成, 田中栄太朗, 稲積宏誠: "系列縮約による塩基配列群からの特徴抽出の試み"人工知能学会人工知能基礎論研究会・知識ベースシステム研究会合同研究会. 159-164 (2004)
- Related Report
  2003 Annual Research Report
[Publications] 田中栄太朗, 津田哲夫, 吉澤有美, 稲積宏誠: "化学構造データベースからの有効な部分構造抽出法に関する考察"情報処理学会第65回全国大会講演論文集. (未定). (2003)
- Related Report
  2002 Annual Research Report
[Publications] 橋本桂, 津田哲夫, 吉澤有美, 稲積宏誠: "化学物質の部分構造とその包含関係からの知識発見"情報処理学会第65回全国大会講演論文集. (未定). (2003)
- Related Report
  2002 Annual Research Report
[Publications] 速水亜希子, 田中栄太朗, 吉澤有美, 稲積宏誠: "化学構造情報を用いた知識発見と知識表現に関する考察"情報処理学会第65回全国大会講演論文集. (未定). (2003)
- Related Report
  2002 Annual Research Report
[Publications] 稲積宏誠, 田中栄太朗, 上條めぐみ, 高田由貴, 木村純二: "抗菌活性ジテルペンのデータマイニング"日本化学会第82回春季大会(研究発表). (2003)
- Related Report
  2002 Annual Research Report
[Publications] 稲積宏誠, 鈴木賢一郎, 楠村和哉: "包絡分析法と遺伝的アルゴリズムによる事例ベース意思決定支援モデル"情報処理学会研究会論文誌:数理モデル化と応用. SIG5(TOM4). 89-98 (2001)
- Related Report
  2001 Annual Research Report
[Publications] 稲積宏誠, 櫛雄介, 吉澤有美: "複合属性を含む決定木生成アルゴリズムによる確率分析からの命題抽出"人工知能学会全国大会(第13回)論文集(CD-ROM). 2D2-01. (2001)
- Related Report
  2001 Annual Research Report
[Publications] 櫛雄介, 稲積宏誠: "複合属性による領域分割を用いた決定木:DTMACC"人工知能学会論文誌. 17-1. 44-52 (2002)
- Related Report
  2001 Annual Research Report

Knowledge Discovery from Databases using Machine Learning and Data Envelopment Analysis and Its Application to Decision Support Systems

Principal Investigator

INAZUMI Hiroshige Aoyama Gakuin University, College of Science and Engineering, Professor, 理工学部, 教授 (00168402)

¥3,200,000 (Direct Cost: ¥3,200,000)

Report

Research Products

[Journal Article] A Step towards GUI-based Graph Mining using Multiple GBI-for Chemical Database-2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards Graph Clustering based on Inclusion Relations among a set of Subgraphs2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards New Clustering Algorithm using DEA- for Gene Expression Data-2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards Visual Graph Clustering based on Inclusion Measure among a Set of Subgraphs2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] GBI(Graph-Based Induction)法の拡張とGUIによるグラフマイニング支援環境の構築-化学物質を対象として-2005

Author(s)

Journal Title

Related Report

[Journal Article] 部分構造の包含関係を指標とするグラフクラスタリングの提案-化学物質を対象として-2005

Author(s)

Journal Title

Related Report

[Journal Article] The substructure extraction form Molecules by Extension of the GBI method2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] Decision Tree for the Example Consisting of Attributes with Partial Order Structures and Application to Molecule Database2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards Gene Expression Data Analysis Using DEA2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards Feature Extraction from Nucleotide Sequence Database using Series-Compacting Method2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] The Substructure extraction form Molecules by Extension of the GBI Method2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards Gene Expression Data Analysis using the Method of Data Envelopment Analysis(DEA)2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Step towards Substructure Exploration from Gene Expression Patterns2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] マウス肝発ガン初期過程における遺伝子発現解析用Oligonucleotide Microarrayの開発2004

Author(s)

Journal Title

Related Report

[Journal Article] A Step Towards Substructure Exploration from Gene Expression Patterns,2004

Author(s)

Journal Title

Related Report

[Journal Article] 部分構造情報を用いた新規化合物生成支援ツールの開発2004

Author(s)

Journal Title