Development of Language Information System and Systematic Extraction of Latent Knowledge, Based on Graph Computation of Semantic Network

Research Project

Project/Area Number	18500192
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	情報図書館学・人文社会情報学
Research Institution	Tokyo Institute of Technology
Principal Investigator	AKAMA Hiroyuki Tokyo Institute of Technology, Graduate School of Decision Science and Technology, Associate Professor (60242301)
Co-Investigator(Kenkyū-buntansha)	NISHINA Kikuko Tokyo Institute of Technology, International Student Center, Professor (40198479) SHIMIZU Yumiko Musashi Institute of Technology, Faculty of Environmental and Information Studies, Assistant Professor (30298020) MIYAKE Maki Osaka University, Department of Language and Culture, Assistant Professor (80448018)
Project Period (FY)	2006 – 2007
Project Status	Completed (Fiscal Year 2007)
Budget Amount *help	¥3,200,000 (Direct Cost: ¥2,900,000、Indirect Cost: ¥300,000) Fiscal Year 2007: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000) Fiscal Year 2006: ¥1,900,000 (Direct Cost: ¥1,900,000)
Keywords	Semantic Network / Graph Theory / Latent Knowledge / Educational Information System / Hidden Knowledge
Research Abstract	We developed a new method of solving the cluster-size imbalance problem observed when documents and corpora are processed with MCL. The Branching MCL (BMCL) or the latent adjacency matrix can resize overly inclusive Markov clusters (core clusters) into appropriate subsets. This method is applied to a semantic network built from the large-scale corpus of Gakken's Large Dictionary of Japanese (GLDJ), covering 100,000 words, definitions, examples, and grammatical explanations. The effectiveness of these techniques is currently being tested by creating a clustered semantic network for the GLDJ. As the applications of the graph clustering to the field of Humanities, we made the semantic networks from the lexical co-occurrence data of some historical documents or novels : the books of two contemporary thinkers, Cabanis and Mesmer, to measure the similarity of thinking between them ; the very famous novel of Saint-Exupery, "Le petit prince" to objectively propose a method of word sense disambiguation applicable to his enigmatic word usage. In this study we proposed as an alternative to the keyword-based clustering a new windowing method called Incrementally Advancing Window (TAW) that generates co-occurring word pairs that can be used as inputs to the Incremental Routing Algorithm. The results of the MCL applied to co-occurrence and/or adjacency data matrices were evaluated by using the indexes as weighted curvature, modularity Q and F measure.

Report

(3 results)

2007 Annual Research Report Final Research Report Summary
2006 Annual Research Report

Research Products
(42 results)

All 2008 2007 2006 Other

All Journal Article (17 results) (of which Peer Reviewed: 4 results) Presentation (24 results) Remarks (1 results)

[Journal Article] L'elaboration d'un reseau semantique par le raffinement du Markov Clustering -A partir des donnees lexicales du roman de Saint-Exupery, 《Lepetit prince》2008
- Author(s)
  Hiroyuki Akama, Maki Miyake, Jaeyoung Jung
- Journal Title
  
  Actes des 9es Journees internationales d'Analyse Statistique des Donnees Textuelles, Lyon, Presses Universitaires de Lyon 1
  
  Pages: 57-68
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
- Peer Reviewed
[Journal Article] Jaeyoung Jung L'elaboration d'un reseau semantique par le raffinement du Markov Clustering -A partir des donnees lexicales du roman de Saint-Exupery,《Le petit prince》2008
- Author(s)
  Hiroyuki, Akama, Maki, Miyake
- Journal Title
  
  Actes des 9es Journees internationales d'Analyse Statistique des Donnees Textuelles, Lyon, Presses Universitaires de Lyon
  
  Pages: 57-68
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Journal Article] L'elaboration d'un reseau semantique par le raffinement du Markov Clustering-A partir des donnees lexicales du roman de Saint-Exupery,≪Lepetit prince≫2008
- Author(s)
  Hiroyuki Akama, Maki Miyake, Jaeyoung Jung
- Journal Title
  
  Actes des 9es Journees internationales d'Analyse Statistique des Donnees Textuelles, Lyon, Presses Universitaires de Lyon 1
  
  Pages: 57-68
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] 赤間啓之、コンピュータと計量的手法を利用した文学・思想作品の研究2007
- Author(s)
  赤間啓之
- Journal Title
  
  フランス語学研究 41
  
  Pages: 82-86
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Journal Article] ネットワーク分析におけるテキストマイニング-クラスタリング係数、Markov Clusteringを中心として2007
- Author(s)
  三宅真紀
- Journal Title
  
  言語文化研究プロジェクト、電子化言語資料分析研究
  
  Pages: 3-16
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Journal Article] Research of the Works of Literature and Ideas by Using Computers and Quantitative Analysis (Japanese)2007
- Author(s)
  Hiroyuki, Akama
- Journal Title
  
  Studies on French Linguistics 41
  
  Pages: 82-86
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Journal Article] Text Mining in Network Analysis-With a Central Focus on Clustering Coefficient and Markov Clustering(Japanese)2007
- Author(s)
  Maki, Miyake
- Journal Title
  
  Research Project on Language and Culture, Analytical Research of Electronic Documents
  
  Pages: 3-16
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Journal Article] グラフクラスタリングを用いたソシュールの概念ネットワーク解析2007
- Author(s)
  赤間啓之, 鄭在玲, 三宅真紀
- Journal Title
  
  情報処理学会研究報告 2007-9
  
  Pages: 33-40
- NAID
  110006202717
- Related Report
  2006 Annual Research Report
[Journal Article] 絵と指示対象間の関係が概念伝達に及ぼす影響の考察2006
- Author(s)
  清水由美子、赤間啓之
- Journal Title
  
  映像情報メディア学会誌 60・3
  
  Pages: 418-424
- NAID
  110006838430
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
- Peer Reviewed
[Journal Article] 携帯メールの絵文字と意味の広がり2006
- Author(s)
  清水由美子、赤間啓之
- Journal Title
  
  感性工学研究論文集 6・3
  
  Pages: 3-10
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
- Peer Reviewed
[Journal Article] Influences of the Relationship of a Picture and its Referent on Understandability of a Concept (Japanese)2006
- Author(s)
  Yumiko, Shimizu, Hiroyuki, Akama
- Journal Title
  
  Journal of the Institute of Image Information And Television Engineers 60・3
  
  Pages: 418-424
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Journal Article] Various Meanings of Mobile Icons-Their Understandability Depending on Semantic Categories(Japanese)2006
- Author(s)
  Yumiko, Shimizu, Hiroyuki, Mama
- Journal Title
  
  Journal on Kansei Engineering 6・3
  
  Pages: 310-310
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Journal Article] テキスト分析における2部グラフクラスタリングの可能性2006
- Author(s)
  赤間啓之, 三宅真紀, 鄭在玲
- Journal Title
  
  情報処理学会研究報告 NL-174
  
  Pages: 19-24
- NAID
  10018221557
- Related Report
  2006 Annual Research Report
[Journal Article] 絵と指示対象間の関係が概念伝達に及ぼす影響の考察2006
- Author(s)
  清水由美子, 赤間啓之
- Journal Title
  
  映像情報メディア学会誌 60・3
  
  Pages: 418-424
- NAID
  110006838430
- Related Report
  2006 Annual Research Report
[Journal Article] Markov Cluster Shortest Path Founded upon the Alibi-breaking Algorithm2006
- Author(s)
  Jaeyoung Jung, Maki Miyake, Hiroyuki Akama
- Journal Title
  
  CICLing-2006, Springer Verlag Berlin Heidelberg LNCS 3878
  
  Pages: 55-58
- Related Report
  2006 Annual Research Report
[Journal Article] Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network2006
- Author(s)
  Jaeyoung Jung, Maki, Miyake, Hiroyuki Akama
- Journal Title
  
  LREC2006(International Conference on Language Resources and Evaluation) Proceedings
  
  Pages: 1428-1432
- Related Report
  2006 Annual Research Report
[Journal Article] Development of a Web-based Composition Support System - Using Graph Clustering Methodologies Applied to an Associative Concepts Dictionary2006
- Author(s)
  Jaeyoung Jung, Maki Miyake, Nobuyasu Makoshi, Hiroyuki Akama
- Journal Title
  
  ICALT-2006
  
  Pages: 431-435
- Related Report
  2006 Annual Research Report
[Presentation] Irrationality or limited rationality in network science2008
- Author(s)
  Hiroyuki Akama
- Organizer
  Workshop on Psychological, Economic, and Environmental Rationality 2008
- Place of Presentation
  Tokyo(Japan)
- Year and Date
  2008-01-24
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Irrationality or limited rationality in network science2008
- Author(s)
  Hiroyuki, Akama
- Organizer
  Workshop on Psychological, Economic, and Environmental Rationality 2008
- Place of Presentation
  Tokyo (Japan)
- Year and Date
  2008-01-24
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Presentation] How to Take Advantage of the Limitations with Markov Clustering?-The Foundations of Branching Markov Clustering(BMCL)2008
- Author(s)
  Hiroyuki Akama
- Organizer
  IJCNLP-2008
- Place of Presentation
  Hyderabad(lndia)
- Year and Date
  2008-01-09
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] How to Take Advantage of the Limitations with Markov Clustering?-The Foundations of Branching Markov Clustering (BMCL)2008
- Author(s)
  Hiroyuki, Akama
- Organizer
  IJCNLP-2008
- Place of Presentation
  Hyderabad (India)
- Year and Date
  2008-01-09
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] How to Take Advantage of the Limitations with Markov Clustering?-The Foundations of Branching Markov Clustering (BMCL)2008
- Author(s)
  Hiroyuki Akama
- Organizer
  IJCNLP-2008
- Place of Presentation
  Hyderabad(India)
- Year and Date
  2008-01-09
- Related Report
  2007 Annual Research Report
[Presentation] グラフクラスタリングを用いた、歴史事象のシミュレーションの可能性について2007
- Author(s)
  赤間啓之
- Organizer
  じんもんこん2007
- Place of Presentation
  京都大学
- Year and Date
  2007-12-14
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Presentation] Possibility of Simulation for Historical Phenomena Using Graph Clustering (Japanese)2007
- Author(s)
  Hiroyuki, Akama
- Organizer
  Jinmonkon 2007
- Place of Presentation
  Kyoto University
- Year and Date
  2007-12-14
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Remaking the Markov Clusters of Word Data, Using the Example of"The Little Prince"by saint-exupery2007
- Author(s)
  Hiroyuki Akama
- Organizer
  SIG-CH
- Place of Presentation
  Taipei(Taiwan)
- Year and Date
  2007-09-27
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Remaking the Markov Clusters of Word Data, Using the Example of "The Little Prince" by Saint-Exupery2007
- Author(s)
  Hiroyuki, Akama
- Organizer
  SIG-CH
- Place of Presentation
  Taipei (Taiwan)
- Year and Date
  2007-09-27
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Remaking the Markov Clusters of Word Data, Using the Example of "The Little Prince" by Saint-Exupery2007
- Author(s)
  Hiroyuki Akama
- Organizer
  SIG-CH
- Place of Presentation
  Taipei(Taiwan)
- Year and Date
  2007-09-27
- Related Report
  2007 Annual Research Report
[Presentation] Building a clustered semantic network for an Entire Large Dictionary of Japanese,2007
- Author(s)
  Hiroyuki Akama
- Organizer
  PACLING-2007
- Place of Presentation
  Melbourne(Australia)
- Year and Date
  2007-09-20
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Presentation] Building a clustered semantic network for an Entire Large Dictionary of Japanese2007
- Author(s)
  Hiroyuki, Akama
- Organizer
  PACLING-2007
- Place of Presentation
  Melbourne (Australia)
- Year and Date
  2007-09-20
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] グラフクラスタリングを用いた文献解析の諸技法に関して-カバニスとメスメルのテキストを例に2007
- Author(s)
  赤間啓之
- Organizer
  情報処理学会・人文科学とコンピュータ研究会
- Place of Presentation
  龍谷大学
- Year and Date
  2007-07-27
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Presentation] On the Techniques of Document Analysis Using Graph Clustering-Taking the Examples from the Texts of Cabanis and Mesmer-(Japanese)2007
- Author(s)
  Hiroyuki, Akama
- Organizer
  SIG-CH
- Place of Presentation
  Ryukoku University
- Year and Date
  2007-07-27
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] 近代ストア主義とメスメール主義の思想的類似性に関するグラフ言語学的分析2007
- Author(s)
  赤間啓之
- Organizer
  情報処理学会・人文科学とコンピュータ研究会
- Place of Presentation
  神奈川工科大学
- Year and Date
  2007-05-18
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Annual Research Report 2007 Final Research Report Summary
[Presentation] Graph-based Linguistic Analysis on the Ideological Similarity between the Mesmerism and the Modern Stoicism (Japanese)2007
- Author(s)
  Hiroyuki, Akama
- Organizer
  SIG-CH
- Place of Presentation
  Kanagawa Institute of Technology
- Year and Date
  2007-05-18
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] グラフクラスタリングを用いたソシュールの概念ネットワーク解析2007
- Author(s)
  赤間啓之、鄭在玲、三宅真紀
- Organizer
  情報処理学会・人文科学とコンピュータ研究会
- Place of Presentation
  総合研究大学院大学
- Year and Date
  2007-01-27
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Analysis of the Semantic Network of Saussure Using Graph Clustering (Japanese)2007
- Author(s)
  Hiroyuki, Akama
- Organizer
  SIG-CH
- Place of Presentation
  The Graduate University for Advanced Studies
- Year and Date
  2007-01-27
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] テキスト分析における2部グラフクラスタリングの可能性2006
- Author(s)
  赤間啓之、三宅真紀、鄭在玲
- Organizer
  電子情報通信学会研究会、言語理解とコミュニケーション研究会
- Place of Presentation
  函館公立みらい大学
- Year and Date
  2006-07-27
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Possibilities of the Bipartite Graph Clustering in Text Analysis (Japanese)2006
- Author(s)
  Hiroyuki, Akama
- Organizer
  IEICE-NLC
- Place of Presentation
  Futur University -Hakodate
- Year and Date
  2006-07-27
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Development of a Web-based Composition Support System--Using Graph Clustering Methodologies Applied to an Associative Concepts Dictionary2006
- Author(s)
  Jaeyoung Jung, Maki Miyake, Nobuyasu Makoshi, Hiroyuki Akama
- Organizer
  ICALT-2006
- Place of Presentation
  ケルクラーデ、オランダ
- Year and Date
  2006-07-06
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Development of a Web-based Composition Support System -- Using Graph Clustering Methodologies Applied to an Associative Concepts Dictionary2006
- Author(s)
  Hiroyuki, Akama
- Organizer
  ICALT-2006
- Place of Presentation
  Kerkrade, Netherlands
- Year and Date
  2006-07-06
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Recurrent Markov Cluster(RMCL)Algorithm for the Refinement of the Semantic Network2006
- Author(s)
  Jaeyoung Jung, Maki Miyake, Hiroyuki Akama
- Organizer
  LREC2006(lnternational Conference on Language Resources and Evaluation)
- Place of Presentation
  ジェノヴァ、イタリア
- Year and Date
  2006-05-26
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2007 Final Research Report Summary
[Presentation] Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network2006
- Author(s)
  Hiroyuki, Akama
- Organizer
  LREC2006 (International Conference on Language Resources and Evaluation)
- Place of Presentation
  Genoa, Italy
- Year and Date
  2006-05-26
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2007 Final Research Report Summary
[Remarks] 「研究成果報告書概要(和文)」より
- URL
  http://dl.dp.hum.titech.ac.ip/wiki/?SEMNET
- Related Report
  2007 Final Research Report Summary

Development of Language Information System and Systematic Extraction of Latent Knowledge, Based on Graph Computation of Semantic Network

Principal Investigator

AKAMA Hiroyuki Tokyo Institute of Technology, Graduate School of Decision Science and Technology, Associate Professor (60242301)

¥3,200,000 (Direct Cost: ¥2,900,000、Indirect Cost: ¥300,000)

Report

Research Products

[Journal Article] L'elaboration d'un reseau semantique par le raffinement du Markov Clustering -A partir des donnees lexicales du roman de Saint-Exupery, 《Lepetit prince》2008

Author(s)

Journal Title

Description

Related Report

[Journal Article] Jaeyoung Jung L'elaboration d'un reseau semantique par le raffinement du Markov Clustering -A partir des donnees lexicales du roman de Saint-Exupery,《Le petit prince》2008

Author(s)

Journal Title

Description

Related Report

[Journal Article] L'elaboration d'un reseau semantique par le raffinement du Markov Clustering-A partir des donnees lexicales du roman de Saint-Exupery,≪Lepetit prince≫2008

Author(s)

Journal Title

Related Report

[Journal Article] 赤間啓之、コンピュータと計量的手法を利用した文学・思想作品の研究2007

Author(s)

Journal Title

Description

Related Report

[Journal Article] ネットワーク分析におけるテキストマイニング-クラスタリング係数、Markov Clusteringを中心として2007

Author(s)

Journal Title

Description

Related Report

[Journal Article] Research of the Works of Literature and Ideas by Using Computers and Quantitative Analysis (Japanese)2007

Author(s)

Journal Title

Description

Related Report

[Journal Article] Text Mining in Network Analysis-With a Central Focus on Clustering Coefficient and Markov Clustering(Japanese)2007

Author(s)

Journal Title

Description

Related Report

[Journal Article] グラフクラスタリングを用いたソシュールの概念ネットワーク解析2007

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 絵と指示対象間の関係が概念伝達に及ぼす影響の考察2006

Author(s)

Journal Title

NAID

Description

Related Report

[Journal Article] 携帯メールの絵文字と意味の広がり2006

Author(s)

Journal Title

Description

Related Report

[Journal Article] Influences of the Relationship of a Picture and its Referent on Understandability of a Concept (Japanese)2006

Author(s)

Journal Title

Description

Related Report

[Journal Article] Various Meanings of Mobile Icons-Their Understandability Depending on Semantic Categories(Japanese)2006

Author(s)

Journal Title

Description

Related Report

[Journal Article] テキスト分析における2部グラフクラスタリングの可能性2006

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 絵と指示対象間の関係が概念伝達に及ぼす影響の考察2006

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Markov Cluster Shortest Path Founded upon the Alibi-breaking Algorithm2006

Author(s)

Journal Title

Related Report