Structurized Term Extraction from Academic Text Corpora

Research Project

Project/Area Number	19500135
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Intelligent informatics
Research Institution	National Institute of Informatics
Principal Investigator	KOYAMA Teruo National Institute of Informatics, Information and Society Research Division, Professor (80124410)
Co-Investigator(Kenkyū-buntansha)	TAKEUCHI Koichi Okayama University, Graduate School of Natural Science and Technology, Lecturer (80311174)
Project Period (FY)	2007 – 2009
Project Status	Completed (Fiscal Year 2009)
Budget Amount	¥4,160,000 (Direct Cost: ¥3,200,000, Indirect Cost: ¥960,000) Fiscal Year 2009: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000) Fiscal Year 2008: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000) Fiscal Year 2007: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
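Keywords	term extraction / term systematization / natural language processing / morphological analysis / research subdomain / compound word structure analysis / research subdomain identification / term classification / lexical conceptual structure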
Research Abstract	In this study, we established a high-precision method for comprehensive term extraction from domain text corpora. The method rests on two new principles: a reconsideration and modification of Japanese morpheme classification, and an evaluation of the certainty of compound-word boundaries. We also developed methods to structure terms from two points of view: nesting relations between compound words, and the relationships of terms to various research subdomains.

Report

(4 results)

2009 Annual Research Report Final Research Report ( PDF )
2008 Annual Research Report
2007 Annual Research Report

Research Products
(12 results)

Journal Article (5 results), Presentation (5 results), Remarks (2 results)

[Journal Article] Compound Term Extraction from Japanese Text (2010)
- Author(s)
  KOYAMA Teruo
- Journal Title
  Journal of Japan Society of Information and Knowledge, Vol. 19, No. 4
  Pages: 306-315
- NAID
  10025992156
- Related Report
  2009 Final Research Report
[Journal Article] Compound Term Extraction from Japanese Text (2010)
- Author(s)
  KOYAMA Teruo
- Journal Title
  Journal of Japan Society of Information and Knowledge, Vol. 19
  Pages: 306-315
- NAID
  10025992156
- Related Report
  2009 Annual Research Report
[Journal Article] Evaluating Document Set Similarity Based on Morpheme Occurrence Patterns (2008)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Journal Title
  IPSJ SIG Technical Reports 2008-NL-188
  Pages: 51-56
- NAID
  110007082300
- Related Report
  2008 Annual Research Report
[Journal Article] Research Subdomain Estimation and Term Classification Based on Term Clustering (2008)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Journal Title
  IPSJ SIG Technical Reports 2008-NL-183
  Pages: 87-92
- NAID
  110006623479
- Related Report
  2007 Annual Research Report
[Journal Article] Systematic Hierarchization of Japanese Compound Terms Based on Nesting Relations (2007)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Journal Title
  IEICE Technical Report NLC2007-1-28
  Pages: 49-54
- Related Report
  2007 Annual Research Report
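[Presentation] Compound Term Extraction Considering Connection Relations Between Candidates (2009)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Organizer
  IPSJ SIG Technical Reports, Information Processing Society of Japan
- Place of Presentation
  Kyoto University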
- Year and Date
  2009-09-29
- Related Report
  2009 Final Research Report
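[Presentation] Compound Term Extraction Considering Connection Relations Between Candidates (2009)
- Author(s)
  KOYAMA Teruo
- Organizer
  IPSJ Special Interest Group on Natural Language Processing
- Place of Presentation
  Kyoto University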
- Year and Date
  2009-09-29
- Related Report
  2009 Annual Research Report
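[Presentation] Evaluating Document Set Similarity Based on Morpheme Occurrence Patterns (2008)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Organizer
  IPSJ SIG Technical Reports, Information Processing Society of Japan
- Place of Presentation
  Kyushu University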
- Year and Date
  2008-11-26
- Related Report
  2009 Final Research Report
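[Presentation] Research Subdomain Estimation and Term Classification Based on Term Clustering (2008)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Organizer
  IPSJ SIG Technical Reports, Information Processing Society of Japan
- Place of Presentation
  National Institute of Informatics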
- Year and Date
  2008-01-22
- Related Report
  2009 Final Research Report
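[Presentation] Hierarchical Systematization of Japanese Compound Terms Based on Nesting Relations, IEICE Technical Report (2007)
- Author(s)
  KOYAMA Teruo, TAKEUCHI Koichi
- Organizer
  Institute of Electronics, Information and Communication Engineers (IEICE)
- Place of Presentation
  Tokushima University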
- Year and Date
  2007-07-24
- Related Report
  2009 Final Research Report
[Remarks]
- URL
  http://research.nii.ac.jp/~koyama/official/tmrec/
- Related Report
  2009 Final Research Report
[Remarks]
- URL
  http://research.nii.ac.jp/~koyama/official/tmrec/index.html
- Related Report
  2009 Annual Research Report
