• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2003 Fiscal Year Final Research Report Summary

Basic Studies on Automatic Text Summarization as an Aid for Human Intellectual Activities

Research Project

Project/Area Number 13680444
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionToyohashi University of Technology

Principal Investigator

MASUYAMA Shigeru  Toyohashi University of Technology, Department of Knowledge-based Information Engineering, Professor, 工学部, 教授 (60173762)

Project Period (FY) 2001 – 2003
Keywordsautomatic text summarization / multiple document summarization / user interaction / disabbreviation / paraphrasing / information access technology / natural language processing
Research Abstract

In this project, we put the focus on the following two themes :
1.sentence reduction for summarization, 2. multiple document summarization, and the following results were obtained.
As for the sentence reduction, we proposed a method for deleting adnominal verb phrases. This method is based on the observation that if the kinds of verbs which modify the noun modified by the verb is limited, then the adnominal verb phrase can be easily associated with by the noun and maybe deleted. Such diversity of modifying verbs is measured by entropy. We also proposed a method of deleting multiple adnominal phrases. The degree of deletability of an adnominal phrase is estimated by the importance of the noun in the phrase and mutual information.
We developed a multiple document summarization system GOLD. Previous experiences show that a good automatic summarization system can be developed by combining appropreately a number of summarization techniques. Thus we developed GOLD by combining a variety of summarization techniques both conventional and newly introduced. The evalution results at TSC 2 of NTCIR 3 was satisfactory.
In a multiple document summarization, the document set to be summarized usually has multiple topics. However, a user may not necessarily to be interested in all topics. Thus, a user customized summary is needed. To cope with this need, we developed a multiple document summarization system with user interaction. The system suggests keywords extracted from the document set to be summarized and the user choose appropreate keywords among them. The evalution results at TSC 3 of NTCIR 4 was remarkable, in particular, in content evaluation.
As a related results, we proposed a method for acquiring knowledge from a single corpus on disabbreviations for Japanese nouns. This knowledge is useful e.g., for information retrieval, word sense disambiguation and summarization.

  • Research Products

    (13 results)

All Other

All Publications (13 results)

  • [Publications] 酒井 浩之, 増山 繁: "動詞連体修飾節の省略可能性に関するコーパスからの知識獲得"電子情報通信学会論文誌. D-II(採録決定). (2004)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Tatsumi Yoshida, SHigeru Masuyama: "Multiple summarization system GOLD"IEICE Transactions on Information and Systems. E86-D・9. 1719-1727 (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "A statistical method for acquiring knowledge about the abbreviationpossibility of some of multiple adnomial phrases"IEICE Transactions on Information and Systems. E86-D・9. 1710-1718 (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 酒井 浩之, 増山 繁: "名詞とその略語の対応関係のコーパスからの自動獲得"電子情報通信学会論文誌. D-II. 1624-1628 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 酒井 浩之, 篠原直嗣, 山本和英, 増山 繁: "連用修飾表現の省略可能性に関する知識の獲得"自然言語処理. Vol.9,No.3. 41-62 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 大竹 清敬, 岡本 大吾, 児玉 充, 増山 繁: "重要文抽出,自由作成要約に対応した新聞記事要約システムYELLOW"情報処理学会論文誌「データベース」. Vol.43,o.SIG2(TOD13). 37-42 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 茨木俊秀他62名著(計63名): "アルゴリズム工学-計算困難問題への挑戦-"共立出版. 280 (2001)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Knowledge Acquisition about the deletion possibility of adnominal verb phrases."Trans. IEICE D-II. Vol.J87-D-II.

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Tatsumi Yoshida, Shigeru Masuyama: "Multiple summarization system GOLD, IEICE Transactions on Information and Systems."IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1719-1727 (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "A statistical method for acquiring knowledge about the abbreviation possibility of some of multiple adnominal phrases."Transactions on Information and Systems. Vol.E86-D, No.9. 1710-1718 (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Knowledge-acquisition of relation between abbreviations and their original words."Trans. IEICE D-II. Vol.J85-D-II, No.10. 1624-1628 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Hiroyuki Sakai, Naotsugu Shinohara, Shigeru Masuyama, Kazuhide Yamamoto: "Knowledge Acquisition about the abbreviation possibility of verb phrases."IPSJ Trans. on Databases. Vol.9, No.3. 41-62 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Kiyonori Ohtake, Daigo Okamoto, Mitsuru Kodama, Shigeru Masuyama: "A summarization system YELLOW for Japanese newspaper articles."Vol.43, No.SIG 2(TOD 13). 37-47 (2002)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2005-04-19  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi