
Basic Studies on Automatic Text Summarization as an Aid for Human Intellectual Activities

Research Project

Project/Area Number 13680444
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution Toyohashi University of Technology

Principal Investigator

MASUYAMA Shigeru  Toyohashi University of Technology, Faculty of Engineering, Department of Knowledge-based Information Engineering, Professor (60173762)

Project Period (FY) 2001 – 2003
Project Status Completed (Fiscal Year 2003)
Budget Amount
¥3,500,000 (Direct Cost: ¥3,500,000)
Fiscal Year 2003: ¥1,200,000 (Direct Cost: ¥1,200,000)
Fiscal Year 2002: ¥1,100,000 (Direct Cost: ¥1,100,000)
Fiscal Year 2001: ¥1,200,000 (Direct Cost: ¥1,200,000)
Keywords automatic text summarization / multiple document summarization / user interaction / disabbreviation / paraphrasing / information access technology / natural language processing / statistical methods / information retrieval / searcher support / intra-sentence deletion / knowledge acquisition / unsupervised learning
Research Abstract

In this project, we focused on two themes: (1) sentence reduction for summarization and (2) multiple document summarization. The following results were obtained.
As for sentence reduction, we proposed a method for deleting adnominal verb phrases. The method is based on the observation that if only a limited variety of verbs modifies a given noun, then the adnominal verb phrase is easily recalled from the noun alone and may be deleted. This diversity of modifying verbs is measured by entropy. We also proposed a method for deleting multiple adnominal phrases. The degree of deletability of an adnominal phrase is estimated from the importance of the noun in the phrase and from mutual information.
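The entropy criterion described above can be illustrated with a minimal sketch. It is not the project's implementation: the toy (noun, verb) pairs, the function name `verb_entropy_by_noun`, and the cut-off `THRESHOLD` are all assumptions made for illustration. The sketch computes, for each noun, the entropy of the verbs observed modifying it, and treats nouns with low entropy as candidates whose adnominal verb phrase could be deleted.

```python
import math
from collections import Counter, defaultdict


def verb_entropy_by_noun(pairs):
    """Return {noun: entropy in bits of the verbs observed modifying that noun}."""
    counts = defaultdict(Counter)
    for noun, verb in pairs:
        counts[noun][verb] += 1

    entropies = {}
    for noun, verb_counts in counts.items():
        total = sum(verb_counts.values())
        entropies[noun] = -sum(
            (c / total) * math.log2(c / total) for c in verb_counts.values()
        )
    return entropies


# Hypothetical toy data: (modified noun, modifying verb) pairs from a corpus.
pairs = [
    ("decision", "make"), ("decision", "make"),
    ("decision", "make"), ("decision", "make"),
    ("person", "see"), ("person", "know"),
    ("person", "meet"), ("person", "call"),
]

entropies = verb_entropy_by_noun(pairs)

# Assumed cut-off: below it, the modifying verb is considered predictable
# from the noun, so the adnominal verb phrase is a deletion candidate.
THRESHOLD = 1.0
deletable = sorted(noun for noun, h in entropies.items() if h < THRESHOLD)
```

In this toy data "decision" is always modified by the same verb, so its entropy is zero and it falls below the assumed threshold, while "person" is modified by four equally frequent verbs and does not.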
We developed a multiple document summarization system, GOLD. Previous experience shows that a good automatic summarization system can be developed by appropriately combining a number of summarization techniques. We therefore developed GOLD by combining a variety of summarization techniques, both conventional and newly introduced. The evaluation results at TSC 2 of NTCIR 3 were satisfactory.
In multiple document summarization, the document set to be summarized usually covers multiple topics. However, a user is not necessarily interested in all of them, so a user-customized summary is needed. To meet this need, we developed a multiple document summarization system with user interaction. The system suggests keywords extracted from the document set to be summarized, and the user chooses appropriate keywords from among them. The evaluation results at TSC 3 of NTCIR 4 were remarkable, particularly in content evaluation.
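The interaction loop described above (suggest keywords, let the user choose, summarize accordingly) might look roughly like the following sketch. It is only an illustration under stated assumptions, not the system evaluated at TSC 3: ranking keywords by document frequency, the small English stopword list, the sentence splitter, and the names `suggest_keywords` and `summarize` are all hypothetical choices.

```python
import re
from collections import Counter

# Assumed minimal stopword list, for illustration only.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to",
             "is", "was", "were", "for", "after"}


def tokenize(text):
    """Lowercase alphabetic tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def suggest_keywords(documents, top_n=5):
    """Rank words by the number of documents containing them (ties: alphabetical)."""
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(tokenize(doc)))
    ranked = sorted(doc_freq.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:top_n]]


def summarize(documents, chosen_keywords, max_sentences=2):
    """Keep the sentences that mention the most user-chosen keywords."""
    chosen = set(chosen_keywords)
    scored = []
    for doc in documents:
        for sentence in re.split(r"(?<=[.!?])\s+", doc.strip()):
            score = len(chosen & set(tokenize(sentence)))
            if score > 0:
                scored.append((score, sentence))
    # The sort is stable, so equally scored sentences stay in document order.
    scored.sort(key=lambda item: -item[0])
    return [sentence for _, sentence in scored[:max_sentences]]


# Hypothetical document set covering more than one topic.
documents = [
    "The election results were announced. Stock prices fell after the election.",
    "Stock prices recovered on Tuesday. The weather was mild.",
]

suggested = suggest_keywords(documents, top_n=3)        # shown to the user
summary = summarize(documents, chosen_keywords=["stock"])  # user picked "stock"
```

A user who picks "stock" from the suggestions receives only the sentences about stock prices; the sentences about the election announcement and the weather are left out.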
As a related result, we proposed a method for acquiring knowledge about disabbreviations of Japanese nouns from a single corpus. This knowledge is useful, e.g., for information retrieval, word sense disambiguation, and summarization.

Report

(4 results)
  • 2003 Annual Research Report   Final Research Report Summary
  • 2002 Annual Research Report
  • 2001 Annual Research Report
  • Research Products

    (27 results)

All Publications (27 results)

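  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Knowledge acquisition from a corpus about the deletion possibility of adnominal verb phrases" Trans. IEICE. D-II (accepted for publication). (2004)

    • Description
      From the "Summary of Research Results Report (Japanese)"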
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Tatsumi Yoshida, Shigeru Masuyama: "Multiple summarization system GOLD" IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1719-1727 (2003)

    • Description
      From the "Summary of Research Results Report (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "A statistical method for acquiring knowledge about the abbreviation possibility of some of multiple adnominal phrases" IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1710-1718 (2003)

    • Description
      From the "Summary of Research Results Report (Japanese)"
    • Related Report
      2003 Final Research Report Summary
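  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Automatic acquisition from a corpus of correspondences between nouns and their abbreviations" Trans. IEICE. D-II. 1624-1628 (2002)

    • Description
      From the "Summary of Research Results Report (Japanese)"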
    • Related Report
      2003 Final Research Report Summary
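  • [Publications] Hiroyuki Sakai, Naotsugu Shinohara, Kazuhide Yamamoto, Shigeru Masuyama: "Acquisition of knowledge about the deletion possibility of adverbial modifier expressions" Journal of Natural Language Processing. Vol.9, No.3. 41-62 (2002)

    • Description
      From the "Summary of Research Results Report (Japanese)"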
    • Related Report
      2003 Final Research Report Summary
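  • [Publications] Kiyonori Ohtake, Daigo Okamoto, Mitsuru Kodama, Shigeru Masuyama: "YELLOW: a newspaper article summarization system supporting important-sentence extraction and free-form summarization" IPSJ Trans. on Databases. Vol.43, No.SIG2(TOD13). 37-42 (2002)

    • Description
      From the "Summary of Research Results Report (Japanese)"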
    • Related Report
      2003 Final Research Report Summary
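  • [Publications] Toshihide Ibaraki and 62 other authors (63 in total): "Algorithm Engineering: Challenges to Computationally Hard Problems" Kyoritsu Shuppan. 280 (2001)

    • Description
      From the "Summary of Research Results Report (Japanese)"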
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Knowledge Acquisition about the deletion possibility of adnominal verb phrases." Trans. IEICE D-II. Vol.J87-D-II.

    • Description
      From the "Summary of Research Results Report (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Tatsumi Yoshida, Shigeru Masuyama: "Multiple summarization system GOLD." IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1719-1727 (2003)

    • Description
      From the "Summary of Research Results Report (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "A statistical method for acquiring knowledge about the abbreviation possibility of some of multiple adnominal phrases." IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1710-1718 (2003)

    • Description
      From the "Summary of Research Results Report (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Knowledge-acquisition of relation between abbreviations and their original words." Trans. IEICE D-II. Vol.J85-D-II, No.10. 1624-1628 (2002)

    • Description
      From the "Summary of Research Results Report (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Hiroyuki Sakai, Naotsugu Shinohara, Shigeru Masuyama, Kazuhide Yamamoto: "Knowledge Acquisition about the abbreviation possibility of verb phrases." Journal of Natural Language Processing. Vol.9, No.3. 41-62 (2002)

    • Description
      From the "Summary of Research Results Report (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kiyonori Ohtake, Daigo Okamoto, Mitsuru Kodama, Shigeru Masuyama: "A summarization system YELLOW for Japanese newspaper articles." IPSJ Trans. on Databases. Vol.43, No.SIG 2(TOD 13). 37-47 (2002)

    • Description
      From the "Summary of Research Results Report (English)"
    • Related Report
      2003 Final Research Report Summary
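  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Knowledge acquisition from a corpus about the deletion possibility of adnominal verb phrases" Trans. IEICE. D-II (accepted for publication). (2004)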

    • Related Report
      2003 Annual Research Report
  • [Publications] Tatsumi Yoshida, Shigeru Masuyama: "Multiple summarization system GOLD" IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1719-1727 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "A statistical method for acquiring knowledge about the abbreviation possibility of some of multiple adnominal phrases" IEICE Transactions on Information and Systems. Vol.E86-D, No.9. 1710-1718 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Automatic acquisition from a corpus of correspondences between nouns and their abbreviations" Trans. IEICE. D-II, No.10. 1624-1628 (2002)

    • Related Report
      2002 Annual Research Report
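  • [Publications] Hiroyuki Sakai, Naotsugu Shinohara, Kazuhide Yamamoto, Shigeru Masuyama: "Acquisition of knowledge about the deletion possibility of adverbial modifier expressions" Journal of Natural Language Processing. Vol.9, No.3. 41-62 (2002)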

    • Related Report
      2002 Annual Research Report
  • [Publications] Kiyonori Ohtake, Kazuhide Yamamoto, Yuji Toma, Shigeru Masuyama, Seiichi Nakagawa: "Newscast Speech Summarization via Sentence Shortening based on Prosodic Features" Proc. of SSPR2003. (To appear). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Hiroyuki Sakai, Shigeru Masuyama: "Unsupervised Knowledge Acquisition about the Deletion Possibility of Adnominal Verb Phrase" Proc. of WSQA2002 (Coling2002 Post-Conference Workshop, Multilingual Summarization and Question Answering 2002). 49-56 (2002)

    • Related Report
      2002 Annual Research Report
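  • [Publications] Hiroyuki Sakai et al.: "Acquisition of knowledge about the deletion possibility of adverbial modifier expressions" Journal of Natural Language Processing. (to appear).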

    • Related Report
      2001 Annual Research Report
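  • [Publications] Kiyonori Ohtake et al.: "YELLOW: a newspaper article summarization system supporting important-sentence extraction and free-form summarization" IPSJ Trans. on Databases. (to appear).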

    • Related Report
      2001 Annual Research Report
  • [Publications] Kiyonori Ohtake et al.: "Yet Another summarization System with Two Modules using Empirical Knowledge" Proc. NTCIR Workshop 2 Meeting. 331-340 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroyuki Sakai et al.: "On Retrieval Support System by Suggesting Terms to a User" Proc. NTCIR Workshop 2 Meeting. 222-226 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroyuki Sakai et al.: "A Retrieval Support System by Suggesting Terms to a User" Proc. of ICCPOL2001 (19th International Conference on Computer Processing of Oriental Languages). 77-80 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Kiyonori Ohtake et al.: "Elimination of Multiple Modifiers in Summarization" Proc. of ICCPOL2001 (19th International Conference on Computer Processing of Oriental Languages). 282-285 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Toshihide Ibaraki et al.: "Algorithm Engineering: Challenges to Computationally Hard Problems" Kyoritsu Shuppan. 280 (2001)

    • Related Report
      2001 Annual Research Report

Published: 2001-04-01   Modified: 2016-04-21  
