• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

High Compression-Rate Automatic Summarization of Newspaper Articles Based on Combined Use of Significant Sentence Extraction and Sentence Compression

Research Project

Project/Area Number 16500077
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionThe University of Electro-Communications

Principal Investigator

OZEKI Kazuhiko  The University of Electro-Communications, Faculty of Electro-Communications, Professor, 電気通信学部, 教授 (50214135)

Co-Investigator(Kenkyū-buntansha) TAKAGI Kazuyuki  The University of Electro-Communications, Faculty of Electro-Communications, Research Associate, 電気通信学部, 助手 (70272755)
Project Period (FY) 2004 – 2006
Project Status Completed (Fiscal Year 2006)
Budget Amount *help
¥3,000,000 (Direct Cost: ¥3,000,000)
Fiscal Year 2006: ¥700,000 (Direct Cost: ¥700,000)
Fiscal Year 2005: ¥700,000 (Direct Cost: ¥700,000)
Fiscal Year 2004: ¥1,600,000 (Direct Cost: ¥1,600,000)
Keywordstext summarization / sentence compression / phrase significance / inter-phrase dependency / phrase alignment / dependency path length / information retention / grammatical naturalness / 係り受け / 概念距離
Research Abstract

1.In this work, we use a corpus in which pairs of newspaper articles and corresponding hand-made short summaries are contained. This corpus provides information about how humans make short summaries. To obtain such information effectively, phrase alignment is necessary between the original sentence and its summary. We developed a phrase aligner that makes use of conceptual distance and inter-phrase dependency.
2.Before the research period started, we were using the inter-phrase dependency strength estimated from the distribution of dependency distance in the set of original sentences. This method misses, however, the relationship between the original sentence and its summary. In this work, we estimated the inter-phrase dependency strength from the relative frequency of phrase pairs that exist in the original sentence with a certain dependency path length and remain having modifier-modified relation in the corresponding summary. The result of a subjective evaluation experiment showed sig … More nificant improvement in the quality of compressed sentences.
3.In the phrase extraction type sentence compression, which is employed in this research, phrases that are not in modifier-modified relation in the original sentence sometimes appear to have modifier-modified relation in the compressed sentence. Such a phenomenon may degrade the readability of compressed sentences. We worked out a method to modify the phrase ending of the modifier-phrase for improving the readability of compressed sentences. The result of subjective evaluation experiment showed the effectiveness of the method.
4.We reformulated our sentence compression method in a probabilistic framework. In calculating the probability that a compressed sentence is generated from an original sentence, quantities similar to phrase significance and inter-phrase dependency appear, which can be estimated from a training corpus. It was shown that this probabilistic approach attains comparable performance as our former, heuristic approach. Less

Report

(4 results)
  • 2006 Annual Research Report   Final Research Report Summary
  • 2005 Annual Research Report
  • 2004 Annual Research Report
  • Research Products

    (19 results)

All 2007 2006 2005

All Journal Article (19 results)

  • [Journal Article] 確率的な手法による日本語文簡約2007

    • Author(s)
      福冨 諭
    • Journal Title

      言語処理学会第13回年次大会発表論文集

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Japanese sentence compression using probabilistic approach2007

    • Author(s)
      Satoshi Fukutomi, Kazuyuki Takagi, Kazuhiko Ozeki
    • Journal Title

      Proc. of NLP200 D5-2

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 確率的な手法による日本語文簡約2007

    • Author(s)
      福冨 諭
    • Journal Title

      言語処理学会第13回年次大会発表論文集 D5-2(印刷中)

    • Related Report
      2006 Annual Research Report
  • [Journal Article] 文節抽出型文簡約における読みやすさ向上のための文節末修正2006

    • Author(s)
      福冨 諭
    • Journal Title

      言語処理学会第12回年次大会発表論文集

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 係り受け経路長を利用した新聞記事の自動簡約2006

    • Author(s)
      山形 究
    • Journal Title

      言語処理学会第12回年次大会発表論文集

      Pages: 2-11

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Sentence compression using statistical information about dependency path length2006

    • Author(s)
      Kiwamu Yamagata
    • Journal Title

      Proceedings of TSD 2006 (Lecture Notes in Artificial Intelligence, Springer-Verlag) 4188

      Pages: 127-134

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Modification of phrase-ending for improving readability in sentence compression by phrase selection2006

    • Author(s)
      Satoshi Fukutomi, Kazuyuki Takagi, Kazuhiko Ozeki
    • Journal Title

      Proc. of NLP2006 D5-5

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Newspaper article summarization using dependency path length2006

    • Author(s)
      Kiwamu Yamagata, Satoshi Fukutomi, Kazuyuki Takagi, Kazuhiko Ozeki
    • Journal Title

      Proc. of NLP2006 P2-11

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Sentence compression using statistical information about dependency path length2006

    • Author(s)
      Kiwamu Yamagata, Satoshi Fukutomi, Kazuyuki Takagi, Kazuhiko Ozeki
    • Journal Title

      Proc. of TSD 2006(LNAI 4188, Springer)

      Pages: 127-134

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Sentence compression using statistical information about dependency path length2006

    • Author(s)
      Kiwamu Yamagata
    • Journal Title

      Proc. TSD 2006 (LNAI 4188)

      Pages: 127-134

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Sentence compression : p progress report2006

    • Author(s)
      Kazuhiko Ozeki
    • Journal Title

      The 6th China-Japan Natural language Processing Joint Research Promotion Conference (CD作成中)

    • Related Report
      2006 Annual Research Report
  • [Journal Article] 文節抽出型文簡約における読みやすさ向上のための文節末修正2006

    • Author(s)
      福冨 諭
    • Journal Title

      言語処理学会第12回年次大会発表論文集 (発表予定)

    • Related Report
      2005 Annual Research Report
  • [Journal Article] 係り受け経路長を利用した新聞記事の自動簡約2006

    • Author(s)
      山形 究
    • Journal Title

      言語処理学会第12回年次大会発表論文集 (発表予定)

    • Related Report
      2005 Annual Research Report
  • [Journal Article] 概念距離と係り受けを利用した要約文の文節対応付け2005

    • Author(s)
      福冨 諭
    • Journal Title

      情報処理学会第67回全国大会講演論文集

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 概念距離と係り受けを利用した要約文の文節対応付け2005

    • Author(s)
      福冨 諭
    • Journal Title

      言語処理学会第11回年次大会発表論文集

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Aligning phrases in original text and its summary using concept distance and inter-phrase dependency2005

    • Author(s)
      Satoshi Fukutomi, Kazuyuki Takagi, Kazuhiko Ozeki
    • Journal Title

      Proc. of 67th Annual Meeting of IPSJ 5J-2

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Aligning phrases in original text and its summary using concept distance and inter-phrase dependency2005

    • Author(s)
      Satoshi Fukutomi, Kazuyuki Takagi, Kazuhiko Ozeki
    • Journal Title

      Proc. of NLP2005 D3-8

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 概念距離と係り受けを利用した要約文の文節対応付け2005

    • Author(s)
      福冨 論
    • Journal Title

      情報処理学会第67回全国大会講演論文集 第2分冊

      Pages: 119-120

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 概念距離と係り受けを利用した要約文の文節対応付け2005

    • Author(s)
      福冨 論
    • Journal Title

      言語処理学会第11回年次大会発表論文集 (発表予定)

    • Related Report
      2004 Annual Research Report

URL: 

Published: 2004-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi