Construction and Summarization of Lecture Contents Using Both Slides and Lecture Speech

Research Project

Project/Area Number	23700115
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Multi-year Fund
Research Field	Media informatics/Database
Research Institution	Toyohashi University of Technology
Principal Investigator	TSUCHIYA Masatoshi 豊橋技術科学大学, 情報メディア基盤センター, 准教授 (70378256)
Project Period (FY)	2011-04-28 – 2015-03-31
Project Status	Completed (Fiscal Year 2014)
Budget Amount *help	¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000) Fiscal Year 2013: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2012: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000) Fiscal Year 2011: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords	自動要約 / 音声認識 / 固有表現 / 機能表現
Outline of Final Research Achievements	Because lecture speech contains spoken phenomena such as filled pauses and silent pauses, a robust automatic speech recoginition method is necessary in order to realize automatic summarization of lecture speech. Our method consists of two steps: 1st step is to predict filler insertion locations and pause insertion locations against loosely transcribed corpora which has no pause information using filler insertion model and pause insertion model learned from precisely transcribed corpora including filler information and pause information, and 2nd step is to construct a language model based on both loosely transcribed corpora and predicted information. And more, a method to detect lecture specific named entities was developed. The human annotation scheme to map lecture slides and lecture speech transcriptions was also established.

Report

(5 results)

2014 Annual Research Report Final Research Report ( PDF )
2013 Research-status Report
2012 Research-status Report
2011 Research-status Report

Research Products
(8 results)

All 2013 2012 2011 Other

All Journal Article (1 results) (of which Peer Reviewed: 1 results) Presentation (7 results)

[Journal Article] ポーズを考慮した話し言葉言語モデルの構築2012
- Author(s)
  太田健吾, 土屋雅稔, 中川聖一
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 53 Pages: 889-900
- NAID
  110008767197
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Presentation] プライバシ保護のための音声からの人名除去とその評価2013
- Author(s)
  川口亮，土屋雅稔，中川聖一
- Organizer
  日本音響学会2013年秋季研究発表会
- Place of Presentation
  豊橋技術科学大学
- Related Report
  2013 Research-status Report
[Presentation] 整形された書き起こしからの整形・非整形部分の自動検出2012
- Author(s)
  太田健吾, 土屋雅稔, 中川聖一
- Organizer
  第6回音声ドキュメント処理ワークショップ
- Place of Presentation
  豊橋技術科学大学
- Related Report
  2011 Research-status Report
[Presentation] 代表・派生関係を利用した日本語機能表現の解析方式の評価2012
- Author(s)
  鈴木敬文, 阿部佑亮, 宇津呂武仁, 松吉俊, 土屋雅稔
- Organizer
  言語処理学会第18回年次大会
- Place of Presentation
  広島市立大学
- Related Report
  2011 Research-status Report
[Presentation] 『現代日本語書き言葉均衡コーパス』における複合辞の検出と評価2012
- Author(s)
  鈴木敬文, 阿部佑亮, 宇津呂武仁, 松吉俊, 土屋雅稔
- Organizer
  コーパス日本語学ワークショップ
- Place of Presentation
  国立国語研究所
- Related Report
  2011 Research-status Report
[Presentation] Detection of Precisely Transcribed Parts from Inexact Transcribed Corpus2011
- Author(s)
  Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa
- Organizer
  Automatic Speech Recognition and Understanding Workshop
- Place of Presentation
  アメリカ・ハワイ
- Related Report
  2011 Research-status Report
[Presentation] Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
- Author(s)
  Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa
- Organizer
  The 8th International Conference on Language Resources and Evaluation (LREC2012)
- Place of Presentation
  Istanbul, Turkey
- Related Report
  2012 Research-status Report
[Presentation] 川口亮, 土屋雅稔, 中川聖一
- Author(s)
  音声ドキュメント中の人名抽出
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学
- Related Report
  2012 Research-status Report

Construction and Summarization of Lecture Contents Using Both Slides and Lecture Speech

Principal Investigator

TSUCHIYA Masatoshi 豊橋技術科学大学, 情報メディア基盤センター, 准教授 (70378256)

¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000)

Report

Research Products

[Journal Article] ポーズを考慮した話し言葉言語モデルの構築2012

Author(s)

Journal Title

NAID

Related Report

[Presentation] プライバシ保護のための音声からの人名除去とその評価2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 整形された書き起こしからの整形・非整形部分の自動検出2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 代表・派生関係を利用した日本語機能表現の解析方式の評価2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 『現代日本語書き言葉均衡コーパス』における複合辞の検出と評価2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Detection of Precisely Transcribed Parts from Inexact Transcribed Corpus2011

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Developing Partially-Transcribed Speech Corpus from Edited Transcriptions

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 川口 亮, 土屋 雅稔, 中川 聖一

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 川口亮, 土屋雅稔, 中川聖一