Construction and Summarization of Lecture Contents Using Both Slides and Lecture Speech
Project/Area Number |
23700115
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Multi-year Fund |
Research Field |
Media informatics/Database
|
Research Institution | Toyohashi University of Technology |
Principal Investigator |
TSUCHIYA Masatoshi Toyohashi University of Technology, Information and Media Center, Associate Professor (70378256)
|
Project Period (FY) |
2011-04-28 – 2015-03-31
|
Project Status |
Completed (Fiscal Year 2014)
|
Budget Amount |
¥4,160,000 (Direct Cost: ¥3,200,000, Indirect Cost: ¥960,000)
Fiscal Year 2013: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2012: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000)
Fiscal Year 2011: ¥1,690,000 (Direct Cost: ¥1,300,000, Indirect Cost: ¥390,000)
|
Keywords | automatic summarization / speech recognition / named entities / functional expressions |
Outline of Final Research Achievements |
Because lecture speech contains spoken-language phenomena such as filled pauses and silent pauses, a robust automatic speech recognition method is necessary to realize automatic summarization of lecture speech. Our method consists of two steps. The first step predicts filler and pause insertion locations in loosely transcribed corpora, which contain no pause information, using a filler insertion model and a pause insertion model learned from precisely transcribed corpora that include filler and pause information. The second step constructs a language model from both the loosely transcribed corpora and the predicted insertion locations. In addition, a method for detecting lecture-specific named entities was developed, and a human annotation scheme for mapping lecture slides to lecture speech transcriptions was established.
|
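The two-step method in the outline can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the record does not describe the actual insertion models or language model, so a simple relative-frequency predictor and raw bigram counts stand in for them, and the names `train_insertion_model`, `insert_fillers`, `bigram_counts`, the `<filler>` token, and the toy sentences are hypothetical. Pause insertion could be handled the same way with a separate pause token.

```python
from collections import Counter

FILLER = "<filler>"  # hypothetical marker for a filled pause


def train_insertion_model(precise_sentences):
    """Estimate, for each word, how often a filler directly follows it.

    Input: tokenized sentences from a precisely transcribed corpus in
    which fillers are marked explicitly with FILLER.
    """
    follows = Counter()
    totals = Counter()
    for tokens in precise_sentences:
        words = ["<s>"] + tokens
        for i, w in enumerate(words):
            if w == FILLER:
                continue
            totals[w] += 1
            if i + 1 < len(words) and words[i + 1] == FILLER:
                follows[w] += 1
    return {w: follows[w] / totals[w] for w in totals}


def insert_fillers(tokens, model, threshold=0.5):
    """Step 1: add predicted fillers to a loosely transcribed sentence."""
    out = []
    if model.get("<s>", 0.0) >= threshold:
        out.append(FILLER)
    for w in tokens:
        out.append(w)
        if model.get(w, 0.0) >= threshold:
            out.append(FILLER)
    return out


def bigram_counts(sentences):
    """Step 2: collect bigram counts as the basis of a language model."""
    counts = Counter()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        counts.update(zip(padded, padded[1:]))
    return counts


# Toy data (invented for this sketch, not taken from the project).
precise = [
    ["so", FILLER, "we", "train", "the", "model"],
    ["so", FILLER, "the", "model", "works"],
]
loose = [["so", "we", "test", "the", "model"]]

model = train_insertion_model(precise)
augmented = [insert_fillers(s, model) for s in loose]
counts = bigram_counts(augmented)
```

In this sketch the language model is estimated only from the augmented loose transcripts; a real system would presumably apply smoothing and could also pool counts from the precise transcripts.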
Report
(5 results)
Research Products
(8 results)