2017 Fiscal Year Final Research Report

A study of automatic English caption generation for Japanese lecture speech

Research Project

PDF

Project/Area Number	25280062
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Partial Multi-year Fund
Section	一般
Research Field	Perceptual information processing
Research Institution	Toyohashi University of Technology
Principal Investigator	nakagawa seiichi 豊橋技術科学大学, リーディング大学院教育推進機構, 特命教授 (20115893)
Co-Investigator(Kenkyū-buntansha)	秋葉友良豊橋技術科学大学, 工学(系)研究科(研究院), 准教授 (00356346) 井佐原均豊橋技術科学大学, 情報メディア基盤センター, 教授 (20358881) 山本一公中部大学, 工学部, 准教授 (40324230) 土屋雅稔豊橋技術科学大学, 情報メディア基盤センター, 准教授 (70378256)
Project Period (FY)	2013-04-01 – 2018-03-31
Keywords	講義の字幕化 / 講義音声 / 日本語講義 / 音声認識 / 日英翻訳 / 英日翻訳 / 音声翻訳
Outline of Final Research Achievements	First of all, we developed Japanese and English lecture speech recognition systems based on DNN. Next, we studied on English to Japanese translation for lecture speech. We transcribed English lectures by using plural English recognition systems and translated to Japanese based on a statistical machine translation system (SMT). Then, we selected the best translation from plural translation candidates by using a re-scoring technique based on language models and improved the translation result. For attacking mis-recognition errors, we simulated speech recognizers and used them as the training data for the SMT. By 1-year extension of the research period, we developed a neural machine translation system (NMT), and combine SMT and NMT. By re-scoring based on back translation for these translation candidates, we improved remarkably the translation performance. Finally, we applied the above techniques to the translation from Japanese lecture speech to English caption.
Free Research Field	音声言語情報処理