2016 Fiscal Year Final Research Report

Simultaneous speech translation methods for news and lectures in foreign languages

Research Project

Project/Area Number	24240032
Research Category	Grant-in-Aid for Scientific Research (A)
Allocation Type	Single-year Grants
Section	General
Research Field	Perception information processing/Intelligent robotics
Research Institution	Nara Institute of Science and Technology
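Principal Investigator	Nakamura Satoshi, Nara Institute of Science and Technology, Graduate School of Information Science, Professor (30263429)
Co-Investigator(Kenkyū-buntansha)	Matsumoto Yuji, Nara Institute of Science and Technology, Graduate School of Information Science, Professor (10211575); Toda Tomoki, Nara Institute of Science and Technology, Graduate School of Information Science, Associate Professor (90403328); Sakriani Sakti, Nara Institute of Science and Technology, Graduate School of Information Science, Assistant Professor (00395005); Neubig Graham, Nara Institute of Science and Technology, Graduate School of Information Science, Assistant Professor (70633428); Duh Kevin, Nara Institute of Science and Technology, Graduate School of Information Science, Assistant Professor (80637322); Komachi Mamoru, Nara Institute of Science and Technology, Graduate School of Information Science, Assistant Professor (60581329)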
Project Period (FY)	2012-05-31 – 2017-03-31
Keywords	Speech information processing / Speech translation / Speech recognition / Machine translation
Outline of Final Research Achievements	This project proposes new algorithms for simultaneous speech-to-speech translation. The first algorithm decides whether to output the phrases received so far to the machine translation module or to hold them, based on the right probability in phrase-based statistical machine translation. The second algorithm segments the input phrase sequence by greedy search using part-of-speech (POS) bigram information. The third algorithm uses an SVM together with an incremental bottom-up parser to predict the next phrase or local parse-tree element, and then again decides whether to output or hold the phrases. Experiments showed that the proposed algorithms successfully achieve simultaneous speech translation. Neural machine translation algorithms with attention mechanisms were also investigated. In addition, 80 hours of Japanese-English (J-E) interpretation data, 50 hours of Japanese lecture transcription data, and 22 hours of J-E translation data were collected for use in simultaneous speech translation research.
Free Research Field	Intelligent communication
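
The output-or-hold decision that the outline attributes to the first algorithm can be illustrated with a minimal Python sketch. It is not the project's implementation. It assumes that each incoming phrase has a "right probability" score, read here as how safe it is to translate up to that phrase without waiting for more input, and that held phrases are released once that score reaches a threshold. The function name, the threshold value, and the toy scores are all hypothetical. The outline does not say how the right probability is estimated.

```python
from typing import Dict, Iterable, List


def segment_by_right_probability(
    phrases: Iterable[str],
    right_prob: Dict[str, float],
    threshold: float = 0.8,      # assumed cut-off, not taken from the report
    default_prob: float = 0.0,   # unseen phrases are always held
) -> List[List[str]]:
    """Group incoming phrases into segments to hand to the MT module."""
    segments: List[List[str]] = []
    held: List[str] = []
    for phrase in phrases:
        held.append(phrase)
        # Output the held phrases when the newest one scores high enough;
        # otherwise keep holding and wait for more input.
        if right_prob.get(phrase, default_prob) >= threshold:
            segments.append(held)
            held = []
    if held:
        # End of input: flush whatever is still being held.
        segments.append(held)
    return segments


# Toy scores invented purely for illustration.
toy_right_prob = {"i think": 0.9, "that": 0.3, "this is": 0.4, "important": 0.95}
segments = segment_by_right_probability(
    ["i think", "that", "this is", "important"], toy_right_prob
)
print(segments)  # [['i think'], ['that', 'this is', 'important']]
```

In this sketch, a lower threshold releases segments sooner, which shortens delay but gives the translation module less context. A higher threshold does the opposite.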