2014 Fiscal Year Final Research Report

Research on advanced robust speech synthesis and its applications to multi-lingual speech communication

Project/Area Number	24300071
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Partial Multi-year Fund
Section	General
Research Field	Perception information processing/Intelligent robotics
Research Institution	Tokyo Institute of Technology
Principal Investigator	KOBAYASHI Takao, Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Professor (70153616)
Co-Investigator(Kenkyū-buntansha)	NOSE Takashi, Tohoku University, Graduate School of Engineering, Lecturer (90550591)
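Research Collaborator	KORIYAMA Tomoki, Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Assistant Professor (50749124); ARIFIANTO Dhany, Sepuluh Nopember Institute of Technology (Surabaya), Department of Engineering Physics, Lecturer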
Project Period (FY)	2012-04-01 – 2015-03-31
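Keywords	text-to-speech synthesis / statistical parametric speech synthesis / HMM-based speech synthesis / expressive speech synthesis / prosody / cross-lingual speech synthesis / speech style control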
Outline of Final Research Achievements	The purpose of this research was to develop advanced techniques for modeling the acoustic features of both prosodic and spectral information in a way that depends less on the quality and quantity of the training speech data, so that natural-sounding and diverse expressive speech can be synthesized. We proposed several robust techniques, including ones for style control and prosody modeling, and showed their effectiveness through objective and subjective evaluation tests. We also applied the proposed techniques to under-resourced languages. Furthermore, we examined a cross-lingual speech synthesis technique for universal speech communication.
Free Research Field	Speech information processing