2014 Fiscal Year Final Research Report
Research on advanced robust speech synthesis and its applications to multi-lingual speech communication
Project/Area Number |
24300071
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Partial Multi-year Fund |
Section | General |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Tokyo Institute of Technology |
Principal Investigator |
KOBAYASHI Takao Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Professor (70153616)
|
Co-Investigator (Kenkyū-buntansha) |
NOSE Takashi Tohoku University, Graduate School of Engineering, Lecturer (90550591)
|
Research Collaborator |
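KORIYAMA Tomoki Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Assistant Professor (50749124)
ARIFIANTO Dhany Surabaya Institute of Technology, Department of Engineering Physics, Lecturer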
|
Project Period (FY) |
2012-04-01 – 2015-03-31
|
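Keywords | text-to-speech synthesis / statistical parametric speech synthesis / HMM-based speech synthesis / expressive speech synthesis / prosody / cross-lingual speech synthesis / speech style control |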
Outline of Final Research Achievements |
The purpose of this research was to develop advanced techniques for modeling the acoustic features of both prosodic and spectral information in a way that depends less on the quality and quantity of the training speech data, in order to synthesize natural-sounding and diverse expressive speech. We proposed several robust techniques, including style control and prosody modeling, and demonstrated their effectiveness through objective and subjective evaluation tests. We also applied the proposed techniques to under-resourced languages. Furthermore, we examined a cross-lingual speech synthesis technique for universal speech communication.
|
Free Research Field |
Speech information processing
|