2011 Fiscal Year Final Research Report
Research on robust spoken language interfaces for diverse voice variability and expressivity
Project/Area Number |
21300063
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Tokyo Institute of Technology |
Principal Investigator |
KOBAYASHI Takao 東京工業大学, 大学院・総合理工学研究科, 教授 (70153616)
|
Co-Investigator(Renkei-kenkyūsha) |
NAGAHASHI Hiroshi 東京工業大学, 像情報工学研究所, 教授 (20143084)
|
Research Collaborator |
NOSE Takashi 東京工業大学, 大学院・総合理工学研究科, 助教 (90550591)
|
Project Period (FY) |
2009 – 2011
|
Keywords | 音声情報処理 |
Research Abstract |
The purpose of the research is to develop techniques that make the human-computer interaction using speech input/output more robust for variations of users' emotional states, speaking styles, preferences, and expressivity. We have proposed techniques using a quantized fundamental frequency prosodic context for robust speech synthesis and an extended context set for spontaneous conversational speech synthesis. We have also proposed techniques for robust speech recognition including extraction of paralinguistic information and rapid model adaptation.
|