2017 Fiscal Year Final Research Report
Research of Human-Kind Dialogue System with Recognition and Synthesis of Various Speech Based on State Estimation
Project/Area Number |
15H02720
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Perceptual information processing
|
Research Institution | Tohoku University |
Principal Investigator |
Nose Takashi Tohoku University, Graduate School of Engineering, Associate Professor (90550591)
|
Co-Investigator(Kenkyū-buntansha) |
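ITO Akinori Tohoku University, Graduate School of Engineering, Professor (70232428)
CHIBA Yuya Tohoku University, Graduate School of Engineering, Assistant Professor (30780936)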
|
Co-Investigator(Renkei-kenkyūsha) |
MORI Hiroki Utsunomiya University, Graduate School of Engineering, Associate Professor (10302184)
|
Project Period (FY) |
2015-04-01 – 2018-03-31
|
Keywords | spoken dialogue / emotional speech synthesis / emotion recognition / speech recognition / emotional speech corpus |
Outline of Final Research Achievements |
In this research project, we advanced techniques for recognizing and synthesizing diverse speech, and studied a technique for estimating the state of system users, together with its applications, in order to realize a dialogue system that is kind to its users. Specifically: (1) We studied the validity of using emotions and a technique for emotion estimation. (2) We proposed and evaluated a sentence selection technique based on extended entropy, which takes phonetic and prosodic contexts into account. (3) We recorded and analyzed dialogue data for willingness estimation. (4) We constructed a large-scale emotional speech corpus that can be used for emotional speech synthesis, emotional speech recognition, and emotion estimation. (5) We proposed and evaluated variance compensation and tailor-made speech synthesis as techniques for synthesizing diverse, high-quality speech.
|
Free Research Field |
Speech information processing
|