2009 Fiscal Year Final Research Report
Speech synthesis with communicative prosody driven by the impressions of output lexicons
Project/Area Number |
18300063
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Waseda University |
Principal Investigator |
SAGISAKA Yoshinori Waseda University, 理工学術院, 教授 (70339737)
|
Co-Investigator(Renkei-kenkyūsha) |
KOBAYASHI Tetsunori 早稲田大学, 理工学術院, 教授 (30162001)
NONDA Masaaki 早稲田大学, スポーツ科学学術院, 教授 (90367095)
|
Project Period (FY) |
2006 – 2009
|
Keywords | 音声情報処理 |
Research Abstract |
A scheme for communicative prosody generation was proposed to synthesize speech needed for conversational purposes. Using the correlation between communicative prosody and impression attributes of lexicons constituting output, the proposed scheme enables prosody control for conversational speech output. Perceptual experiments showed the superiority of the speech synthesized with the proposed communicative prosody to the conventional one with reading style prosody. Further application to Chinese and English speech synthesis and the reverse technology of impression extraction from speech clarified the usefulness of the proposed approach.
|
Research Products
(18 results)