2014 Fiscal Year Final Research Report
A Study on Speech Synthesis with Rich Personality Based on Automatic Scoring of Reproduction of Speaker Identity
Project/Area Number |
24500223
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Multi-year Fund |
Section | 一般 |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Ritsumeikan University |
Principal Investigator |
|
Project Period (FY) |
2012-04-01 – 2015-03-31
|
Keywords | 声質 / 個人性 / 多様性 / 韻律 / 音声合成 / 音声分析 |
Outline of Final Research Achievements |
This research addresses measurement of personality and analysis of diversity in speech aiming at realizing speech synthesis with rich personalization. I proposed a new method for measuring the difference of voice quality based on feature parameters of speech. The similarity of voice quality is calculated by weighted Euclidean distance of MFCC parameters which represent spectrum features of speech. I analyzed the relationship between prosodic information and personality perception using synthetic speech in which phonemic information is removed but prosodic information, such as intonation, is preserved. I also analyzed various types of speech which include dialect, character voices in ‘Anime’, announcer voices, emotional voices,and so on.
|
Free Research Field |
音声情報処理
|