2014 Fiscal Year Final Research Report

A Study on Speech Synthesis with Rich Personality Based on Automatic Scoring of Reproduction of Speaker Identity

Research Project

Project/Area Number	24500223
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Research Field	Perception information processing/Intelligent robotics
Research Institution	Ritsumeikan University
Principal Investigator	YAMASHITA Yoichi 立命館大学, 情報理工学部, 教授 (80174689)
Project Period (FY)	2012-04-01 – 2015-03-31
Keywords	声質 / 個人性 / 多様性 / 韻律 / 音声合成 / 音声分析
Outline of Final Research Achievements	This research addresses measurement of personality and analysis of diversity in speech aiming at realizing speech synthesis with rich personalization. I proposed a new method for measuring the difference of voice quality based on feature parameters of speech. The similarity of voice quality is calculated by weighted Euclidean distance of MFCC parameters which represent spectrum features of speech. I analyzed the relationship between prosodic information and personality perception using synthetic speech in which phonemic information is removed but prosodic information, such as intonation, is preserved. I also analyzed various types of speech which include dialect, character voices in ‘Anime’, announcer voices, emotional voices，and so on.
Free Research Field	音声情報処理