A Study on Speech Synthesis with Rich Personality Based on Automatic Scoring of Reproduction of Speaker Identity

Research Project

Project/Area Number	24500223
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	General
Research Field	Perception information processing/Intelligent robotics
Research Institution	Ritsumeikan University
Principal Investigator	YAMASHITA Yoichi, Ritsumeikan University, College of Information Science and Engineering, Professor (80174689)
Project Period (FY)	2012-04-01 – 2015-03-31
Project Status	Completed (Fiscal Year 2014)
Budget Amount	¥4,810,000 (Direct Cost: ¥3,700,000, Indirect Cost: ¥1,110,000)
	Fiscal Year 2014: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
	Fiscal Year 2013: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
	Fiscal Year 2012: ¥2,470,000 (Direct Cost: ¥1,900,000, Indirect Cost: ¥570,000)
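Keywords	voice quality / speaker individuality / diversity / prosody / speech synthesis / speech analysis / spectrum / paralinguistic information / emotion / speaker identity / weighted Euclidean distance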
Outline of Final Research Achievements	This research addresses the measurement of personality and the analysis of diversity in speech, with the aim of realizing speech synthesis with rich personalization. I proposed a new method for measuring differences in voice quality based on the feature parameters of speech: the similarity of voice quality is calculated as a weighted Euclidean distance between MFCC parameters, which represent the spectral features of speech. I analyzed the relationship between prosodic information and the perception of personality using synthetic speech from which phonemic information is removed while prosodic information, such as intonation, is preserved. I also analyzed various types of speech, including dialects, character voices in anime, announcer voices, and emotional voices.
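
As a rough illustration of the distance measure named in the outline, the sketch below computes a weighted Euclidean distance, sqrt(sum_i w_i (x_i - y_i)^2), between two feature vectors. It is a minimal example under stated assumptions, not the project's implementation: the function name, the three-dimensional vectors, and the weights are all hypothetical, and this record does not say how the weights were chosen or how the MFCC parameters were extracted.

```python
import math
from typing import Sequence


def weighted_euclidean_distance(
    x: Sequence[float], y: Sequence[float], weights: Sequence[float]
) -> float:
    """Return sqrt(sum_i w_i * (x_i - y_i)^2) for equal-length vectors."""
    if not (len(x) == len(y) == len(weights)):
        raise ValueError("x, y and weights must have the same length")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    return math.sqrt(sum(w * (a - b) ** 2 for a, b, w in zip(x, y, weights)))


# Hypothetical stand-ins for per-speaker MFCC vectors (illustrative values only).
speaker_a = [1.0, 2.0, 3.0]
speaker_b = [1.0, 4.0, 6.0]
# Hypothetical weights; the weights used in the project are not given here.
weights = [1.0, 0.25, 1.0]

distance = weighted_euclidean_distance(speaker_a, speaker_b, weights)
print(distance)
```

A smaller distance would be read as higher voice quality similarity. With all weights set to 1, the function reduces to the ordinary Euclidean distance.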

Report

(4 results)

2014 Annual Research Report
Final Research Report (PDF)
2013 Research-status Report
2012 Research-status Report

Research Products
(11 results)

Journal Article (2 results, of which Peer Reviewed: 2 results), Presentation (9 results)

[Journal Article] A review of paralinguistic information processing for natural speech communication (2013)
- Author(s)
  Yoichi Yamashita
- Journal Title
  Acoustical Science and Technology
  Volume: 34 Issue: 2 Pages: 73-79
- DOI
  10.1250/ast.34.73
- NAID
  130003360938
- ISSN
  0369-4232, 1346-3969, 1347-5177
- Related Report
  2013 Research-status Report 2012 Research-status Report
- Peer Reviewed
[Journal Article] A Generation Error Function Considering Dynamic Properties of Speech Parameters for Minimum Generation Error Training for Hidden Markov Model-based Speech Synthesis (2013)
- Author(s)
  D. Khanh Ninh, M. Morise, and Y. Yamashita
- Journal Title
  Acoustical Science and Technology
  Volume: 34 Pages: 123-132
- NAID
  130003360934
- Related Report
  2012 Research-status Report
- Peer Reviewed
[Presentation] A study of co-occurrence information in spoken term detection (2015)
- Author(s)
  小田原一成, 新妻雅弘, 山下洋一
- Organizer
  Acoustical Society of Japan 2015 Spring Meeting
- Place of Presentation
  Chuo University (Bunkyo-ku, Tokyo)
- Year and Date
  2015-03-16
- Related Report
  2014 Annual Research Report
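[Presentation] A study on improving the performance of acoustic communication using sound signals in the inaudible band (2015)
- Author(s)
  銭コウ, 森勢将雅, 新妻雅弘, 山下洋一
- Organizer
  Acoustical Society of Japan 2015 Spring Meeting
- Place of Presentation
  Chuo University (Bunkyo-ku, Tokyo)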
- Year and Date
  2015-03-16
- Related Report
  2014 Annual Research Report
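[Presentation] Identification of speaker individuality through the perception of prosodic information
- Author(s)
  摺木啓一郎, 森勢将雅, 山下洋一
- Organizer
  IEICE Technical Report, SP2013-52
- Place of Presentation
  Niigata University (Niigata Prefecture)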
- Related Report
  2013 Research-status Report
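[Presentation] Analysis of the perception of speaker individuality in prosodic information
- Author(s)
  摺木啓一郎, 森勢将雅, 山下洋一
- Organizer
  Proceedings of the 2013 Autumn Meeting of the Acoustical Society of Japan
- Place of Presentation
  Toyohashi University of Technology (Aichi Prefecture)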
- Related Report
  2013 Research-status Report
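[Presentation] Analysis of speaker individuality for paralinguistic information recognition
- Author(s)
  島川智行, 山下洋一
- Organizer
  Proceedings of the 2013 Autumn Meeting of the Acoustical Society of Japan
- Place of Presentation
  Toyohashi University of Technology (Aichi Prefecture)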
- Related Report
  2013 Research-status Report
[Presentation] Recognition of paralinguistic information for specific speakers
- Author(s)
  島川智行, 山下洋一
- Organizer
  IEICE Technical Report, SP2013-103
- Place of Presentation
  Meijo University (Aichi Prefecture)
- Related Report
  2013 Research-status Report
[Presentation] An adaptive weighting approach for minimum generation error training considering dynamic features in HMM-based speech synthesis
- Author(s)
  D. Khanh Ninh, M. Morise, and Y. Yamashita
- Organizer
  Proc. of 2012 Autumn Meeting of Acoustical Society of Japan
- Place of Presentation
  Shinshu Univ. (Nagano)
- Related Report
  2012 Research-status Report
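[Presentation] Evaluation and automatic estimation of voice quality similarity using isolated vowel utterances
- Author(s)
  辻村祥平, 森勢将雅, 山下洋一
- Organizer
  IEICE Technical Report
- Place of Presentation
  Doshisha University (Kyoto Prefecture)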
- Related Report
  2012 Research-status Report
[Presentation] Recording and labeling of dialogue speech for paralinguistic information processing
- Author(s)
  島川智行, 森勢将雅, 山下洋一
- Organizer
  IEICE Technical Report
- Place of Presentation
  Doshisha University (Kyoto Prefecture)
- Related Report
  2012 Research-status Report
