2012 Fiscal Year Final Research Report

A study of voice conversion based on sophisticated control of speaker identity founded on tensor analysis.

Research Project

Project/Area Number	23800015
Research Category	Grant-in-Aid for Research Activity Start-up
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	The University of Tokyo
Principal Investigator	SAITO Daisuke, The University of Tokyo, Graduate School of Information Science and Technology, Assistant Professor (40615150)
Project Period (FY)	2011 – 2012
Keywords	speech engineering / speech synthesis / voice conversion / tensor analysis
Research Abstract	In this study, we developed voice conversion methods that enable sophisticated and flexible control of speaker identity. These techniques can be applied to welfare services and entertainment software. We proposed a method for constructing a speaker space using tensor analysis. In this method, the various kinds of information contained in speech utterances are properly decomposed, and the resulting factors can be used in a range of speech processing applications. As one such application, we developed a style conversion system that converts speaking style into singing style.

Research Products
(5 results)

Journal Article (2 results, of which Peer Reviewed: 2 results) / Presentation (3 results)

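[Journal Article] Gesture-to-speech conversion system for vowels and nasal consonants based on space mapping (2012)
- Author(s)
  A. Kunikoshi, Y. Qiao, D. Saito, N. Minematsu, K. Hirose
- Journal Title
  IPSJ Journal (Information Processing Society of Japan)
  Volume: 53 Pages: 2291-2301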
- Peer Reviewed
[Journal Article] Statistical voice conversion based on noisy channel model (2012)
- Author(s)
  D. Saito, S. Watanabe, A. Nakamura, N. Minematsu
- Journal Title
  IEEE Transactions on Audio, Speech, and Language Processing
  Volume: 20 Pages: 1784-1794
- Peer Reviewed
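[Presentation] Transfer of singing-voice characteristics using conversion in a voice-quality space (2012)
- Author(s)
  D. Saito, T. Ishihara, H. Tachibana, H. Kameoka, S. Sagayama
- Organizer
  Acoustical Society of Japan (ASJ) Autumn Meeting
- Place of Presentation
  Shinshu University, Nagano
- Year and Date
  2012-09-19 to 2012-09-21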
[Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion (2012)
- Author(s)
  D. Saito, N. Minematsu, K. Hirose
- Organizer
  Proc. INTERSPEECH
- Place of Presentation
  Portland, Oregon, USA
- Year and Date
  2012-09-09 to 2012-09-13
[Presentation] One-to-many voice conversion based on tensor representation of speaker space (2011)
- Author(s)
  D. Saito, K. Yamamoto, N. Minematsu, K. Hirose
- Organizer
  Proc. INTERSPEECH
- Place of Presentation
  Florence, Italy
- Year and Date
  2011-08-30