A study of voice conversion based on sophisticated control of speaker identity founded on tensor analysis.

Research Project

Project/Area Number	23800015
Research Category	Grant-in-Aid for Research Activity Start-up
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	The University of Tokyo
Principal Investigator	SAITO Daisuke 東京大学, 大学院・情報理工学系研究科, 助教 (40615150)
Project Period (FY)	2011 – 2012
Project Status	Completed (Fiscal Year 2012)
Budget Amount *help	¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000) Fiscal Year 2012: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000) Fiscal Year 2011: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords	音声工学 / 音声合成 / 声質変換 / テンソル解析
Research Abstract	In this study、 we have developed voice conversion methods which realize sophisticated and flexible control of speaker identities. These techniques can be applied to welfare services and entertainment software. In this study, we have proposed a method to construct a speaker space using tensor analysis. In this method, various information included in speech utterances are properly decomposed, and these decomposed factors can be utilized for various applications in speech processing. As one of the applications of this method, a style conversion system from speaking style to singing style has been developed.

Report

(3 results)

2012 Annual Research Report Final Research Report ( PDF )
2011 Annual Research Report

Research Products
(18 results)

All 2013 2012 2011

All Journal Article (3 results) (of which Peer Reviewed: 3 results) Presentation (15 results)

[Journal Article] 空間写像に基づく母音と鼻子音を対象としたジェスチャー-音声変換システム2012
- Author(s)
  國越晶,喬宇,齋藤大輔,峯松信明,広瀬啓吉
- Journal Title
  
  情報処理学会論文誌
  
  Volume: vol.53 Pages: 2291-2301
- NAID
  110009464377
- Related Report
  2012 Final Research Report
- Peer Reviewed
[Journal Article] Statistical voice conversion based on noisy channel model2012
- Author(s)
  D. Saito, S. Watanabe, A. Nakamura, N.Minematsu
- Journal Title
  
  IEEE Transaction on Audio, Speech and Language Processing
  
  Volume: 20 Pages: 1784-1794
- Related Report
  2012 Annual Research Report 2012 Final Research Report
- Peer Reviewed
[Journal Article] 空間写像に基づく母音と鼻子音を対象としたジェスチャー－音声変換システム2012
- Author(s)
  國越晶, 喬宇, 齋藤大輔, 峯松信明, 広瀬啓吉
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 53 Pages: 2291-2301
- NAID
  110009464377
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Presentation] Eigenvoice-based character conversion for arbitrary speakers using various character voices of a skilled voice actor2013
- Author(s)
  T. Pongkittiphan, D. Saito, N. Minematsu, K. Hirose
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communication and Signal Processing
- Place of Presentation
  Hawaii, USA
- Related Report
  2012 Annual Research Report
[Presentation] Character conversion based on eigenvoice technique2012
- Author(s)
  T.Pongkittiphan, N.Minematsu, D.Saito, K.Hirose
- Organizer
  日本音響学会春季研究発表会
- Place of Presentation
  神奈川大学,横浜
- Year and Date
  2012-03-13
- Related Report
  2011 Annual Research Report
[Presentation] 声質空間上での変換を用いた歌声らしさの転写2012
- Author(s)
  齋藤大輔,石原達馬,橘秀幸,亀岡弘和,嵯峨山茂樹
- Organizer
  日本音響学会秋季研究発表会
- Place of Presentation
  信州大学,長野
- Related Report
  2012 Final Research Report
[Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion2012
- Author(s)
  D. Saito, N. Minematsu, K. Hirose
- Organizer
  Proc. INTERSPEECH
- Place of Presentation
  Portland, Oregon, USA.
- Related Report
  2012 Final Research Report
[Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion2012
- Author(s)
  D. Saito, N. Minematsu, K. Hirose
- Organizer
  Annual Conference of the International Speech Communication Association (INTERSPEECH)
- Place of Presentation
  Portland, Oregon, USA
- Related Report
  2012 Annual Research Report
[Presentation] Tensor-based speaker space construction for arbitrary speaker conversion2012
- Author(s)
  D. Saito, N. Minematsu, K. Hirose
- Organizer
  International Conference on Signal Processing
- Place of Presentation
  Beijing, China
- Related Report
  2012 Annual Research Report
[Presentation] Eignvoice-based character conversion and its evaluation2012
- Author(s)
  T. Pongkittiphan, D. Saito, N. Minematsu, K. Hirose
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  NTT厚木研究開発センター, 神奈川
- Related Report
  2012 Annual Research Report
[Presentation] 声質空間上での変換に基づく歌声らしさの転写に関する検討2012
- Author(s)
  齋藤大輔, 石原達馬, 橘秀幸, 亀岡弘和, 嵯峨山茂樹
- Organizer
  情報処理学会音楽情報科学研究会
- Place of Presentation
  近江町交流プラザ, 石川
- Related Report
  2012 Annual Research Report
[Presentation] テンソル表現に基づく任意話者声質変換に対する話者正規化学習の効果2012
- Author(s)
  齋藤大輔, 峯松信明, 広瀬啓吉
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  東北工業大学, 仙台
- Related Report
  2012 Annual Research Report
[Presentation] テンソル表現に基づく任意話者声質変換における話者正規化学習の検討2012
- Author(s)
  齋藤大輔, 峯松信明, 広瀬啓吉
- Organizer
  日本音響学会秋季研究発表会
- Place of Presentation
  信州大学, 長野
- Related Report
  2012 Annual Research Report
[Presentation] 声質空間上での変換を用いた歌声らしさの転写2012
- Author(s)
  齋藤大輔, 石原達馬, 橘秀幸, 亀岡弘和, 嵯峨山茂樹
- Organizer
  日本音響学会秋季研究発表会
- Place of Presentation
  信州大学, 長野
- Related Report
  2012 Annual Research Report
[Presentation] 話者空間のテンソル表現に基づく任意話者声質変換2011
- Author(s)
  齋藤大輔, 山本敬介, 峯松信明, 広瀬啓吉
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  九州大学,福岡
- Year and Date
  2011-11-28
- Related Report
  2011 Annual Research Report
[Presentation] 話者空間のテンソル表現を用いた-対多声質変換2011
- Author(s)
  齋藤大輔, 山本敬介, 峯松信明, 広瀬啓吉
- Organizer
  日本音響学会秋季研究発表会
- Place of Presentation
  島根大学,松江
- Year and Date
  2011-09-14
- Related Report
  2011 Annual Research Report
[Presentation] One-to-many voice conversion based on tensor representation of speaker space2011
- Author(s)
  D. Saito, K. Yamamoto, N. Minematsu, K.Hirose
- Organizer
  Proc. INTERSPEECH
- Place of Presentation
  Florence, Italy.
- Year and Date
  2011-08-30
- Related Report
  2012 Final Research Report
[Presentation] One-to-many voice conversion based on tensor representation of speaker space2011
- Author(s)
  D.Saito, K.Yamamoto, N.Minematsu, K.Hirose
- Organizer
  INTERSPEECH
- Place of Presentation
  Florence, Italy
- Year and Date
  2011-08-30
- Related Report
  2011 Annual Research Report

A study of voice conversion based on sophisticated control of speaker identity founded on tensor analysis.

Principal Investigator

SAITO Daisuke 東京大学, 大学院・情報理工学系研究科, 助教 (40615150)

¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000)

Report

Research Products

[Journal Article] 空間写像に基づく母音と鼻子音を対象としたジェスチャー-音声変換システム2012

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Statistical voice conversion based on noisy channel model2012

Author(s)

Journal Title

Related Report

[Journal Article] 空間写像に基づく母音と鼻子音を対象としたジェスチャー－音声変換システム2012

Author(s)

Journal Title

NAID

Related Report

[Presentation] Eigenvoice-based character conversion for arbitrary speakers using various character voices of a skilled voice actor2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Character conversion based on eigenvoice technique2012

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 声質空間上での変換を用いた歌声らしさの転写2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Tensor-based speaker space construction for arbitrary speaker conversion2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Eignvoice-based character conversion and its evaluation2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 声質空間上での変換に基づく歌声らしさの転写に関する検討2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] テンソル表現に基づく任意話者声質変換に対する話者正規化学習の効果2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] テンソル表現に基づく任意話者声質変換における話者正規化学習の検討2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 声質空間上での変換を用いた歌声らしさの転写2012

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 話者空間のテンソル表現に基づく任意話者声質変換2011

Author(s)

Organizer

Place of Presentation