• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A study of voice conversion based on sophisticated control of speaker identity founded on tensor analysis.

Research Project

Project/Area Number 23800015
Research Category

Grant-in-Aid for Research Activity Start-up

Allocation TypeSingle-year Grants
Research Field Perception information processing/Intelligent robotics
Research InstitutionThe University of Tokyo

Principal Investigator

SAITO Daisuke  東京大学, 大学院・情報理工学系研究科, 助教 (40615150)

Project Period (FY) 2011 – 2012
Project Status Completed (Fiscal Year 2012)
Budget Amount *help
¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000)
Fiscal Year 2012: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Fiscal Year 2011: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords音声工学 / 音声合成 / 声質変換 / テンソル解析
Research Abstract

In this study、 we have developed voice conversion methods which realize sophisticated and flexible control of speaker identities. These techniques can be applied to welfare services and entertainment software. In this study, we have proposed a method to construct a speaker space using tensor analysis. In this method, various information included in speech utterances are properly decomposed, and these decomposed factors can be utilized for various applications in speech processing. As one of the applications of this method, a style conversion system from speaking style to singing style has been developed.

Report

(3 results)
  • 2012 Annual Research Report   Final Research Report ( PDF )
  • 2011 Annual Research Report
  • Research Products

    (18 results)

All 2013 2012 2011

All Journal Article (3 results) (of which Peer Reviewed: 3 results) Presentation (15 results)

  • [Journal Article] 空間写像に基づく母音と鼻子音を対象としたジェスチャー-音声変換システム2012

    • Author(s)
      國越晶,喬宇,齋藤大輔,峯松信明,広瀬啓吉
    • Journal Title

      情報処理学会論文誌

      Volume: vol.53 Pages: 2291-2301

    • NAID

      110009464377

    • Related Report
      2012 Final Research Report
    • Peer Reviewed
  • [Journal Article] Statistical voice conversion based on noisy channel model2012

    • Author(s)
      D. Saito, S. Watanabe, A. Nakamura, N.Minematsu
    • Journal Title

      IEEE Transaction on Audio, Speech and Language Processing

      Volume: 20 Pages: 1784-1794

    • Related Report
      2012 Annual Research Report 2012 Final Research Report
    • Peer Reviewed
  • [Journal Article] 空間写像に基づく母音と鼻子音を対象としたジェスチャー-音声変換システム2012

    • Author(s)
      國越晶, 喬宇, 齋藤大輔, 峯松信明, 広瀬啓吉
    • Journal Title

      情報処理学会論文誌

      Volume: 53 Pages: 2291-2301

    • NAID

      110009464377

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Presentation] Eigenvoice-based character conversion for arbitrary speakers using various character voices of a skilled voice actor2013

    • Author(s)
      T. Pongkittiphan, D. Saito, N. Minematsu, K. Hirose
    • Organizer
      RISP International Workshop on Nonlinear Circuits, Communication and Signal Processing
    • Place of Presentation
      Hawaii, USA
    • Related Report
      2012 Annual Research Report
  • [Presentation] Character conversion based on eigenvoice technique2012

    • Author(s)
      T.Pongkittiphan, N.Minematsu, D.Saito, K.Hirose
    • Organizer
      日本音響学会春季研究発表会
    • Place of Presentation
      神奈川大学,横浜
    • Year and Date
      2012-03-13
    • Related Report
      2011 Annual Research Report
  • [Presentation] 声質空間上での変換を用いた歌声らしさの転写2012

    • Author(s)
      齋藤大輔,石原達馬,橘秀幸,亀岡弘和,嵯峨山茂樹
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      信州大学,長野
    • Related Report
      2012 Final Research Report
  • [Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion2012

    • Author(s)
      D. Saito, N. Minematsu, K. Hirose
    • Organizer
      Proc. INTERSPEECH
    • Place of Presentation
      Portland, Oregon, USA.
    • Related Report
      2012 Final Research Report
  • [Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion2012

    • Author(s)
      D. Saito, N. Minematsu, K. Hirose
    • Organizer
      Annual Conference of the International Speech Communication Association (INTERSPEECH)
    • Place of Presentation
      Portland, Oregon, USA
    • Related Report
      2012 Annual Research Report
  • [Presentation] Tensor-based speaker space construction for arbitrary speaker conversion2012

    • Author(s)
      D. Saito, N. Minematsu, K. Hirose
    • Organizer
      International Conference on Signal Processing
    • Place of Presentation
      Beijing, China
    • Related Report
      2012 Annual Research Report
  • [Presentation] Eignvoice-based character conversion and its evaluation2012

    • Author(s)
      T. Pongkittiphan, D. Saito, N. Minematsu, K. Hirose
    • Organizer
      電子情報通信学会音声研究会
    • Place of Presentation
      NTT厚木研究開発センター, 神奈川
    • Related Report
      2012 Annual Research Report
  • [Presentation] 声質空間上での変換に基づく歌声らしさの転写に関する検討2012

    • Author(s)
      齋藤大輔, 石原達馬, 橘秀幸, 亀岡弘和, 嵯峨山茂樹
    • Organizer
      情報処理学会音楽情報科学研究会
    • Place of Presentation
      近江町交流プラザ, 石川
    • Related Report
      2012 Annual Research Report
  • [Presentation] テンソル表現に基づく任意話者声質変換に対する話者正規化学習の効果2012

    • Author(s)
      齋藤大輔, 峯松信明, 広瀬啓吉
    • Organizer
      電子情報通信学会音声研究会
    • Place of Presentation
      東北工業大学, 仙台
    • Related Report
      2012 Annual Research Report
  • [Presentation] テンソル表現に基づく任意話者声質変換における話者正規化学習の検討2012

    • Author(s)
      齋藤大輔, 峯松信明, 広瀬啓吉
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      信州大学, 長野
    • Related Report
      2012 Annual Research Report
  • [Presentation] 声質空間上での変換を用いた歌声らしさの転写2012

    • Author(s)
      齋藤大輔, 石原達馬, 橘秀幸, 亀岡弘和, 嵯峨山茂樹
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      信州大学, 長野
    • Related Report
      2012 Annual Research Report
  • [Presentation] 話者空間のテンソル表現に基づく任意話者声質変換2011

    • Author(s)
      齋藤大輔, 山本敬介, 峯松信明, 広瀬啓吉
    • Organizer
      電子情報通信学会音声研究会
    • Place of Presentation
      九州大学,福岡
    • Year and Date
      2011-11-28
    • Related Report
      2011 Annual Research Report
  • [Presentation] 話者空間のテンソル表現を用いた-対多声質変換2011

    • Author(s)
      齋藤大輔, 山本敬介, 峯松信明, 広瀬啓吉
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      島根大学,松江
    • Year and Date
      2011-09-14
    • Related Report
      2011 Annual Research Report
  • [Presentation] One-to-many voice conversion based on tensor representation of speaker space2011

    • Author(s)
      D. Saito, K. Yamamoto, N. Minematsu, K.Hirose
    • Organizer
      Proc. INTERSPEECH
    • Place of Presentation
      Florence, Italy.
    • Year and Date
      2011-08-30
    • Related Report
      2012 Final Research Report
  • [Presentation] One-to-many voice conversion based on tensor representation of speaker space2011

    • Author(s)
      D.Saito, K.Yamamoto, N.Minematsu, K.Hirose
    • Organizer
      INTERSPEECH
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2011-08-30
    • Related Report
      2011 Annual Research Report

URL: 

Published: 2011-09-05   Modified: 2019-07-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi