2012 Fiscal Year Final Research Report

A study of voice conversion based on sophisticated control of speaker identity founded on tensor analysis.

Research Project

Project/Area Number 23800015
Research Category

Grant-in-Aid for Research Activity Start-up

Allocation Type Single-year Grants
Research Field Perception information processing/Intelligent robotics
Research Institution The University of Tokyo

Principal Investigator

SAITO Daisuke  The University of Tokyo, Graduate School of Information Science and Technology, Assistant Professor (40615150)

Project Period (FY) 2011 – 2012
Keywords speech engineering / speech synthesis / voice conversion / tensor analysis
Research Abstract

In this study, we developed voice conversion methods that enable sophisticated and flexible control of speaker identity. These techniques can be applied to welfare services and entertainment software. We proposed a method for constructing a speaker space using tensor analysis. In this method, the various kinds of information contained in speech utterances are properly decomposed, and the resulting factors can be used in a range of speech processing applications. As one such application, we developed a system that converts speaking style into singing style.

  • Research Products

    (5 results)


Journal Article (2 results) (of which Peer Reviewed: 2 results) / Presentation (3 results)

  • [Journal Article] Gesture-to-speech conversion system for vowels and nasal consonants based on space mapping (2012)

    • Author(s)
      A. Kunikoshi, Y. Qiao, D. Saito, N. Minematsu, K. Hirose
    • Journal Title

      IPSJ Journal

      Volume: 53 Pages: 2291-2301

    • Peer Reviewed
  • [Journal Article] Statistical voice conversion based on noisy channel model (2012)

    • Author(s)
      D. Saito, S. Watanabe, A. Nakamura, N. Minematsu
    • Journal Title

      IEEE Transactions on Audio, Speech, and Language Processing

      Volume: 20 Pages: 1784-1794

    • Peer Reviewed
  • [Presentation] Transfer of singing-voice characteristics using conversion in a voice quality space (2012)

    • Author(s)
      D. Saito, T. Ishihara, H. Tachibana, H. Kameoka, S. Sagayama
    • Organizer
      Acoustical Society of Japan Autumn Meeting
    • Place of Presentation
      Shinshu University, Nagano
    • Year and Date
      2012-09-19 – 2012-09-21
  • [Presentation] Effects of speaker adaptive training on tensor-based arbitrary speaker conversion (2012)

    • Author(s)
      D. Saito, N. Minematsu, K. Hirose
    • Organizer
      Proc. INTERSPEECH
    • Place of Presentation
      Portland, Oregon, USA.
    • Year and Date
      2012-09-09 – 2012-09-13
  • [Presentation] One-to-many voice conversion based on tensor representation of speaker space (2011)

    • Author(s)
      D. Saito, K. Yamamoto, N. Minematsu, K. Hirose
    • Organizer
      Proc. INTERSPEECH
    • Place of Presentation
      Florence, Italy.
    • Year and Date
      2011-08-30


Published: 2014-08-29  
