Development of Continuous Voice Morphing Using Separated Vocal Tract Area Functions, Glottal Source Waves, and Prosodic Features

Research Project

Project/Area Number 22500145
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Perception information processing/Intelligent robotics
Research Institution University of Tsukuba

Principal Investigator

TANAKA Kazuyo  University of Tsukuba, Faculty of Library, Information and Media Science, Professor (70344207)

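Co-Investigator (Kenkyū-buntansha) MIKAWA Masahiko  University of Tsukuba, Faculty of Library, Information and Media Science, Associate Professor (40361357)
ITOH Yoshiaki  Iwate Prefectural University, Faculty of Software and Information Science, Associate Professor (90325928)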
Project Period (FY) 2010 – 2012
Project Status Completed (Fiscal Year 2012)
Budget Amount
¥4,160,000 (Direct Cost: ¥3,200,000, Indirect Cost: ¥960,000)
Fiscal Year 2012: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2011: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Fiscal Year 2010: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
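Keywords voice morphing / voice quality conversion synthesis / speech synthesis / speaker morphing / speech modification / vocal tract area function / prosody conversion / speech analysis / discrete cosine transform / prosody control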
Research Abstract

In this project, we developed a flexible voice morphing method based on conversion by a linear combination of vocal tract area functions estimated from speech signals. The method aims to keep the phonological identity continuous over the entire interpolated region. Its main features are: 1) separation of the characteristics of the vocal tract resonances from those of the glottal source waves, 2) independent morphing of the vocal tract resonance and glottal source wave characteristics, and 3) conversion of prosodic features in the DCT (discrete cosine transform) domain. We confirmed that a morphing system built on the proposed method improves both the continuity of phonological identity and the speech quality at intermediate morphing rates.
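The linear combination of area functions and the DCT-domain prosody conversion mentioned in the abstract can be illustrated with a minimal sketch. It is not the project's implementation. The function names, the `rate` and `n_coeffs` parameters, the assumption that both speakers' area functions have the same number of sections, and the assumption that the two prosodic contours are already time-aligned to equal length are all hypothetical. Estimating area functions from speech and separating the glottal source are outside its scope.

```python
import numpy as np


def morph_area_functions(area_a, area_b, rate):
    """Linearly combine two vocal tract area functions.

    area_a, area_b: cross-sectional areas per tube section (same length, assumed).
    rate: hypothetical morphing rate, 0.0 gives speaker A, 1.0 gives speaker B.
    """
    area_a = np.asarray(area_a, dtype=float)
    area_b = np.asarray(area_b, dtype=float)
    if area_a.shape != area_b.shape:
        raise ValueError("area functions must have the same number of sections")
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must lie in [0, 1]")
    return (1.0 - rate) * area_a + rate * area_b


def dct_matrix(n):
    """Return the n x n orthonormal DCT-II matrix (its transpose is its inverse)."""
    k = np.arange(n)[:, None]  # frequency index
    t = np.arange(n)[None, :]  # time index
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    m[0, :] /= np.sqrt(2.0)    # scale the DC row so that all rows have unit norm
    return m


def morph_contour_dct(contour_a, contour_b, rate, n_coeffs):
    """Interpolate two equal-length prosodic contours (e.g. F0) in the DCT domain.

    Only the first n_coeffs coefficients are kept, a hypothetical form of
    dimensionality reduction that retains the slowly varying contour shape.
    """
    a = np.asarray(contour_a, dtype=float)
    b = np.asarray(contour_b, dtype=float)
    if a.shape != b.shape or a.ndim != 1:
        raise ValueError("contours must be 1-D and of equal length")
    n = a.size
    if not 1 <= n_coeffs <= n:
        raise ValueError("n_coeffs must lie in [1, len(contour)]")
    d = dct_matrix(n)
    coeffs = (1.0 - rate) * (d @ a) + rate * (d @ b)  # interpolate coefficients
    coeffs[n_coeffs:] = 0.0                           # drop high-order terms
    return d.T @ coeffs                               # back to the time domain
```

When `n_coeffs` equals the contour length, the result is identical to plain linear interpolation of the contours, because the DCT is linear. Keeping fewer coefficients smooths the morphed contour.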

Report

(4 results)
  • 2012 Annual Research Report   Final Research Report ( PDF )
  • 2011 Annual Research Report
  • 2010 Annual Research Report
  • Research Products

    (32 results)

Journal Article (11 results, of which Peer Reviewed: 11 results) / Presentation (19 results) / Remarks (2 results)

  • [Journal Article] Continuous Voice Morphing Using Separated Vocal Tract Area Functions and Glottal Source Waves (2013)

    • Author(s)
      Kazuyo Tanaka, Yoshiki Nambu
    • Journal Title

      International Journal of Multimedia Technology

      Pages: 7-7

    • Related Report
      2012 Final Research Report
    • Peer Reviewed
  • [Journal Article] Continuous Voice Morphing Using Separated Vocal Tract Area Functions and Glottal Source Waves (2013)

    • Author(s)
      Kazuyo Tanaka, Yoshiki Nambu
    • Journal Title

      International Journal of Multimedia Technology, ISSN:2226-7875(online)

      Volume: Vol. 4

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Study on Pitch Patterns of Japanese Speakers of English in Comparison with Native Speakers of English (2012)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Journal Title

      Acoustical Science and Technology

      Volume: Vol. 33, No. 4 Pages: 247-254

    • NAID

      130001853344

    • Related Report
      2012 Final Research Report
    • Peer Reviewed
  • [Journal Article] A Study on Pitch Patterns of Japanese Speakers of English in Comparison with Native Speakers of English (2012)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Journal Title

      Acoustical Science and Technology

      Volume: Vol. 33, No. 4 Pages: 247-254

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Study on Pitch Patterns in Japanese Speakers of English with Verification by Speech Re-synthesis (2011)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: Vol.E94-D, No.12 Pages: 2495-2502

    • NAID

      10030538282

    • Related Report
      2012 Final Research Report 2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Comparative Study of Focal Lengthening in the Speech of Native Speakers and Japanese Speakers of English (2011)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka, Yoshiaki Itoh
    • Journal Title

      Acoustical Science and Technology, edited by Acoustical Society of Japan

      Volume: Vol.32, No.2 Pages: 54-61

    • NAID

      130000727451

    • Related Report
      2012 Final Research Report
    • Peer Reviewed
  • [Journal Article] An Analysis of Word Duration in Native Speakers and Japanese Speakers of English (2011)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka, Yoshiaki Itoh
    • Journal Title

      Proceedings of Interspeech 2011

      Pages: 1173-1176

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Development of Prototype Sound Direction Control System Using a Two-Dimensional Loudspeaker Array (2011)

    • Author(s)
      Yasuharu Hashimoto, Masahiko Mikawa, Kazuyo Tanaka
    • Journal Title

      Proceedings of EUSIPCO 2011

      Pages: 264-268

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Comparative Study of Focal Lengthening in the Speech of Native Speakers and Japanese Speakers of English (2011)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka, Yoshiaki Itoh
    • Journal Title

      Acoustical Science and Technology

      Volume: 32 Pages: 54-61

    • NAID

      130000727451

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A study of pitch patterns of sentence utterances by Japanese speakers of English in comparison with native speakers of English (2010)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Journal Title

      Proceedings of Interspeech Satellite Workshop : Second Language Studies

      Volume: 1 Pages: 4-4

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Flexible Voice Morphing based on Linear Combination of Multi-speakers' Vocal Tract Area Functions (2010)

    • Author(s)
      Yoshiki Nambu, Masahiko Mikawa, Kazuyo Tanaka
    • Journal Title

      Proceedings of 18th European Signal Processing Conference (EUSIPCO)

      Volume: 1 Pages: 790-794

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
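  • [Presentation] A Study of a Voice Quality Conversion Method Considering Prosodic Differences Using DCT Dimensionality Reduction of Prosodic Feature Patterns (2013)

    • Author(s)
      石 睿,田中 和世,三河 正彦,羅 志偉
    • Organizer
      Proceedings of the Acoustical Society of Japan 2013 Spring Meeting
    • Place of Presentation
      Tokyo University of Technology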
    • Year and Date
      2013-03-15
    • Related Report
      2012 Final Research Report
  • [Presentation] Experimental Evaluation of Probabilistic Similarity for Spoken Term Detection (2013)

    • Author(s)
      Shi-wook Lee, Hiroaki Kojima, Kazuyo Tanaka, Yoshiaki Itoh
    • Organizer
      Proc. of International Conference on Pattern Recognition Applications and Methods (ICPRAM) 2013
    • Place of Presentation
      Sants Hotel, Barcelona, Spain
    • Year and Date
      2013-02-17
    • Related Report
      2012 Final Research Report
  • [Presentation] Experimental Evaluation of Probabilistic Similarity for Spoken Term Detection (2013)

    • Author(s)
      Shi-wook Lee, Hiroaki Kojima, Kazuyo Tanaka, Yoshiaki Itoh
    • Organizer
      Proceedings of International Conference on Pattern Recognition Applications and Methods
    • Place of Presentation
      Sants Hotel, Barcelona, Spain
    • Related Report
      2012 Annual Research Report
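  • [Presentation] A Study of a Voice Quality Conversion Method Considering Prosodic Differences Using DCT Dimensionality Reduction of Prosodic Feature Patterns (2013)

    • Author(s)
      石 睿,田中 和世,三河 正彦,羅 志偉
    • Organizer
      Proceedings of the Acoustical Society of Japan 2013 Spring Meeting
    • Place of Presentation
      Tokyo University of Technology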
    • Related Report
      2012 Annual Research Report
  • [Presentation] Comparative Analysis of Intensity between English Speakers and Japanese Speakers of English (2012)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka, Tatsuya Kawahara
    • Organizer
      Proc. of Interspeech 2012
    • Place of Presentation
      Hilton Hotel, Portland, USA
    • Year and Date
      2012-09-11
    • Related Report
      2012 Final Research Report
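  • [Presentation] An Experimental Study on Approximating Error Estimates between Gaussian Mixture Distributions (2012)

    • Author(s)
      Shi-wook Lee, Hiroaki Kojima, Kazuyo Tanaka, Yoshiaki Itoh
    • Organizer
      Proceedings of the Acoustical Society of Japan 2012 Spring Meeting
    • Place of Presentation
      Kanagawa University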
    • Year and Date
      2012-03-15
    • Related Report
      2012 Final Research Report
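  • [Presentation] An Experimental Study on Approximating Error Estimates between Gaussian Mixture Distributions (2012)

    • Author(s)
      Shi-wook Lee, Hiroaki Kojima, Kazuyo Tanaka, Yoshiaki Itoh
    • Organizer
      Acoustical Society of Japan 2012 Spring Meeting
    • Place of Presentation
      Kanagawa University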
    • Year and Date
      2012-03-14
    • Related Report
      2011 Annual Research Report
  • [Presentation] Comparative Analysis of Intensity between English Speakers and Japanese Speakers of English (2012)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka, Tatsuya Kawahara
    • Organizer
      Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech) 2012
    • Place of Presentation
      Hilton Hotel, Portland, USA
    • Related Report
      2012 Annual Research Report
  • [Presentation] Development of Prototype Sound Direction Control System Using a Two-dimensional Loudspeaker Array (2011)

    • Author(s)
      Yasuharu Hashimoto, Masahiko Mikawa, Kazuyo Tanaka
    • Organizer
      Proc. of 19th European Signal Processing Conference (EUSIPCO) 2011
    • Place of Presentation
      Palau de Congressos, Barcelona, Spain
    • Year and Date
      2011-08-31
    • Related Report
      2012 Final Research Report
  • [Presentation] Spoken Term Detection Results Using Plural Subword Models by Estimating Detection Performance for Each Query (2011)

    • Author(s)
      Yoshiaki Itoh, Kohei Iwata, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
    • Organizer
      Proc. of Interspeech 2011
    • Place of Presentation
      Palazzo dei Congressi, Florence, Italy
    • Year and Date
      2011-08-29
    • Related Report
      2012 Final Research Report
  • [Presentation] An Experimental Analysis of Pitch Patterns in Japanese Speakers of English with Verification by Speech Re-synthesis (2011)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Organizer
      Proc. of Interspeech 2011
    • Place of Presentation
      Palazzo dei Congressi, Florence, Italy
    • Year and Date
      2011-08-29
    • Related Report
      2012 Final Research Report
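  • [Presentation] A Study of a Voice Quality Conversion Method Considering Differences in Source Characteristics Separated from Vocal Tract Characteristics (2011)

    • Author(s)
      Yoshiki Nambu, Masahiko Mikawa, Kazuyo Tanaka
    • Organizer
      Acoustical Society of Japan 2011 Spring Meeting
    • Place of Presentation
      Waseda University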
    • Year and Date
      2011-03-09
    • Related Report
      2012 Final Research Report
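  • [Presentation] A Study of a Voice Quality Conversion Method Considering Differences in Source Characteristics Separated from Vocal Tract Characteristics (2011)

    • Author(s)
      Yoshiki Nambu, Masahiko Mikawa, Kazuyo Tanaka
    • Organizer
      Acoustical Society of Japan 2011 Spring Meeting
    • Place of Presentation
      Waseda University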
    • Year and Date
      2011-03-09
    • Related Report
      2010 Annual Research Report
  • [Presentation] A study of pitch patterns of sentence utterances by Japanese speakers of English in comparison with native speakers of English (2010)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Organizer
      Proc. of Interspeech Satellite Workshop: Second Language Studies
    • Place of Presentation
      Waseda Univ., Tokyo
    • Year and Date
      2010-09-23
    • Related Report
      2012 Final Research Report
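  • [Presentation] Analysis of Power Patterns in English Sentence Utterances by Japanese Speakers (2010)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Organizer
      Acoustical Society of Japan 2010 Autumn Meeting
    • Place of Presentation
      Doshisha University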
    • Year and Date
      2010-09-16
    • Related Report
      2012 Final Research Report
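  • [Presentation] A Study of a Voice Morphing Method Focusing on Differences in Vocal Tract Length among Multiple Speakers (2010)

    • Author(s)
      Yoshiki Nambu, Masahiko Mikawa, Kazuyo Tanaka
    • Organizer
      Acoustical Society of Japan 2010 Autumn Meeting
    • Place of Presentation
      Doshisha University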
    • Year and Date
      2010-09-16
    • Related Report
      2012 Final Research Report
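  • [Presentation] Analysis of Power Patterns in English Sentence Utterances by Japanese Speakers (2010)

    • Author(s)
      Tomoko Nariai, Kazuyo Tanaka
    • Organizer
      Acoustical Society of Japan 2010 Autumn Meeting
    • Place of Presentation
      Kansai University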
    • Year and Date
      2010-09-15
    • Related Report
      2010 Annual Research Report
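  • [Presentation] A Study of a Voice Morphing Method Focusing on Differences in Vocal Tract Length among Multiple Speakers (2010)

    • Author(s)
      Yoshiki Nambu, Masahiko Mikawa, Kazuyo Tanaka
    • Organizer
      Acoustical Society of Japan 2010 Autumn Meeting
    • Place of Presentation
      Kansai University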
    • Year and Date
      2010-09-14
    • Related Report
      2010 Annual Research Report
  • [Presentation] Flexible Voice Morphing based on Linear Combination of Multi-speakers' Vocal Tract Area Functions (2010)

    • Author(s)
      Yoshiki Nambu, Masahiko Mikawa, Kazuyo Tanaka
    • Organizer
      Proc. of 18th European Signal Processing Conference (EUSIPCO) 2010
    • Place of Presentation
      Congress Center, Aalborg, Denmark
    • Year and Date
      2010-08-25
    • Related Report
      2012 Final Research Report
  • [Remarks]

    • URL

      http://www.slis.tsukuba.ac.jp/~ktanaka/

    • Related Report
      2011 Annual Research Report
  • [Remarks]

    • URL

      http://www.slis.tsukuba.ac.jp/~ktanaka/

    • Related Report
      2010 Annual Research Report

Published: 2010-08-23   Modified: 2019-07-29  