• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A study on speech diversification techniques based on corpus design for advanced humanoid speech synthesis

Research Project

Project/Area Number 23700195
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation TypeMulti-year Fund
Research Field Perception information processing/Intelligent robotics
Research InstitutionTohoku University (2013)
Tokyo Institute of Technology (2011-2012)

Principal Investigator

NOSE Takashi  東北大学, 工学(系)研究科(研究院), 講師 (90550591)

Project Period (FY) 2011 – 2012
Project Status Completed (Fiscal Year 2013)
Budget Amount *help
¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)
Fiscal Year 2012: ¥2,080,000 (Direct Cost: ¥1,600,000、Indirect Cost: ¥480,000)
Fiscal Year 2011: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000)
Keywords音声合成 / 隠れマルコフモデル / 統計的音声合成 / 感情音声合成 / ヒューマノイドロボット / 音声コーパス / 統計モデル / 感情音声 / コーパスデザイン / 話し言葉音声合成 / HMM音声合成 / 対話音声合成 / 音声コーパス設計 / 音声パラメータ生成 / スタイル変換 / 歌声合成
Research Abstract

Our goal in this research is to realize more human-like, natural text-to-speech system with various emotional expressions and speaking styles, and the achievements of our studies are as follows:
(1)We proposed a novel corpus-design technique in which accent, style, and sentence-final expression are taken into account. (2)We incorporated user's subjective emotional intensities into acoustic model training to improve the performance of expressive speech synthesis. (3)We proposed an automatic labeling technique of emphasis expression using a parameter generation technique of fundamental frequency to realize emphatic speech synthesis. (4)We proposed cross-lingual speech synthesis using only a target speaker's native language speech samples to synthesis multi-lingual speech at a low cost.

Report

(4 results)
  • 2013 Annual Research Report   Final Research Report ( PDF )
  • 2012 Research-status Report
  • 2011 Research-status Report
  • Research Products

    (106 results)

All 2014 2013 2012 2011 Other

All Journal Article (56 results) (of which Peer Reviewed: 32 results) Presentation (50 results) (of which Invited: 1 results)

  • [Journal Article] Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis2014

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Journal Title

      Speech Communication

      Volume: Vol.57 Pages: 144-154

    • DOI

      10.1016/j.specom.2013.09.014

    • Related Report
      2013 Annual Research Report 2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価2014

    • Author(s)
      長濱大樹, 能勢 隆, 郡山知樹, 小林隆夫
    • Journal Title

      日本音響学会2014年春季研究発表会講演論文集

      Volume: vol.1 Pages: 413-414

    • Related Report
      2013 Annual Research Report
  • [Journal Article] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価2014

    • Author(s)
      荒生侑介, 能勢 隆, 郡山知樹, 篠崎隆宏, 小林隆夫
    • Journal Title

      日本音響学会2014年春季研究発表会講演論文集

      Volume: vol.1 Pages: 405-406

    • Related Report
      2013 Annual Research Report
  • [Journal Article] Robust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis2013

    • Author(s)
      Tomohiro Nagata, Hiroki Mori, Takashi Nose
    • Journal Title

      Proceedings of 14th Annual Conference of the International Speech Communication Association (ISCA)

      Pages: 1549-1553

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Statistical nonparametric speech synthesis using sparse Gaussian processes2013

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 14th Annual Conference of the International Speech Communication Association (ISCA)

      Pages: 1072-1076

    • NAID

      120006702716

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013

    • Author(s)
      Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
    • Journal Title

      Proceedings of 14th Annual Conference of the International Speech Communication Association (ISCA)

      Pages: 378-382

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing

      Pages: 8007-8011

    • NAID

      120006702668

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013

    • Author(s)
      Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing

      Pages: 7864-7868

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Journal Title

      Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing

      Pages: 7859-7863

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model2013

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Speech Communication

      Volume: Vol.55, No.2 Issue: 2 Pages: 347-357

    • DOI

      10.1016/j.specom.2012.09.003

    • Related Report
      2013 Final Research Report 2012 Research-status Report
    • Peer Reviewed
  • [Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013

    • Author(s)
      Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013

      Volume: vol.1 Pages: 7864-7867

    • Related Report
      2013 Annual Research Report
    • Peer Reviewed
  • [Journal Article] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Journal Title

      Proc. 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013

      Volume: vol.1 Pages: 7859-7863

    • Related Report
      2013 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013

    • Author(s)
      Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
    • Journal Title

      Proc. 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013

      Volume: vol.1 Pages: 378-382

    • Related Report
      2013 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討2013

    • Author(s)
      荒生侑介, 能勢 隆, 篠崎隆宏, 小林隆夫
    • Journal Title

      日本音響学会2013年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 351-352

    • Related Report
      2013 Annual Research Report
  • [Journal Article] 統計モデルに基づく音声合成における話者・スタイルの多様化2013

    • Author(s)
      能勢 隆
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: Vol.112, No.422 Pages: 67-72

    • Related Report
      2012 Research-status Report
  • [Journal Article] 任意話者の多様なスタイル生成のための話者正規化スタイル変換法の検討2013

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: Vol.112, No.422 Pages: 73-78

    • Related Report
      2012 Research-status Report
  • [Journal Article] 多様な歌声合成のための重回帰HSMMに基づくスタイル制御法の検討2013

    • Author(s)
      能勢 隆, 金本美沙, 郡山知樹, 小林隆夫
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: Vol.112, No.422 Pages: 79-84

    • Related Report
      2012 Research-status Report
  • [Journal Article] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検討2013

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2013年春季研究発表会講演論文集

      Volume: vol.1 Pages: 271-272

    • NAID

      120006702626

    • Related Report
      2012 Research-status Report
  • [Journal Article] HMM音声合成における話者正規化学習を用いたスタイル変換法の評価2013

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2013年春季研究発表会講演論文集

      Volume: vol.1 Pages: 295-296

    • Related Report
      2012 Research-status Report
  • [Journal Article] 対話音声合成のための音韻・韻律コンテキストを考慮した音声コーパス構築法の検討2013

    • Author(s)
      荒生侑介, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2013年春季研究発表会講演論文集

      Volume: vol.1 Pages: 499-500

    • Related Report
      2012 Research-status Report
  • [Journal Article] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 13th Annual Conference of the International Speech Communication Association (ISCA)

      Pages: 1151-1154

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Discontinuous observation HMM for prosodic-event-based F0 generation2012

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 13th Annual Conference of the International Speech Communication Association (ISCA)

      Pages: 462-465

    • NAID

      120006702590

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] An F0 modeling technique based on prosodic events for spontaneous speech synthesis2012

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012)

      Pages: 4589-4593

    • NAID

      120006702508

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] HMM に基づく対話音声合成における多様な韻律生成のためのコンテクストの拡張2012

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Journal Title

      電子情報通信学会論文誌

      Volume: Vol.J95-D, No.3 Pages: 597-607

    • NAID

      110009418768

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols2012

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Speech Communication

      Volume: 54 Issue: 3 Pages: 384-392

    • DOI

      10.1016/j.specom.2011.10.002

    • Related Report
      2013 Final Research Report 2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis2012

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Journal Title

      Speech Communication

      Volume: Vol.54, No.2 Issue: 2 Pages: 245-255

    • DOI

      10.1016/j.specom.2011.08.006

    • Related Report
      2013 Final Research Report 2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] Discontinuous observation HMM for prosodic-event-based F0 generation2012

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012

      Volume: vol.1 Pages: 462-465

    • NAID

      120006702590

    • Related Report
      2012 Research-status Report
    • Peer Reviewed
  • [Journal Article] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012

      Volume: vol.1 Pages: 1151-1154

    • Related Report
      2012 Research-status Report
    • Peer Reviewed
  • [Journal Article] HMM音声合成のための局所的系列内変動を考慮したパラメータ生成の検討2012

    • Author(s)
      能勢 隆, ワータヤー・チュンウィジター, 小林隆夫
    • Journal Title

      日本音響学会2012年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 277-278

    • Related Report
      2012 Research-status Report
  • [Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の検討2012

    • Author(s)
      能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2012年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 279-280

    • Related Report
      2012 Research-status Report
  • [Journal Article] HMM音声合成における不特定話者スタイル変換のための話者正規化学習法の検討2012

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2012年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 431-432

    • Related Report
      2012 Research-status Report
  • [Journal Article] HMM音声合成におけるスペクトル特徴量の局所変動のモデル化とパラメータ生成への適用2012

    • Author(s)
      能勢 隆, ワータヤー・チュンウィジター, 小林隆夫
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: Vol.112, No.81 Pages: 43-48

    • NAID

      110009642342

    • Related Report
      2012 Research-status Report
  • [Journal Article] HMMに基づく対話音声合成における多様な韻律生成のためのコンテクストの拡張2012

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Journal Title

      電子情報通信学会論文誌

      Volume: vol.J95-D, no.3 Pages: 597-607

    • NAID

      110009418768

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] 合成音声のスタイル制御における系列内変動を考慮したスペクトル・韻律パラメータの生成2012

    • Author(s)
      能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2012年春季研究発表会講演論文集

      Volume: vol.1 Pages: 307-308

    • Related Report
      2011 Research-status Report
  • [Journal Article] 観測値の不連続性を考慮したHMMに基づくF0モデル化の検討2012

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2012年春季研究発表会講演論文集

      Volume: vol.1 Pages: 305-306

    • NAID

      120006702505

    • Related Report
      2011 Research-status Report
  • [Journal Article] Recent development of HMM-based expressive speech synthesis and its applications2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 2011 Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference

    • URL

      http://www.apsipa.org/proceedings_2011/pdf/APSIPA189.pdf

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Speaker-independent HMM-based voice conversion using adaptive quantization of the fundamental frequency2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Speech Communication

      Volume: Vol.53, No.7 Issue: 7 Pages: 973-985

    • DOI

      10.1016/j.specom.2011.05.001

    • Related Report
      2013 Final Research Report 2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)

      Pages: 2657-2660

    • NAID

      120006702435

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Performance prediction of speech recognition using average-voice-based speech synthesis2011

    • Author(s)
      Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
    • Journal Title

      Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)

      Pages: 1953-1956

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] HMM-based emphatic speech synthesis using unsupervised context labeling2011

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Journal Title

      Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)

      Pages: 1849-185

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)

      Pages: 109-112

    • Related Report
      2013 Final Research Report
    • Peer Reviewed
  • [Journal Article] Very low bit-rate F0 coding for phonetic vocoder using MSD-HMM with quantized F0 context2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011

      Volume: vol.1 Pages: 5236-5239

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis2011

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011

      Volume: vol.1 Pages: 4708-4711

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011

      Volume: vol.1 Pages: 2657-2660

    • NAID

      120006702435

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] Performance prediction of speech recognition using average-voice-based speech synthesis2011

    • Author(s)
      Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
    • Journal Title

      Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011

      Volume: vol.1 Pages: 1953-1956

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] HMM-based emphatic speech synthesis using unsupervised context labeling2011

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Journal Title

      Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011

      Volume: vol.1 Pages: 1849-1852

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Journal Title

      Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011

      Volume: vol.1 Pages: 109-112

    • Related Report
      2011 Research-status Report
    • Peer Reviewed
  • [Journal Article] 多様な音声合成のための強調コンテキストの自動付与の検討2011

    • Author(s)
      前野 悠, 能勢 隆, 小林 隆夫, 井島 勇祐, 中嶋 秀治, 水野 秀之, 吉岡 理
    • Journal Title

      日本音響学会2011年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 335-336

    • Related Report
      2011 Research-status Report
  • [Journal Article] 対話音声合成のためのイントネーションラベルのタイミング予測2011

    • Author(s)
      郡山 知樹, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2011年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 333-334

    • Related Report
      2011 Research-status Report
  • [Journal Article] 感情音声合成における主観的表出度合のモデル化と制御の検討2011

    • Author(s)
      能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2011年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 329-330

    • Related Report
      2011 Research-status Report
  • [Journal Article] 音声合成を用いた音声認識性能予測 -残響と騒音が存在する環境での評価-2011

    • Author(s)
      太刀岡 勇気, 堀井 昭男, 岩崎 知弘, 斉藤 辰彦, 能勢 隆, 小林隆夫
    • Journal Title

      日本音響学会2011年秋季研究発表会講演論文集

      Volume: vol.1 Pages: 9-10

    • Related Report
      2011 Research-status Report
  • [Journal Article] 日本語話し言葉コーパスを用いた対話音声合成のためのコンテキストの評価2011

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: vol.111, no.28 Pages: 155-160

    • NAID

      110008725586

    • Related Report
      2011 Research-status Report
  • [Journal Article] HMM音声合成のための動的特徴量を用いた音素継続長モデリングの検討2011

    • Author(s)
      能勢 隆, 小林隆夫
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: vol.111, no.365 Pages: 197-202

    • NAID

      10031110919

    • Related Report
      2011 Research-status Report
  • [Journal Article] HMM音声合成における不特定話者スタイル変換の検討2011

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: vol.111, no.365 Pages: 191-196

    • NAID

      10031110896

    • Related Report
      2011 Research-status Report
  • [Journal Article] 韻律イベントHMMを用いた対話音声F0生成2011

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫,
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: vol.111, no.365 Pages: 185-190

    • NAID

      10031110881

    • Related Report
      2011 Research-status Report
  • [Journal Article] パラ言語情報を表現可能な対話音声合成のための重回帰HSMMの検討2011

    • Author(s)
      永田智洋, 森 大毅, 能勢 隆
    • Journal Title

      電子情報通信学会技術研究報告

      Volume: vol.111, no.365 Pages: 179-184

    • NAID

      10031110871

    • Related Report
      2011 Research-status Report
  • [Presentation] Robust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis2013

    • Author(s)
      Tomohiro Nagata, Hiroki Mori, Takashi Nose
    • Organizer
      INTERSPEECH 2013
    • Place of Presentation
      Lyon, France
    • Year and Date
      2013-08-27
    • Related Report
      2013 Final Research Report
  • [Presentation] Statistical nonparametric speech synthesis using sparse Gaussian processes2013

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      INTERSPEECH 2013
    • Place of Presentation
      Lyon, France
    • Year and Date
      2013-08-27
    • Related Report
      2013 Final Research Report
  • [Presentation] A style control technique for singing voice synthesis based on multiple-regression HSMM2013

    • Author(s)
      Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
    • Organizer
      INTERSPEECH 2013
    • Place of Presentation
      Lyon, France
    • Year and Date
      2013-08-26
    • Related Report
      2013 Final Research Report
  • [Presentation] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      ICASSP 2013
    • Place of Presentation
      Vancouver, Canada
    • Year and Date
      2013-05-31
    • Related Report
      2013 Final Research Report
  • [Presentation] Speaker-independent style conversion for HMM-based expressive speech synthesis2013

    • Author(s)
      Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
    • Organizer
      ICASSP 2013
    • Place of Presentation
      Vancouver, Canada
    • Year and Date
      2013-05-31
    • Related Report
      2013 Final Research Report
  • [Presentation] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Organizer
      ICASSP 2013
    • Place of Presentation
      Vancouver, Canada
    • Year and Date
      2013-05-31
    • Related Report
      2013 Final Research Report
  • [Presentation] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Organizer
      INTERSPEECH 2012
    • Place of Presentation
      Portland, USA
    • Year and Date
      2012-09-11
    • Related Report
      2013 Final Research Report
  • [Presentation] Discontinuous observation HMM for prosodic-event-based F0 generation2012

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      INTERSPEECH 2012
    • Place of Presentation
      Portland, USA
    • Year and Date
      2012-09-10
    • Related Report
      2013 Final Research Report
  • [Presentation] An F0 modeling technique based on prosodic events for spontaneous speech synthesis2012

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      ICASSP 2012
    • Place of Presentation
      Kyoto, Japan
    • Year and Date
      2012-03-29
    • Related Report
      2013 Final Research Report
  • [Presentation] An F0 modeling technique based on prosodic events for spontaneous speech synthesis2012

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
    • Place of Presentation
      Kyoto, Japan
    • Related Report
      2011 Research-status Report
  • [Presentation] 合成音声のスタイル制御における系列内変動を考慮したスペクトル・韻律パラメータの生成2012

    • Author(s)
      能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2012年春季研究発表会
    • Place of Presentation
      神奈川大学
    • Related Report
      2011 Research-status Report
  • [Presentation] 観測値の不連続性を考慮したHMMに基づくF0モデル化の検討2012

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2012年春季研究発表会
    • Place of Presentation
      神奈川大学
    • Related Report
      2011 Research-status Report
  • [Presentation] Recent development of HMM-based expressive speech synthesis and its applications2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Organizer
      APSIPA ASC 2011
    • Place of Presentation
      Xi'an, China
    • Year and Date
      2011-10-19
    • Related Report
      2013 Final Research Report
  • [Presentation] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2011-08-30
    • Related Report
      2013 Final Research Report
  • [Presentation] Performance prediction of speech recognition using average-voice-based speech synthesis2011

    • Author(s)
      Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
    • Organizer
      INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2011-08-29
    • Related Report
      2013 Final Research Report
  • [Presentation] HMM-based emphatic speech synthesis using unsupervised context labeling2011

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Organizer
      INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2011-08-29
    • Related Report
      2013 Final Research Report
  • [Presentation] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Organizer
      INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2011-08-28
    • Related Report
      2013 Final Research Report
  • [Presentation] Very low bit-rate F0 coding for phonetic vocoder using MSD-HMM with quantized F0 context2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Organizer
      2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, CASSP 2011
    • Place of Presentation
      Prague, Czech Republic
    • Related Report
      2011 Research-status Report
  • [Presentation] Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis2011

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Organizer
      2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, CASSP 2011
    • Place of Presentation
      Prague, Czech Republic
    • Related Report
      2011 Research-status Report
  • [Presentation] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Related Report
      2011 Research-status Report
  • [Presentation] Performance prediction of speech recognition using average-voice-based speech synthesis2011

    • Author(s)
      Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
    • Organizer
      12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Related Report
      2011 Research-status Report
  • [Presentation] HMM-based emphatic speech synthesis using unsupervised context labeling2011

    • Author(s)
      Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
    • Organizer
      12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Related Report
      2011 Research-status Report
  • [Presentation] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011

    • Author(s)
      Takashi Nose, Takao Kobayashi
    • Organizer
      12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
    • Place of Presentation
      Florence, Italy
    • Related Report
      2011 Research-status Report
  • [Presentation] 多様な音声合成のための強調コンテキストの自動付与の検討2011

    • Author(s)
      前野 悠, 能勢 隆, 小林 隆夫, 井島 勇祐, 中嶋 秀治, 水野 秀之, 吉岡 理
    • Organizer
      日本音響学会2011年秋季研究発表会
    • Place of Presentation
      島根大学
    • Related Report
      2011 Research-status Report
  • [Presentation] 対話音声合成のためのイントネーションラベルのタイミング予測2011

    • Author(s)
      郡山 知樹, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2011年秋季研究発表会
    • Place of Presentation
      島根大学
    • Related Report
      2011 Research-status Report
  • [Presentation] 感情音声合成における主観的表出度合のモデル化と制御の検討2011

    • Author(s)
      能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2011年秋季研究発表会
    • Place of Presentation
      島根大学
    • Related Report
      2011 Research-status Report
  • [Presentation] 音声合成を用いた音声認識性能予測 -残響と騒音が存在する環境での評価-2011

    • Author(s)
      太刀岡 勇気, 堀井 昭男, 岩崎 知弘, 斉藤 辰彦, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2011年秋季研究発表会
    • Place of Presentation
      島根大学
    • Related Report
      2011 Research-status Report
  • [Presentation] 日本語話し言葉コーパスを用いた対話音声合成のためのコンテキストの評価2011

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Organizer
      音声研究会
    • Place of Presentation
      立命館大学
    • Related Report
      2011 Research-status Report
  • [Presentation] HMM音声合成のための動的特徴量を用いた音素継続長モデリングの検討2011

    • Author(s)
      能勢 隆, 小林隆夫
    • Organizer
      音声言語シンポジウム
    • Place of Presentation
      芝浦工業大学
    • Related Report
      2011 Research-status Report
  • [Presentation] HMM音声合成における不特定話者スタイル変換の検討2011

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Organizer
      音声言語シンポジウム
    • Place of Presentation
      芝浦工業大学
    • Related Report
      2011 Research-status Report
  • [Presentation] 韻律イベントHMMを用いた対話音声F0生成2011

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Organizer
      音声言語シンポジウム
    • Place of Presentation
      芝浦工業大学
    • Related Report
      2011 Research-status Report
  • [Presentation] パラ言語情報を表現可能な対話音声合成のための重回帰HSMMの検討2011

    • Author(s)
      永田智洋, 森 大毅, 能勢 隆
    • Organizer
      音声言語シンポジウム
    • Place of Presentation
      芝浦工業大学
    • Related Report
      2011 Research-status Report
  • [Presentation] Speaker-independent style conversion for HMM-based expressive speech synthesis

    • Author(s)
      Hiroki Kanagawa
    • Organizer
      2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
    • Place of Presentation
      Vancouver, Canada
    • Related Report
      2013 Annual Research Report
  • [Presentation] HMM-based expressive speech synthesis based on phrase-level F0 context labeling

    • Author(s)
      Yu Maeno
    • Organizer
      2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
    • Place of Presentation
      Vancouver, Canada
    • Related Report
      2013 Annual Research Report
  • [Presentation] A style control technique for singing voice synthesis based on multiple-regression HSMM

    • Author(s)
      Takashi Nose
    • Organizer
      14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
    • Place of Presentation
      Lyon, France
    • Related Report
      2013 Annual Research Report
  • [Presentation] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討

    • Author(s)
      荒生侑介
    • Organizer
      日本音響学会2013年秋季研究発表会
    • Place of Presentation
      豊橋技術科学大学
    • Related Report
      2013 Annual Research Report
  • [Presentation] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価

    • Author(s)
      長濱大樹
    • Organizer
      日本音響学会2014年春季研究発表会
    • Place of Presentation
      日本大学
    • Related Report
      2013 Annual Research Report
  • [Presentation] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価

    • Author(s)
      荒生侑介
    • Organizer
      日本音響学会2014年春季研究発表会
    • Place of Presentation
      日本大学
    • Related Report
      2013 Annual Research Report
  • [Presentation] Discontinuous observation HMM for prosodic-event-based F0 generation

    • Author(s)
      Tomoki Koriyama, Takashi Nose, Takao Kobayashi
    • Organizer
      13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
    • Place of Presentation
      Portland, USA
    • Related Report
      2012 Research-status Report
  • [Presentation] A speech parameter generation algorithm using local variance for HMM-based speech synthesis

    • Author(s)
      Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
    • Organizer
      13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
    • Place of Presentation
      Portland, USA
    • Related Report
      2012 Research-status Report
  • [Presentation] HMM音声合成のための局所的系列内変動を考慮したパラメータ生成の検討

    • Author(s)
      能勢 隆, ワータヤー・チュンウィジター, 小林隆夫
    • Organizer
      日本音響学会2012年秋季研究発表会
    • Place of Presentation
      信州大学
    • Related Report
      2012 Research-status Report
  • [Presentation] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の検討

    • Author(s)
      能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2012年秋季研究発表会
    • Place of Presentation
      信州大学
    • Related Report
      2012 Research-status Report
  • [Presentation] HMM音声合成における不特定話者スタイル変換のための話者正規化学習法の検討

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2012年秋季研究発表会
    • Place of Presentation
      信州大学
    • Related Report
      2012 Research-status Report
  • [Presentation] HMM音声合成におけるスペクトル特徴量の局所変動のモデル化とパラメータ生成への適用

    • Author(s)
      能勢 隆, ワータヤー・チュンウィジター, 小林隆夫
    • Organizer
      音声研究会
    • Place of Presentation
      東北工業大学
    • Related Report
      2012 Research-status Report
  • [Presentation] 統計モデルに基づく音声合成における話者・スタイルの多様化

    • Author(s)
      能勢 隆
    • Organizer
      音声研究会
    • Place of Presentation
      同志社大学
    • Related Report
      2012 Research-status Report
    • Invited
  • [Presentation] 任意話者の多様なスタイル生成のための話者正規化スタイル変換法の検討

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Organizer
      音声研究会
    • Place of Presentation
      同志社大学
    • Related Report
      2012 Research-status Report
  • [Presentation] 多様な歌声合成のための重回帰HSMMに基づくスタイル制御法の検討

    • Author(s)
      能勢 隆, 金本美沙, 郡山知樹, 小林隆夫
    • Organizer
      音声研究会
    • Place of Presentation
      同志社大学
    • Related Report
      2012 Research-status Report
  • [Presentation] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検討

    • Author(s)
      郡山知樹, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2013年春季研究発表会
    • Place of Presentation
      東京工科大学
    • Related Report
      2012 Research-status Report
  • [Presentation] HMM音声合成における話者正規化学習を用いたスタイル変換法の評価

    • Author(s)
      金川裕紀, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2013年春季研究発表会
    • Place of Presentation
      東京工科大学
    • Related Report
      2012 Research-status Report
  • [Presentation] 対話音声合成のための音韻・韻律コンテキストを考慮した音声コーパス構築法の検討

    • Author(s)
      荒生侑介, 能勢 隆, 小林隆夫
    • Organizer
      日本音響学会2013年春季研究発表会
    • Place of Presentation
      東京工科大学
    • Related Report
      2012 Research-status Report

URL: 

Published: 2011-08-05   Modified: 2019-07-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi