A study on speech diversification techniques based on corpus design for advanced humanoid speech synthesis

Research Project

Project/Area Number	23700195
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Multi-year Fund
Research Field	Perception information processing/Intelligent robotics
Research Institution	Tohoku University (2013) Tokyo Institute of Technology (2011-2012)
Principal Investigator	NOSE Takashi 東北大学, 工学(系)研究科(研究院), 講師 (90550591)
Project Period (FY)	2011 – 2012
Project Status	Completed (Fiscal Year 2013)
Budget Amount *help	¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000) Fiscal Year 2012: ¥2,080,000 (Direct Cost: ¥1,600,000、Indirect Cost: ¥480,000) Fiscal Year 2011: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000)
Keywords	音声合成 / 隠れマルコフモデル / 統計的音声合成 / 感情音声合成 / ヒューマノイドロボット / 音声コーパス / 統計モデル / 感情音声 / コーパスデザイン / 話し言葉音声合成 / HMM音声合成 / 対話音声合成 / 音声コーパス設計 / 音声パラメータ生成 / スタイル変換 / 歌声合成
Research Abstract	Our goal in this research is to realize more human-like, natural text-to-speech system with various emotional expressions and speaking styles, and the achievements of our studies are as follows: (1)We proposed a novel corpus-design technique in which accent, style, and sentence-final expression are taken into account. (2)We incorporated user's subjective emotional intensities into acoustic model training to improve the performance of expressive speech synthesis. (3)We proposed an automatic labeling technique of emphasis expression using a parameter generation technique of fundamental frequency to realize emphatic speech synthesis. (4)We proposed cross-lingual speech synthesis using only a target speaker's native language speech samples to synthesis multi-lingual speech at a low cost.

Report

(4 results)

2013 Annual Research Report Final Research Report ( PDF )
2012 Research-status Report
2011 Research-status Report

Research Products
(106 results)

All 2014 2013 2012 2011 Other

All Journal Article (56 results) (of which Peer Reviewed: 32 results) Presentation (50 results) (of which Invited: 1 results)

[Journal Article] Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis2014
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Journal Title
  
  Speech Communication
  
  Volume: Vol.57 Pages: 144-154
- DOI
  10.1016/j.specom.2013.09.014
- Related Report
  2013 Annual Research Report 2013 Final Research Report
- Peer Reviewed
[Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価2014
- Author(s)
  長濱大樹, 能勢隆, 郡山知樹, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 413-414
- Related Report
  2013 Annual Research Report
[Journal Article] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価2014
- Author(s)
  荒生侑介, 能勢隆, 郡山知樹, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 405-406
- Related Report
  2013 Annual Research Report
[Journal Article] Robust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis2013
- Author(s)
  Tomohiro Nagata, Hiroki Mori, Takashi Nose
- Journal Title
  
  Proceedings of 14th Annual Conference of the International Speech Communication Association (ISCA)
  
  Pages: 1549-1553
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Statistical nonparametric speech synthesis using sparse Gaussian processes2013
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 14th Annual Conference of the International Speech Communication Association (ISCA)
  
  Pages: 1072-1076
- NAID
  120006702716
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013
- Author(s)
  Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
- Journal Title
  
  Proceedings of 14th Annual Conference of the International Speech Communication Association (ISCA)
  
  Pages: 378-382
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing
  
  Pages: 8007-8011
- NAID
  120006702668
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013
- Author(s)
  Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing
  
  Pages: 7864-7868
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Journal Title
  
  Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing
  
  Pages: 7859-7863
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model2013
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Speech Communication
  
  Volume: Vol.55, No.2 Issue: 2 Pages: 347-357
- DOI
  10.1016/j.specom.2012.09.003
- Related Report
  2013 Final Research Report 2012 Research-status Report
- Peer Reviewed
[Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013
- Author(s)
  Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
  
  Volume: vol.1 Pages: 7864-7867
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Journal Title
  
  Proc. 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
  
  Volume: vol.1 Pages: 7859-7863
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013
- Author(s)
  Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
- Journal Title
  
  Proc. 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
  
  Volume: vol.1 Pages: 378-382
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討2013
- Author(s)
  荒生侑介, 能勢隆, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2013年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 351-352
- Related Report
  2013 Annual Research Report
[Journal Article] 統計モデルに基づく音声合成における話者・スタイルの多様化2013
- Author(s)
  能勢隆
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: Vol.112, No.422 Pages: 67-72
- Related Report
  2012 Research-status Report
[Journal Article] 任意話者の多様なスタイル生成のための話者正規化スタイル変換法の検討2013
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: Vol.112, No.422 Pages: 73-78
- Related Report
  2012 Research-status Report
[Journal Article] 多様な歌声合成のための重回帰HSMMに基づくスタイル制御法の検討2013
- Author(s)
  能勢隆, 金本美沙, 郡山知樹, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: Vol.112, No.422 Pages: 79-84
- Related Report
  2012 Research-status Report
[Journal Article] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検討2013
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2013年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 271-272
- NAID
  120006702626
- Related Report
  2012 Research-status Report
[Journal Article] HMM音声合成における話者正規化学習を用いたスタイル変換法の評価2013
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2013年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 295-296
- Related Report
  2012 Research-status Report
[Journal Article] 対話音声合成のための音韻・韻律コンテキストを考慮した音声コーパス構築法の検討2013
- Author(s)
  荒生侑介, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2013年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 499-500
- Related Report
  2012 Research-status Report
[Journal Article] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 13th Annual Conference of the International Speech Communication Association (ISCA)
  
  Pages: 1151-1154
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Discontinuous observation HMM for prosodic-event-based F0 generation2012
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 13th Annual Conference of the International Speech Communication Association (ISCA)
  
  Pages: 462-465
- NAID
  120006702590
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] An F0 modeling technique based on prosodic events for spontaneous speech synthesis2012
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012)
  
  Pages: 4589-4593
- NAID
  120006702508
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] HMM に基づく対話音声合成における多様な韻律生成のためのコンテクストの拡張2012
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会論文誌
  
  Volume: Vol.J95-D, No.3 Pages: 597-607
- NAID
  110009418768
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols2012
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Speech Communication
  
  Volume: 54 Issue: 3 Pages: 384-392
- DOI
  10.1016/j.specom.2011.10.002
- Related Report
  2013 Final Research Report 2011 Research-status Report
- Peer Reviewed
[Journal Article] A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis2012
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Speech Communication
  
  Volume: Vol.54, No.2 Issue: 2 Pages: 245-255
- DOI
  10.1016/j.specom.2011.08.006
- Related Report
  2013 Final Research Report 2011 Research-status Report
- Peer Reviewed
[Journal Article] Discontinuous observation HMM for prosodic-event-based F0 generation2012
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
  
  Volume: vol.1 Pages: 462-465
- NAID
  120006702590
- Related Report
  2012 Research-status Report
- Peer Reviewed
[Journal Article] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
  
  Volume: vol.1 Pages: 1151-1154
- Related Report
  2012 Research-status Report
- Peer Reviewed
[Journal Article] HMM音声合成のための局所的系列内変動を考慮したパラメータ生成の検討2012
- Author(s)
  能勢隆, ワータヤー・チュンウィジター, 小林隆夫
- Journal Title
  
  日本音響学会2012年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 277-278
- Related Report
  2012 Research-status Report
[Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の検討2012
- Author(s)
  能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2012年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 279-280
- Related Report
  2012 Research-status Report
[Journal Article] HMM音声合成における不特定話者スタイル変換のための話者正規化学習法の検討2012
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2012年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 431-432
- Related Report
  2012 Research-status Report
[Journal Article] HMM音声合成におけるスペクトル特徴量の局所変動のモデル化とパラメータ生成への適用2012
- Author(s)
  能勢隆, ワータヤー・チュンウィジター, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: Vol.112, No.81 Pages: 43-48
- NAID
  110009642342
- Related Report
  2012 Research-status Report
[Journal Article] HMMに基づく対話音声合成における多様な韻律生成のためのコンテクストの拡張2012
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会論文誌
  
  Volume: vol.J95-D, no.3 Pages: 597-607
- NAID
  110009418768
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] 合成音声のスタイル制御における系列内変動を考慮したスペクトル・韻律パラメータの生成2012
- Author(s)
  能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2012年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 307-308
- Related Report
  2011 Research-status Report
[Journal Article] 観測値の不連続性を考慮したHMMに基づくF0モデル化の検討2012
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2012年春季研究発表会講演論文集
  
  Volume: vol.1 Pages: 305-306
- NAID
  120006702505
- Related Report
  2011 Research-status Report
[Journal Article] Recent development of HMM-based expressive speech synthesis and its applications2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2011 Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference
- URL
  http://www.apsipa.org/proceedings_2011/pdf/APSIPA189.pdf
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Speaker-independent HMM-based voice conversion using adaptive quantization of the fundamental frequency2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Speech Communication
  
  Volume: Vol.53, No.7 Issue: 7 Pages: 973-985
- DOI
  10.1016/j.specom.2011.05.001
- Related Report
  2013 Final Research Report 2011 Research-status Report
- Peer Reviewed
[Journal Article] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)
  
  Pages: 2657-2660
- NAID
  120006702435
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Performance prediction of speech recognition using average-voice-based speech synthesis2011
- Author(s)
  Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
- Journal Title
  
  Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)
  
  Pages: 1953-1956
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] HMM-based emphatic speech synthesis using unsupervised context labeling2011
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Journal Title
  
  Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)
  
  Pages: 1849-185
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 12th Annual Conference of the International Speech Communication Association (ISCA) (INTERSPEECH 2011)
  
  Pages: 109-112
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Very low bit-rate F0 coding for phonetic vocoder using MSD-HMM with quantized F0 context2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
  
  Volume: vol.1 Pages: 5236-5239
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis2011
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
  
  Volume: vol.1 Pages: 4708-4711
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
  
  Volume: vol.1 Pages: 2657-2660
- NAID
  120006702435
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] Performance prediction of speech recognition using average-voice-based speech synthesis2011
- Author(s)
  Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
- Journal Title
  
  Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
  
  Volume: vol.1 Pages: 1953-1956
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] HMM-based emphatic speech synthesis using unsupervised context labeling2011
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Journal Title
  
  Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
  
  Volume: vol.1 Pages: 1849-1852
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proc. 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
  
  Volume: vol.1 Pages: 109-112
- Related Report
  2011 Research-status Report
- Peer Reviewed
[Journal Article] 多様な音声合成のための強調コンテキストの自動付与の検討2011
- Author(s)
  前野悠, 能勢隆, 小林隆夫, 井島勇祐, 中嶋秀治, 水野秀之, 吉岡理
- Journal Title
  
  日本音響学会2011年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 335-336
- Related Report
  2011 Research-status Report
[Journal Article] 対話音声合成のためのイントネーションラベルのタイミング予測2011
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2011年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 333-334
- Related Report
  2011 Research-status Report
[Journal Article] 感情音声合成における主観的表出度合のモデル化と制御の検討2011
- Author(s)
  能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2011年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 329-330
- Related Report
  2011 Research-status Report
[Journal Article] 音声合成を用いた音声認識性能予測－残響と騒音が存在する環境での評価－2011
- Author(s)
  太刀岡勇気, 堀井昭男, 岩崎知弘, 斉藤辰彦, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2011年秋季研究発表会講演論文集
  
  Volume: vol.1 Pages: 9-10
- Related Report
  2011 Research-status Report
[Journal Article] 日本語話し言葉コーパスを用いた対話音声合成のためのコンテキストの評価2011
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: vol.111, no.28 Pages: 155-160
- NAID
  110008725586
- Related Report
  2011 Research-status Report
[Journal Article] HMM音声合成のための動的特徴量を用いた音素継続長モデリングの検討2011
- Author(s)
  能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: vol.111, no.365 Pages: 197-202
- NAID
  10031110919
- Related Report
  2011 Research-status Report
[Journal Article] HMM音声合成における不特定話者スタイル変換の検討2011
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: vol.111, no.365 Pages: 191-196
- NAID
  10031110896
- Related Report
  2011 Research-status Report
[Journal Article] 韻律イベントHMMを用いた対話音声F0生成2011
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫,
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: vol.111, no.365 Pages: 185-190
- NAID
  10031110881
- Related Report
  2011 Research-status Report
[Journal Article] パラ言語情報を表現可能な対話音声合成のための重回帰HSMMの検討2011
- Author(s)
  永田智洋, 森大毅, 能勢隆
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: vol.111, no.365 Pages: 179-184
- NAID
  10031110871
- Related Report
  2011 Research-status Report
[Presentation] Robust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis2013
- Author(s)
  Tomohiro Nagata, Hiroki Mori, Takashi Nose
- Organizer
  INTERSPEECH 2013
- Place of Presentation
  Lyon, France
- Year and Date
  2013-08-27
- Related Report
  2013 Final Research Report
[Presentation] Statistical nonparametric speech synthesis using sparse Gaussian processes2013
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  INTERSPEECH 2013
- Place of Presentation
  Lyon, France
- Year and Date
  2013-08-27
- Related Report
  2013 Final Research Report
[Presentation] A style control technique for singing voice synthesis based on multiple-regression HSMM2013
- Author(s)
  Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
- Organizer
  INTERSPEECH 2013
- Place of Presentation
  Lyon, France
- Year and Date
  2013-08-26
- Related Report
  2013 Final Research Report
[Presentation] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  ICASSP 2013
- Place of Presentation
  Vancouver, Canada
- Year and Date
  2013-05-31
- Related Report
  2013 Final Research Report
[Presentation] Speaker-independent style conversion for HMM-based expressive speech synthesis2013
- Author(s)
  Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
- Organizer
  ICASSP 2013
- Place of Presentation
  Vancouver, Canada
- Year and Date
  2013-05-31
- Related Report
  2013 Final Research Report
[Presentation] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Organizer
  ICASSP 2013
- Place of Presentation
  Vancouver, Canada
- Year and Date
  2013-05-31
- Related Report
  2013 Final Research Report
[Presentation] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Organizer
  INTERSPEECH 2012
- Place of Presentation
  Portland, USA
- Year and Date
  2012-09-11
- Related Report
  2013 Final Research Report
[Presentation] Discontinuous observation HMM for prosodic-event-based F0 generation2012
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  INTERSPEECH 2012
- Place of Presentation
  Portland, USA
- Year and Date
  2012-09-10
- Related Report
  2013 Final Research Report
[Presentation] An F0 modeling technique based on prosodic events for spontaneous speech synthesis2012
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  ICASSP 2012
- Place of Presentation
  Kyoto, Japan
- Year and Date
  2012-03-29
- Related Report
  2013 Final Research Report
[Presentation] An F0 modeling technique based on prosodic events for spontaneous speech synthesis2012
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
- Place of Presentation
  Kyoto, Japan
- Related Report
  2011 Research-status Report
[Presentation] 合成音声のスタイル制御における系列内変動を考慮したスペクトル・韻律パラメータの生成2012
- Author(s)
  能勢隆, 小林隆夫
- Organizer
  日本音響学会2012年春季研究発表会
- Place of Presentation
  神奈川大学
- Related Report
  2011 Research-status Report
[Presentation] 観測値の不連続性を考慮したHMMに基づくF0モデル化の検討2012
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2012年春季研究発表会
- Place of Presentation
  神奈川大学
- Related Report
  2011 Research-status Report
[Presentation] Recent development of HMM-based expressive speech synthesis and its applications2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Organizer
  APSIPA ASC 2011
- Place of Presentation
  Xi'an, China
- Year and Date
  2011-10-19
- Related Report
  2013 Final Research Report
[Presentation] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Year and Date
  2011-08-30
- Related Report
  2013 Final Research Report
[Presentation] Performance prediction of speech recognition using average-voice-based speech synthesis2011
- Author(s)
  Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
- Organizer
  INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Year and Date
  2011-08-29
- Related Report
  2013 Final Research Report
[Presentation] HMM-based emphatic speech synthesis using unsupervised context labeling2011
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Organizer
  INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Year and Date
  2011-08-29
- Related Report
  2013 Final Research Report
[Presentation] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Organizer
  INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Year and Date
  2011-08-28
- Related Report
  2013 Final Research Report
[Presentation] Very low bit-rate F0 coding for phonetic vocoder using MSD-HMM with quantized F0 context2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Organizer
  2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, CASSP 2011
- Place of Presentation
  Prague, Czech Republic
- Related Report
  2011 Research-status Report
[Presentation] Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis2011
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Organizer
  2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, CASSP 2011
- Place of Presentation
  Prague, Czech Republic
- Related Report
  2011 Research-status Report
[Presentation] On the use of extended context for HMM-based spontaneous conversational speech synthesis2011
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Related Report
  2011 Research-status Report
[Presentation] Performance prediction of speech recognition using average-voice-based speech synthesis2011
- Author(s)
  Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
- Organizer
  12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Related Report
  2011 Research-status Report
[Presentation] HMM-based emphatic speech synthesis using unsupervised context labeling2011
- Author(s)
  Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
- Organizer
  12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Related Report
  2011 Research-status Report
[Presentation] A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM2011
- Author(s)
  Takashi Nose, Takao Kobayashi
- Organizer
  12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
- Place of Presentation
  Florence, Italy
- Related Report
  2011 Research-status Report
[Presentation] 多様な音声合成のための強調コンテキストの自動付与の検討2011
- Author(s)
  前野悠, 能勢隆, 小林隆夫, 井島勇祐, 中嶋秀治, 水野秀之, 吉岡理
- Organizer
  日本音響学会2011年秋季研究発表会
- Place of Presentation
  島根大学
- Related Report
  2011 Research-status Report
[Presentation] 対話音声合成のためのイントネーションラベルのタイミング予測2011
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2011年秋季研究発表会
- Place of Presentation
  島根大学
- Related Report
  2011 Research-status Report
[Presentation] 感情音声合成における主観的表出度合のモデル化と制御の検討2011
- Author(s)
  能勢隆, 小林隆夫
- Organizer
  日本音響学会2011年秋季研究発表会
- Place of Presentation
  島根大学
- Related Report
  2011 Research-status Report
[Presentation] 音声合成を用いた音声認識性能予測－残響と騒音が存在する環境での評価－2011
- Author(s)
  太刀岡勇気, 堀井昭男, 岩崎知弘, 斉藤辰彦, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2011年秋季研究発表会
- Place of Presentation
  島根大学
- Related Report
  2011 Research-status Report
[Presentation] 日本語話し言葉コーパスを用いた対話音声合成のためのコンテキストの評価2011
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Organizer
  音声研究会
- Place of Presentation
  立命館大学
- Related Report
  2011 Research-status Report
[Presentation] HMM音声合成のための動的特徴量を用いた音素継続長モデリングの検討2011
- Author(s)
  能勢隆, 小林隆夫
- Organizer
  音声言語シンポジウム
- Place of Presentation
  芝浦工業大学
- Related Report
  2011 Research-status Report
[Presentation] HMM音声合成における不特定話者スタイル変換の検討2011
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Organizer
  音声言語シンポジウム
- Place of Presentation
  芝浦工業大学
- Related Report
  2011 Research-status Report
[Presentation] 韻律イベントHMMを用いた対話音声F0生成2011
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Organizer
  音声言語シンポジウム
- Place of Presentation
  芝浦工業大学
- Related Report
  2011 Research-status Report
[Presentation] パラ言語情報を表現可能な対話音声合成のための重回帰HSMMの検討2011
- Author(s)
  永田智洋, 森大毅, 能勢隆
- Organizer
  音声言語シンポジウム
- Place of Presentation
  芝浦工業大学
- Related Report
  2011 Research-status Report
[Presentation] Speaker-independent style conversion for HMM-based expressive speech synthesis
- Author(s)
  Hiroki Kanagawa
- Organizer
  2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
- Place of Presentation
  Vancouver, Canada
- Related Report
  2013 Annual Research Report
[Presentation] HMM-based expressive speech synthesis based on phrase-level F0 context labeling
- Author(s)
  Yu Maeno
- Organizer
  2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
- Place of Presentation
  Vancouver, Canada
- Related Report
  2013 Annual Research Report
[Presentation] A style control technique for singing voice synthesis based on multiple-regression HSMM
- Author(s)
  Takashi Nose
- Organizer
  14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
- Place of Presentation
  Lyon, France
- Related Report
  2013 Annual Research Report
[Presentation] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討
- Author(s)
  荒生侑介
- Organizer
  日本音響学会2013年秋季研究発表会
- Place of Presentation
  豊橋技術科学大学
- Related Report
  2013 Annual Research Report
[Presentation] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価
- Author(s)
  長濱大樹
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学
- Related Report
  2013 Annual Research Report
[Presentation] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価
- Author(s)
  荒生侑介
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学
- Related Report
  2013 Annual Research Report
[Presentation] Discontinuous observation HMM for prosodic-event-based F0 generation
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Organizer
  13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
- Place of Presentation
  Portland, USA
- Related Report
  2012 Research-status Report
[Presentation] A speech parameter generation algorithm using local variance for HMM-based speech synthesis
- Author(s)
  Vataya Chunwijitra, Takashi Nose, Takao Kobayashi
- Organizer
  13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
- Place of Presentation
  Portland, USA
- Related Report
  2012 Research-status Report
[Presentation] HMM音声合成のための局所的系列内変動を考慮したパラメータ生成の検討
- Author(s)
  能勢隆, ワータヤー・チュンウィジター, 小林隆夫
- Organizer
  日本音響学会2012年秋季研究発表会
- Place of Presentation
  信州大学
- Related Report
  2012 Research-status Report
[Presentation] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の検討
- Author(s)
  能勢隆, 小林隆夫
- Organizer
  日本音響学会2012年秋季研究発表会
- Place of Presentation
  信州大学
- Related Report
  2012 Research-status Report
[Presentation] HMM音声合成における不特定話者スタイル変換のための話者正規化学習法の検討
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2012年秋季研究発表会
- Place of Presentation
  信州大学
- Related Report
  2012 Research-status Report
[Presentation] HMM音声合成におけるスペクトル特徴量の局所変動のモデル化とパラメータ生成への適用
- Author(s)
  能勢隆, ワータヤー・チュンウィジター, 小林隆夫
- Organizer
  音声研究会
- Place of Presentation
  東北工業大学
- Related Report
  2012 Research-status Report
[Presentation] 統計モデルに基づく音声合成における話者・スタイルの多様化
- Author(s)
  能勢隆
- Organizer
  音声研究会
- Place of Presentation
  同志社大学
- Related Report
  2012 Research-status Report
- Invited
[Presentation] 任意話者の多様なスタイル生成のための話者正規化スタイル変換法の検討
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Organizer
  音声研究会
- Place of Presentation
  同志社大学
- Related Report
  2012 Research-status Report
[Presentation] 多様な歌声合成のための重回帰HSMMに基づくスタイル制御法の検討
- Author(s)
  能勢隆, 金本美沙, 郡山知樹, 小林隆夫
- Organizer
  音声研究会
- Place of Presentation
  同志社大学
- Related Report
  2012 Research-status Report
[Presentation] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検討
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学
- Related Report
  2012 Research-status Report
[Presentation] HMM音声合成における話者正規化学習を用いたスタイル変換法の評価
- Author(s)
  金川裕紀, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学
- Related Report
  2012 Research-status Report
[Presentation] 対話音声合成のための音韻・韻律コンテキストを考慮した音声コーパス構築法の検討
- Author(s)
  荒生侑介, 能勢隆, 小林隆夫
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学
- Related Report
  2012 Research-status Report

A study on speech diversification techniques based on corpus design for advanced humanoid speech synthesis

Principal Investigator

NOSE Takashi 東北大学, 工学(系)研究科(研究院), 講師 (90550591)

¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)

Report

Research Products

[Journal Article] Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis2014

Author(s)

Journal Title

DOI

Related Report

[Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価2014

Author(s)

Journal Title

Related Report

[Journal Article] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価2014

Author(s)

Journal Title

Related Report

[Journal Article] Robust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis2013

Author(s)

Journal Title

Related Report

[Journal Article] Statistical nonparametric speech synthesis using sparse Gaussian processes2013

Author(s)

Journal Title

NAID

Related Report

[Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013

Author(s)

Journal Title

Related Report

[Journal Article] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013

Author(s)

Journal Title

Related Report

[Journal Article] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013

Author(s)

Journal Title

Related Report

[Journal Article] An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model2013

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013

Author(s)

Journal Title

Related Report

[Journal Article] HMM-based expressive speech synthesis based on phrase-level F0 context labeling2013

Author(s)

Journal Title

Related Report

[Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013

Author(s)

Journal Title

Related Report

[Journal Article] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討2013

Author(s)

Journal Title

Related Report

[Journal Article] 統計モデルに基づく音声合成における話者・スタイルの多様化2013

Author(s)

Journal Title

Related Report

[Journal Article] 任意話者の多様なスタイル生成のための話者正規化スタイル変換法の検討2013

Author(s)

Journal Title

Related Report

[Journal Article] 多様な歌声合成のための重回帰HSMMに基づくスタイル制御法の検討2013

Author(s)

Journal Title

Related Report

[Journal Article] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検討2013

Author(s)