Research on advanced robust speech synthesis and its applications to multi-lingual speech communication

Research Project

Project/Area Number	24300071
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Partial Multi-year Fund
Section	一般
Research Field	Perception information processing/Intelligent robotics
Research Institution	Tokyo Institute of Technology
Principal Investigator	KOBAYASHI Takao 東京工業大学, 総合理工学研究科(研究院), 教授 (70153616)
Co-Investigator(Kenkyū-buntansha)	NOSE Takashi 東北大学, 大学院工学研究科, 講師 (90550591)
Research Collaborator	KORIYAMA Tomoki 東京工業大学, 大学院総合理工学研究科, 助教 (50749124) ARIFIANTO Dhany スラバヤ工科大学, 工学物理学科, 講師
Project Period (FY)	2012-04-01 – 2015-03-31
Project Status	Completed (Fiscal Year 2014)
Budget Amount *help	¥14,300,000 (Direct Cost: ¥11,000,000、Indirect Cost: ¥3,300,000) Fiscal Year 2014: ¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000) Fiscal Year 2013: ¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000) Fiscal Year 2012: ¥5,330,000 (Direct Cost: ¥4,100,000、Indirect Cost: ¥1,230,000)
Keywords	テキスト音声合成 / 統計的パラメトリック音声合成 / HMM音声合成 / 表現豊かな音声合成 / 韻律 / クロスリンガル音声合成 / 音声スタイル制御 / 基本周波数正規化学習 / 韻律ラベリング / 国際情報交換（インドネシア） / 自然発話音声 / ガウス過程回帰 / トーン（声調） / 共有決定木 / 話者正規化学習 / 韻律イベント
Outline of Final Research Achievements	The purpose of the research is to develop advanced techniques that enable us to model acoustic features of prosodic information as well as spectral information with being less dependent on quality and quantity of training speech data for synthesizing natural-sounding and diverse expressive speech. We have proposed several robust techniques such as style control and prosody modeling ones and showed their effectiveness through objective and subjective evaluation tests. We have also applied the proposed techniques to under-resourced languages. Furthermore, we examined a cross-lingual speech synthesis technique for universal speech communication.

Report

(4 results)

2014 Annual Research Report Final Research Report ( PDF )
2013 Annual Research Report
2012 Annual Research Report

Research Products
(81 results)

All 2015 2014 2013 2012

All Journal Article (42 results) (of which Peer Reviewed: 16 results, Acknowledgement Compliant: 13 results, Open Access: 3 results) Presentation (39 results) (of which Invited: 1 results)

[Journal Article] ガウス過程回帰に基づく音声合成システムの検討2015
- Author(s)
  郡山知樹, 小林隆夫
- Journal Title
  
  日本音響学会2015年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 269-270
- NAID
  120006703848
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] 言語モデルと音響モデルを用いた自動韻律ラベリングの評価2015
- Author(s)
  増子理菜, 郡山知樹, 小林隆夫
- Journal Title
  
  日本音響学会2015年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 361-362
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] ガウス過程回帰に基づく音声合成のためのコンテキストの検討2015
- Author(s)
  岡元伶洋, 郡山知樹, 小林隆夫
- Journal Title
  
  日本音響学会2015年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 371-372
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis2015
- Author(s)
  Tomoki Koriyama, Takao Kobayashi
- Journal Title
  
  Proceedings of 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing
  
  Volume: ICASSP 2015 Pages: 4929-4933
- NAID
  120006703851
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Statistical Parametric Speech Synthesis Based on Gaussian Process Regression2014
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  IEEE Journal of Selected Topics in Signal Processing
  
  Volume: 8 Issue: 2 Pages: 173-183
- DOI
  10.1109/jstsp.2013.2283461
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] A parameter generation algorithm using local variance for HMM-based speech synthesis2014
- Author(s)
  Takashi Nose, Vataya Chunwijitra, Takao Kobayashi
- Journal Title
  
  IEEE Journal of Selected Topics in Signal Processing
  
  Volume: 8 Issue: 2 Pages: 221-228
- DOI
  10.1109/jstsp.2013.2283459
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization2014
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing
  
  Volume: ICASSP 2014 Pages: 3862-3866
- DOI
  10.1109/icassp.2014.6854319
- NAID
  120006703288
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Tone modeling using stress information for HMM-based Thai speech synthesis2014
- Author(s)
  Decha Moungsri, Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of the 7th International Conference on Speech Prosody
  
  Volume: SPEECHPROSODY 7 Pages: 1057-1061
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis2014
- Author(s)
  Daiki Nagahama, Takashi Nose, Tomoki Koriyama, Takao Kobayashi
- Journal Title
  
  Proceedings of the 15th Annual Conference of the International Speech Communication Association
  
  Volume: INTERSPEECH 2014 Pages: 770-774
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling2014
- Author(s)
  Tomoki Koriyama, Hiroshi Suzuki, Takashi Nose, Takahiro Shinozaki, Takao Kobayashi
- Journal Title
  
  Proceedings of the 15th Annual Conference of the International Speech Communication Association
  
  Volume: INTERSPEECH 2014 Pages: 2337-2341
- NAID
  120006703358
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Parametric speech synthesis using local and global sparse Gaussian processes2014
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of IEEE International Workshop on Machine Learning for Signal Processing
  
  Volume: MLSP 2014 Pages: 1-6
- DOI
  10.1109/mlsp.2014.6958921
- NAID
  120006703336
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] HMM-based Thai speech synthesis using unsupervised stress context labeling2014
- Author(s)
  Decha Moungsri, Tomoki Koriyama, Takao Kobayashi
- Journal Title
  
  Proceedings of 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
  
  Volume: APSIPA ASC 2014 Pages: 1-4
- DOI
  10.1109/apsipa.2014.7041599
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] ガウス過程回帰に基づくF0パタン生成の検討2014
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2014年秋季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 247-248
- NAID
  120006703360
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] ガウス過程回帰に基づく音声合成におけるハイパーパラメータ最適化の検討2014
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告　音声
  
  Volume: 113, SP2013-99 Pages: 19-24
- Related Report
  2013 Annual Research Report
[Journal Article] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価2014
- Author(s)
  荒生侑介, 能勢隆, 郡山知樹, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 405-406
- Related Report
  2013 Annual Research Report
[Journal Article] HMM音声合成のための音節出現頻度にロバストな音素セットの検討2014
- Author(s)
  舘野英樹, 能勢隆, 郡山知樹, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 409-410
- Related Report
  2013 Annual Research Report
[Journal Article] HMM音声合成における正規化学習を用いたアクセント誤り削減の検討2014
- Author(s)
  大西浩之, 能勢隆, 郡山知樹, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 411-412
- Related Report
  2013 Annual Research Report
[Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価2014
- Author(s)
  長濱大樹, 能勢隆, 郡山知樹, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 413-414
- Related Report
  2013 Annual Research Report
[Journal Article] 音響モデルと言語モデルを利用したアクセント型・アクセント句境界の同時推定2014
- Author(s)
  鈴木啓史, 郡山知樹, 能勢隆, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 441-442
- Related Report
  2013 Annual Research Report
[Journal Article] 系列内変動を考慮したガウス過程回帰に基づく音声パラメータ生成2014
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2014年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 355-356
- NAID
  120006702995
- Related Report
  2013 Annual Research Report
[Journal Article] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
  
  Volume: ICASSP 2013 Pages: 8007-8011
- DOI
  10.1109/icassp.2013.6639224
- NAID
  120006702668
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Speaker-independent style conversion for HMM-based expressive speech synthesis2013
- Author(s)
  Hiroki Kanagawa, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
  
  Volume: ICASSP 2013 Pages: 7864-7868
- DOI
  10.1109/icassp.2013.6639195
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Statistical nonparametric speech synthesis using sparse Gaussian processes2013
- Author(s)
  Tomoki Koriyama, Takashi Nose, Takao Kobayashi
- Journal Title
  
  Proceedings of the 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
  
  Volume: INTERSPEECH 2013 Pages: 1072-1076
- NAID
  120006702716
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] A style control technique for singing voice synthesis based on multiple-regression HSMM2013
- Author(s)
  Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi
- Journal Title
  
  Proceedings of the 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
  
  Volume: INTERSPEECH 2013 Pages: 378-382
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討2013
- Author(s)
  荒生侑介, 能勢隆, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2013年秋季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 351-352
- Related Report
  2013 Annual Research Report
[Journal Article] GMMに基づく声質変換のためのMDL基準による混合数の自動決定2013
- Author(s)
  小林友哉, 能勢隆, 篠崎隆宏, 小林隆夫
- Journal Title
  
  日本音響学会2013年秋季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 341-342
- Related Report
  2013 Annual Research Report
[Journal Article] スパース近似と畳み込みカーネルを用いたガウス過程回帰に基づく音声合成2013
- Author(s)
  郡山知樹, 能勢隆, 小林隆夫
- Journal Title
  
  日本音響学会2013年秋季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 311-312
- NAID
  120006702748
- Related Report
  2013 Annual Research Report
[Journal Article] 言語モデルと音響モデルを利用したアクセント境界の自動推定2013
- Author(s)
  鈴木啓史, 郡山知樹, 能勢隆, 篠崎隆宏, 小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告　音声
  
  Volume: 113, SP2013-89 Pages: 97-102
- Related Report
  2013 Annual Research Report
[Journal Article] 多様な音声合成に向けた取組みと課題2013
- Author(s)
  小林隆夫
- Journal Title
  
  電子情報通信学会技術研究報告　音声
  
  Volume: 113, SP2013-93 Pages: 119-122
- Related Report
  2013 Annual Research Report
[Journal Article] An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model2013
- Author(s)
  Takashi Nose, Takao Kobayashi
- Journal Title
  
  Speech Communication
  
  Volume: Vol.55, No.2 Issue: 2 Pages: 347-357
- DOI
  10.1016/j.specom.2012.09.003
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] 統計モデルに基づく音声合成における話者・スタイルの多様化2013
- Author(s)
  能勢隆
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: 112 SP2012-109(招待講演) Pages: 67-72
- Related Report
  2012 Annual Research Report
[Journal Article] 任意話者の多様なスタイル生成のための話者正規化スタイル変換法の検討2013
- Author(s)
  金川裕紀
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: 112 SP2012-110 Pages: 79-84
- Related Report
  2012 Annual Research Report
[Journal Article] 多様な歌声合成のための重回帰HSMMに基づくスタイル制御法の検討2013
- Author(s)
  能勢隆
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: CD-ROM Pages: 271-272
- Related Report
  2012 Annual Research Report
[Journal Article] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検2013
- Author(s)
  郡山知樹
- Journal Title
  
  日本音響学会2013年春季研究発表会講演論文集
  
  Volume: CD-ROM Pages: 271-272
- Related Report
  2012 Annual Research Report
[Journal Article] HMM音声合成における話者正規化学習を用いたスタイル変換法の評価2013
- Author(s)
  金川裕紀
- Journal Title
  
  日本音響学会2013年春季研究発表講演論文集
  
  Volume: CD-ROM Pages: 295-296
- Related Report
  2012 Annual Research Report
[Journal Article] 対話音声合成のための音韻・韻律コンテキストを考慮した音声コーパス構築法の検討2013
- Author(s)
  荒生侑介
- Journal Title
  
  日本音響学会2013年春季研究発表講演論文集
  
  Volume: CD-ROM Pages: 499-500
- Related Report
  2012 Annual Research Report
[Journal Article] Discontinuous observation HMM for prosodic-event-based FO generation2012
- Author(s)
  Tomoki Koriyama
- Journal Title
  
  Proceedings of the 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
  
  Volume: (CD-ROM)
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012
- Author(s)
  Vataya Chunwijitra
- Journal Title
  
  Proceedings of the 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
  
  Volume: (CD-ROM)
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] HMM音声合成のための局所的系列内変動を考慮したパラメータ生成の検討2012
- Author(s)
  能勢隆
- Journal Title
  
  日本音響学会2012年秋季研究発表会講演論文集
  
  Volume: (CD-ROM) Pages: 277-278
- Related Report
  2012 Annual Research Report
[Journal Article] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の検討2012
- Author(s)
  能勢隆
- Journal Title
  
  日本音響学会2012年秋季研究発表会講演論文集
  
  Volume: (CD-ROM) Pages: 279-280
- Related Report
  2012 Annual Research Report
[Journal Article] HMM音声合成における不特定話者スタイル変換のための話者正規化学習法の2012
- Author(s)
  金川裕紀
- Journal Title
  
  日本音響学会2012年秋季研究発表会講演論文集
  
  Volume: (CD-ROM) Pages: 431-432
- Related Report
  2012 Annual Research Report
[Journal Article] HMM音声合成におけるスペクトル特微量の局所変動のモデル化とパラメータ2012
- Author(s)
  能勢隆
- Journal Title
  
  電子情報通信学会技術研究報告
  
  Volume: 112 SP2012-79 Pages: 43-48
- Related Report
  2012 Annual Research Report
[Presentation] Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis2015
- Author(s)
  Tomoki Koriyama
- Organizer
  2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
- Place of Presentation
  Brisbane Convention & Exhibition Centre（オーストラリア）
- Year and Date
  2015-04-19 – 2015-04-24
- Related Report
  2014 Annual Research Report
[Presentation] 言語モデルと音響モデルを用いた自動韻律ラベリングの評価2015
- Author(s)
  増子理菜
- Organizer
  日本音響学会2015年春季研究発表会
- Place of Presentation
  中央大学後楽園キャンパス（東京）
- Year and Date
  2015-03-16 – 2015-03-18
- Related Report
  2014 Annual Research Report
[Presentation] ガウス過程回帰に基づく音声合成システムの検討2015
- Author(s)
  郡山知樹
- Organizer
  日本音響学会2015年春季研究発表会
- Place of Presentation
  中央大学後楽園キャンパス（東京）
- Year and Date
  2015-03-16 – 2015-03-18
- Related Report
  2014 Annual Research Report
[Presentation] ガウス過程回帰に基づく音声合成のためのコンテキストの検討2015
- Author(s)
  岡元伶洋
- Organizer
  日本音響学会2015年春季研究発表会
- Place of Presentation
  中央大学後楽園キャンパス（東京）
- Year and Date
  2015-03-16 – 2015-03-18
- Related Report
  2014 Annual Research Report
[Presentation] HMM-based Thai speech synthesis using unsupervised stress context labeling2014
- Author(s)
  Decha Moungsri
- Organizer
  2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2014
- Place of Presentation
  Sokha Angkor Resort（カンボジア）
- Year and Date
  2014-12-09 – 2014-12-12
- Related Report
  2014 Annual Research Report
[Presentation] Parametric speech synthesis using local and global sparse Gaussian processes2014
- Author(s)
  Tomoki Koriyama
- Organizer
  International Workshop on Machine Learning for Signal Processing, MLSP2014
- Place of Presentation
  Reims Centre De Congres（フランス）
- Year and Date
  2014-09-21 – 2014-09-24
- Related Report
  2014 Annual Research Report
[Presentation] Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis2014
- Author(s)
  Daiki Nagahama
- Organizer
  The 15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014
- Place of Presentation
  Singapore Expo（シンガポール）
- Year and Date
  2014-09-14 – 2014-09-18
- Related Report
  2014 Annual Research Report
[Presentation] Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling2014
- Author(s)
  Tomoki Koriyama
- Organizer
  The 15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014
- Place of Presentation
  Singapore Expo（シンガポール）
- Year and Date
  2014-09-14 – 2014-09-18
- Related Report
  2014 Annual Research Report
[Presentation] ガウス過程回帰に基づくF0パタン生成の検討2014
- Author(s)
  郡山知樹
- Organizer
  日本音響学会2014年秋季研究発表会
- Place of Presentation
  北海学園大学豊平キャンパス
- Year and Date
  2014-09-03 – 2014-09-05
- Related Report
  2014 Annual Research Report
[Presentation] Tone modeling using stress information for HMM-based Thai speech synthesis2014
- Author(s)
  Decha Moungsri
- Organizer
  The 7th International Conference on Speech Prosody, SPEECHPROSODY 7
- Place of Presentation
  トリニティカレッジ（アイルランド）
- Year and Date
  2014-05-20 – 2014-05-23
- Related Report
  2014 Annual Research Report
[Presentation] Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization2014
- Author(s)
  Tomoki Koriyama
- Organizer
  2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
- Place of Presentation
  "Fortezza Da Basso” Convention & Exhibition Centre （イタリア）
- Year and Date
  2014-05-04 – 2014-05-09
- Related Report
  2014 Annual Research Report
[Presentation] ガウス過程回帰に基づく音声合成におけるハイパーパラメータ最適化の検討2014
- Author(s)
  郡山　知樹
- Organizer
  電子情報通信学会・日本音響学会　音声研究会
- Place of Presentation
  名城大学天白キャンパス（愛知）
- Related Report
  2013 Annual Research Report
[Presentation] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価2014
- Author(s)
  荒生　侑介
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学理工学部駿河台キャンパス（東京）
- Related Report
  2013 Annual Research Report
[Presentation] HMM音声合成のための音節出現頻度にロバストな音素セットの検討2014
- Author(s)
  舘野　英樹
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学理工学部駿河台キャンパス（東京）
- Related Report
  2013 Annual Research Report
[Presentation] HMM音声合成における正規化学習を用いたアクセント誤り削減の検討2014
- Author(s)
  大西　浩之
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学理工学部駿河台キャンパス（東京）
- Related Report
  2013 Annual Research Report
[Presentation] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の評価2014
- Author(s)
  長濱　大樹
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学理工学部駿河台キャンパス（東京）
- Related Report
  2013 Annual Research Report
[Presentation] 音響モデルと言語モデルを利用したアクセント型・アクセント句境界の同時推定2014
- Author(s)
  郡山　知樹
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学理工学部駿河台キャンパス（東京）
- Related Report
  2013 Annual Research Report
[Presentation] 系列内変動を考慮したガウス過程回帰に基づく音声パラメータ生成2014
- Author(s)
  郡山　知樹
- Organizer
  日本音響学会2014年春季研究発表会
- Place of Presentation
  日本大学理工学部駿河台キャンパス（東京）
- Related Report
  2013 Annual Research Report
[Presentation] 対話音声合成のための音韻・韻律コンテキストを考慮した音声コーパス構築法の検討2013
- Author(s)
  荒生侑介
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学, 東京都八王子市
- Year and Date
  2013-03-15
- Related Report
  2012 Annual Research Report
[Presentation] 音声合成のためのガウス過程回帰を用いたフレームレベル音響モデリングの検討2013
- Author(s)
  郡山知樹
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学, 東京都八王子市
- Year and Date
  2013-03-13
- Related Report
  2012 Annual Research Report
[Presentation] HMM音声合成における話者正規化学習を用いたスタイル変換法の評価2013
- Author(s)
  金川裕紀
- Organizer
  日本音響学会2013年春季研究発表会
- Place of Presentation
  東京工科大学, 東京都八王子市
- Year and Date
  2013-03-13
- Related Report
  2012 Annual Research Report
[Presentation] 統計モデルに基づく音声合成における話者・スタイルの多様化2013
- Author(s)
  能勢隆
- Organizer
  2013年1月度音声研究会
- Place of Presentation
  同志社大学, 京都府京田辺市(招待講演)
- Year and Date
  2013-01-31
- Related Report
  2012 Annual Research Report
[Presentation] 任意話者の多様なスタイル生成のための話者正規化2013
- Author(s)
  金川裕紀
- Organizer
  2013年1月度音声研究会
- Place of Presentation
  同志社大学, 京都府京田辺市
- Year and Date
  2013-01-31
- Related Report
  2012 Annual Research Report
[Presentation] 多様な歌声合成のための重回帰HSMMIに基づくスタイル制御法の検討2013
- Author(s)
  能勢隆
- Organizer
  2013年1月度音声研究会
- Place of Presentation
  同志社大学, 京都府京田辺市
- Year and Date
  2013-01-31
- Related Report
  2012 Annual Research Report
[Presentation] Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis2013
- Author(s)
  郡山知樹
- Organizer
  2013 IEEE International Conference on Acoustics, Speech, and signal Processing, ICASSP 2013
- Place of Presentation
  バンクーバーコンベンション＆エキシビションセンター（カナダ）
- Related Report
  2013 Annual Research Report
[Presentation] Speaker-independent style conversion for HMM-based expressive speech synthesis2013
- Author(s)
  能勢隆
- Organizer
  2013 IEEE International Conference on Acoustics, Speech, and signal Processing, ICASSP 2013
- Place of Presentation
  バンクーバーコンベンション＆エキシビションセンター（カナダ）
- Related Report
  2013 Annual Research Report
[Presentation] Statistical nonparametric speech synthesis using sparse Gaussian processes2013
- Author(s)
  郡山知樹
- Organizer
  14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
- Place of Presentation
  リヨンコンベンションセンター（フランス）
- Related Report
  2013 Annual Research Report
[Presentation] A style control technique for singing voice synthesis based on multiple-regression HSMM2013
- Author(s)
  能勢隆
- Organizer
  14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013
- Place of Presentation
  リヨンコンベンションセンター（フランス）
- Related Report
  2013 Annual Research Report
[Presentation] 複数ドメインコーパスからの文選択に基づくキャラクター音声合成の検討2013
- Author(s)
  荒生　侑介
- Organizer
  日本音響学会2013年秋季研究発表会
- Place of Presentation
  豊橋技術科学大学（愛知）
- Related Report
  2013 Annual Research Report
[Presentation] GMMに基づく声質変換のためのMDL基準による混合数の自動決定2013
- Author(s)
  小林　友哉
- Organizer
  日本音響学会2013年秋季研究発表会
- Place of Presentation
  豊橋技術科学大学（愛知）
- Related Report
  2013 Annual Research Report
[Presentation] スパース近似と畳み込みカーネルを用いたガウス過程回帰に基づく音声合成2013
- Author(s)
  郡山　知樹
- Organizer
  日本音響学会2013年秋季研究発表会
- Place of Presentation
  豊橋技術科学大学（愛知）
- Related Report
  2013 Annual Research Report
[Presentation] 言語モデルと音響モデルを利用したアクセント境界の自動推定2013
- Author(s)
  鈴木　啓史
- Organizer
  第15回音声言語シンポジウム
- Place of Presentation
  筑波大学東京キャンパス文京校舎（東京）
- Related Report
  2013 Annual Research Report
[Presentation] 多様な音声合成に向けた取組みと課題2013
- Author(s)
  小林　隆夫
- Organizer
  第15回音声言語シンポジウム
- Place of Presentation
  筑波大学東京キャンパス文京校舎（東京）
- Related Report
  2013 Annual Research Report
- Invited
[Presentation] HMM音声合成におけるスペクトル特微量の局所変動のモデル化とパラメータ2012
- Author(s)
  能勢隆
- Organizer
  2012年11月度音声研究会
- Place of Presentation
  東北工業大学, 宮城県仙台市
- Year and Date
  2012-11-08
- Related Report
  2012 Annual Research Report
[Presentation] HMM音声合成のための局所的系列内変動を考慮したパラメータ生成の検討2012
- Author(s)
  能勢隆
- Organizer
  日本音響学会2012年秋季研究発表会
- Place of Presentation
  信州大学, 長野県長野市
- Year and Date
  2012-09-20
- Related Report
  2012 Annual Research Report
[Presentation] 共有決定木を利用した話者適応に基づくクロスリンガル音声合成の検討2012
- Author(s)
  能勢隆
- Organizer
  日本音響学会2012年秋季研究発表会
- Place of Presentation
  信州大学,長野県長野市
- Year and Date
  2012-09-20
- Related Report
  2012 Annual Research Report
[Presentation] HMM音声合成における不特定話者スタイル変換のための話者正規化学習法の2012
- Author(s)
  金川裕紀
- Organizer
  日本音響学会2012年秋季研究発表会
- Place of Presentation
  信州大学,長野県長野市
- Year and Date
  2012-09-20
- Related Report
  2012 Annual Research Report
[Presentation] A speech parameter generation algorithm using local variance for HMM-based speech synthesis2012
- Author(s)
  Vataya Chunwijitra
- Organizer
  13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
- Place of Presentation
  Portland, USA
- Year and Date
  2012-09-11
- Related Report
  2012 Annual Research Report
[Presentation] Discontinuous observation HMM for prosodic-e vent-based FO generation2012
- Author(s)
  Tomoki Koriyama
- Organizer
  13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012
- Place of Presentation
  Portland, USA
- Year and Date
  2012-09-10
- Related Report
  2012 Annual Research Report

Research on advanced robust speech synthesis and its applications to multi-lingual speech communication

Principal Investigator

KOBAYASHI Takao 東京工業大学, 総合理工学研究科(研究院), 教授 (70153616)

¥14,300,000 (Direct Cost: ¥11,000,000、Indirect Cost: ¥3,300,000)

Report

Research Products

[Journal Article] ガウス過程回帰に基づく音声合成システムの検討2015

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 言語モデルと音響モデルを用いた自動韻律ラベリングの評価2015

Author(s)

Journal Title

Related Report

[Journal Article] ガウス過程回帰に基づく音声合成のためのコンテキストの検討2015

Author(s)

Journal Title

Related Report

[Journal Article] Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis2015

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Statistical Parametric Speech Synthesis Based on Gaussian Process Regression2014

Author(s)

Journal Title

DOI

Related Report

[Journal Article] A parameter generation algorithm using local variance for HMM-based speech synthesis2014

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization2014

Author(s)

Journal Title

DOI

NAID

Related Report

[Journal Article] Tone modeling using stress information for HMM-based Thai speech synthesis2014

Author(s)

Journal Title

Related Report

[Journal Article] Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis2014

Author(s)

Journal Title

Related Report

[Journal Article] Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling2014

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Parametric speech synthesis using local and global sparse Gaussian processes2014

Author(s)

Journal Title

DOI

NAID

Related Report

[Journal Article] HMM-based Thai speech synthesis using unsupervised stress context labeling2014

Author(s)

Journal Title

DOI

Related Report

[Journal Article] ガウス過程回帰に基づくF0パタン生成の検討2014

Author(s)

Journal Title

NAID

Related Report

[Journal Article] ガウス過程回帰に基づく音声合成におけるハイパーパラメータ最適化の検討2014

Author(s)

Journal Title

Related Report

[Journal Article] 音声合成のための音韻・韻律コンテキストを考慮した文選択アルゴリズムの評価2014

Author(s)

Journal Title

Related Report

[Journal Article] HMM音声合成のための音節出現頻度にロバストな音素セットの検討2014

Author(s)

Journal Title