Research on robust spoken language interfaces for diverse voice variability and expressivity
Project/Area Number | 21300063 |
Research Category | Grant-in-Aid for Scientific Research (B) |
Allocation Type | Single-year Grants |
Section | General |
Research Field | Perception information processing / Intelligent robotics |
Research Institution | Tokyo Institute of Technology |
Principal Investigator | KOBAYASHI Takao, Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Professor (70153616) |
Co-Investigator (Renkei-kenkyūsha) | NAGAHASHI Hiroshi, Tokyo Institute of Technology, Imaging Science and Engineering Laboratory, Professor (20143084) |
Research Collaborator | NOSE Takashi, Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Assistant Professor (90550591) |
Project Period (FY) | 2009 – 2011 |
Project Status | Completed (Fiscal Year 2011) |
Budget Amount | ¥9,750,000 (Direct Cost: ¥7,500,000; Indirect Cost: ¥2,250,000) |
Fiscal Year 2011: ¥2,470,000 (Direct Cost: ¥1,900,000; Indirect Cost: ¥570,000)
Fiscal Year 2010: ¥3,510,000 (Direct Cost: ¥2,700,000; Indirect Cost: ¥810,000)
Fiscal Year 2009: ¥3,770,000 (Direct Cost: ¥2,900,000; Indirect Cost: ¥870,000)
Keywords | speech information processing / HMM-based speech synthesis / text-to-speech synthesis / quantized fundamental frequency context / spontaneous speech / multiple-regression HMM / speech style control / speech style estimation / model adaptation / dialogue speech synthesis / style estimation / voice conversion / F0 quantization / speaking style |
Research Abstract |
The purpose of this research is to develop techniques that make human-computer interaction using speech input and output more robust to variations in users' emotional states, speaking styles, preferences, and expressivity. We have proposed techniques using a quantized fundamental frequency (F0) prosodic context for robust speech synthesis, and an extended context set for spontaneous conversational speech synthesis. We have also proposed techniques for robust speech recognition, including extraction of paralinguistic information and rapid model adaptation.
|
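The quantized F0 prosodic context mentioned in the abstract can be illustrated with a minimal sketch. This is a hypothetical illustration only: the function name, number of quantization levels, and F0 range below are assumptions, not the scheme actually used in the project. The idea is that a per-segment mean F0 is mapped to one of a few discrete levels in the log-F0 domain, and that level is then usable as an additional context feature in HMM-based synthesis labels.

```python
import math

def quantize_f0(f0_values, n_levels=4, f0_min=80.0, f0_max=400.0):
    """Map per-segment mean F0 values (Hz) to discrete prosodic levels.

    Quantization is uniform in the log-F0 domain; values outside
    [f0_min, f0_max] are clipped. Returns one integer level per
    segment (0 = lowest band, n_levels - 1 = highest band).
    """
    lo, hi = math.log(f0_min), math.log(f0_max)
    step = (hi - lo) / n_levels
    levels = []
    for f0 in f0_values:
        x = math.log(min(max(f0, f0_min), f0_max))  # clip, then take log
        level = min(int((x - lo) / step), n_levels - 1)
        levels.append(level)
    return levels

# Example: mean F0 per mora for a short utterance
print(quantize_f0([120.0, 180.0, 260.0, 95.0]))  # → [1, 2, 2, 0]
```

Quantizing in the log domain reflects the roughly logarithmic perception of pitch; the discrete levels make the prosodic context robust to small F0 estimation errors.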
Report (4 results)
Research Products (109 results)