2007 Fiscal Year Annual Research Report

ディジタルメディアコンテンツ制作のための多様な音声の合成技術

Research Project

Project/Area Number	17300063
Research Institution	Nagoya Institute of Technology
Principal Investigator	徳田恵一 Nagoya Institute of Technology, 大学院・工学研究科, 教授 (20217483)
Co-Investigator(Kenkyū-buntansha)	北村正名古屋工業大学, 大学院・工学研究科, 教授 (60114865) 南角吉彦名古屋工業大学, 助教 (80397497) 戸田智基奈良先端科学技術大学院大学, 情報科学研究科, 助教 (90403328)
Keywords	音声合成 / メディアコンテンツ / 声質 / 発話様式 / HMM
Research Abstract	本研究で克服しなければならない課題は,1)音声データの自動ラベリング,2)合成音声の肉声感の確保,3)話者性と発話様式の分離,4)多様性の拡大,などである。これらの成果をまとめることにより,最終的には,計算機上にバーチャルなアニメーションキャラクターを構築し,研究成果の効果を検証する。声質・スタイル制御等における問題点を検証して,必要ならば改良を加える。言語情報の伝達を主目的とした音声の合成のみならず,歌声など,より広い人間の音声活動について研究を発展させる。これらの研究計画に従い,平成19年度においては以下を行った。 (1)話者性と発話様式を分離してモデル化することの可能なシステムを構築し,所望の効果が得られることを確認した。 (2)前年度の励振源のモデル化手法をさらに改良した。 (3)統合システムの評価を行った。歌声合成などに関しても評価を行った。 (4)最終年度に向けた評価実験のための音声データを収録し,ラベル付けを行った。 (5)海外研究協力者のカーネギーメロン大学・コンピュータサイエンス学部・アランブラック助教授とともにラベリング,多言語化に関する討論・予備実験等を行った。

Research Products
(31 results)

All 2008 2007 2006

All Journal Article (5 results) (of which Peer Reviewed: 5 results) Presentation (26 results)

[Journal Article] Statistical mapping between articulatory movements and acqustic spectrum with a Gaussian mixture model2008
- Author(s)
  Tomoki Toda, Alan W.Black, and Keiichi Tokuda
- Journal Title
  
  Speech Communication vol.50,no.3
  
  Pages: 215-227
- Peer Reviewed
[Journal Article] Voice conversion based on maximum likelihood estimation ofspeech parameter trajectory2007
- Author(s)
  Tomoki Toda, Alan W.Black, and Keiichi Tokuda
- Journal Title
  
  IEEE Transactions on Audio, Speech andLanguage Processing vol.15,no.8
  
  Pages: 2222-2235
- Peer Reviewed
[Journal Article] Hidden SemiMarkov Model Based Speech Synthesis System2007
- Author(s)
  Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura
- Journal Title
  
  IEICE Transactions on Informstion Systems vol.E90-D,no.5
  
  Pages: 825-834
- Peer Reviewed
[Journal Article] Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis2007
- Author(s)
  Tomoki Toda and Keiichi Tokuda
- Journal Title
  
  IEICE Transactionson Information Systems vol.E90-D,no.5
  
  Pages: 816-824
- Peer Reviewed
[Journal Article] An HMM-based Brazilian Portuguese speech synthesizer andits characteristics2006
- Author(s)
  Ranniery Maia, Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, and Fernando G.V.Resende
- Journal Title
  
  Journal of Communication and Information Systems vol.21,no.2
  
  Pages: 132-145
- Peer Reviewed
[Presentation] Performance evaluation of the speaker-independent HMM-based speech synthesis system HTS-2007 for the Blizzard Challenge 20072008
- Author(s)
  Junichi Yamagishi, TakashiNose, Heiga Zen, TomokiToda, Keiichi Tokuda
- Organizer
  2008 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2008)
- Place of Presentation
  Las Vegas,Nevada,U.S.A
- Year and Date
  20080330-0404
[Presentation] Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM2008
- Author(s)
  Tomoki Toda, Keiichi Tokuda
- Organizer
  2008 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2008)
- Place of Presentation
  Las Vegas,Nevada,U.S.A
- Year and Date
  20080330-0404
[Presentation] ON the state definition for a trainable excitation model in HMM-based speech synthesis2008
- Author(s)
  Ranniery Maia, Tomoki Toda, Keiichi Tokuda, ShinsukeSakai, Satoshi Nakamura
- Organizer
  2008 IEEE International Conference on Acoustics, Speech, and SignalProcessing(ICASSP 2008)
- Place of Presentation
  Las Vegas,Nevada,U.S.A
- Year and Date
  20080330-0404
[Presentation] Minimum generation error criterion considering globalllocal v.ariance for HMM-based speech synthesis2008
- Author(s)
  Yi-Jian Wu, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
- Organizer
  2008 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2008)
- Place of Presentation
  Las Vegas,Nevada,U.S.A
- Year and Date
  20080330-0404
[Presentation] 声質変換のためのスペクトルおよびF0の同時モデリング2008
- Author(s)
  宇藤陽介, 南角吉彦, 李晃伸, 徳田恵一
- Organizer
  日本音響学会2008年春季研究発表会
- Place of Presentation
  千葉工業大学
- Year and Date
  2008-03-18
[Presentation] 時系列マッチングを含む統計モデルに基づいた継続長およびスペクトルの同時変換2008
- Author(s)
  油谷かおり, 宇藤陽介, 南角吉彦, 戸田智基, 李晃伸, 徳田恵一
- Organizer
  日本音響学会2008年春季研究発表会
- Place of Presentation
  千葉工業大学
- Year and Date
  2008-03-18
[Presentation] Blizzar Challenge 2007のための平均声に基づくHMM音声合成システムの評価2008
- Author(s)
  能勢隆, 山岸順一, 全柄河, 戸田智基, 徳田恵一
- Organizer
  日本音響学会2008年春季研究発表会
- Place of Presentation
  千葉工業大学
- Year and Date
  2008-03-18
[Presentation] 英語音声合成における韻律推定モデルと音響モデルの同時学習2008
- Author(s)
  大浦圭一郎, 戸田智基, 南角吉彦, 徳田恵一, マイアハニエリ, 坂井信輔, 中村哲
- Organizer
  日本音響学会2008年春季研究発表会
- Place of Presentation
  千葉工業大学
- Year and Date
  2008-03-18
[Presentation] State clustering on an excitiation model for HMM-based speech synthesis2008
- Author(s)
  Ranniery Maia, Tomoki Toda, Keiichi Tokuda, ShinsukeSakai, Satoshi Nakamura
- Organizer
  日本音響学会2008年春季研究発表会
- Place of Presentation
  千葉工業大学
- Year and Date
  2008-03-18
[Presentation] 声質と歌唱スタイルを自動学習可能な歌声合成システム2008
- Author(s)
  酒向慎司, 才野慶二郎, 南角吉彦, 徳田恵一, 北村正
- Organizer
  情報処理学会
- Place of Presentation
  伊東温泉ホテル暖香園
- Year and Date
  2008-02-08
[Presentation] Model-Space MLLR for Trajectoty HMMs2007
- Author(s)
  Heiga Zen, Yoshihiko Nankaku, Keuchi Tokuda
- Organizer
  Interspeech 2007-EUR, OSPEECH
- Place of Presentation
  Antwerp,Belguim
- Year and Date
  20070827-31
[Presentation] A trainable excitation model for HMM-based speech synthesis2007
- Author(s)
  Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
- Organizer
  Interspeech 2007-EUR.OSPEECH
- Place of Presentation
  Antwerp,Belguim
- Year and Date
  20070827-31
[Presentation] Spectral conversion based on statistical models including time -frequency matching2007
- Author(s)
  Yoshihiko Nankaku, Kenichi Nakamura, and Keiichi To kuda
- Organizer
  Proc.of 6th ISCA Speech Synthesis Workshop
- Place of Presentation
  Bonn,Germany
- Year and Date
  20070822-24
[Presentation] An excitation model for HMM-based speech synthesis basedon residual modeling2007
- Author(s)
  Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
- Organizer
  Proc.of 6th ISCA Speech Synthesis Workshop
- Place of Presentation
  Bonn,Germany
- Year and Date
  20070822-24
[Presentation] Improved average-voice-based speech synthesis usinggender-mixed modeling and a parameter generation algorithm considBring GV2007
- Author(s)
  Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki Toda, Keiichi Tokuda
- Organizer
  Proc.of 6th ISCA Speech Synthesis Workshop
- Place of Presentation
  Bonn,Germany
- Year and Date
  20070822-24
[Presentation] The HMM-based speech synt hesis system (HTS)version 2.02007
- Author(s)
  Heiga Zen, Takashi Nose, Junichi Yamagishi, Shinji Sako, Takashi Masuko, Alan W.Black, Keiichi Tokuda
- Organizer
  Proc.of 6th ISCA Speech Synthesis Workshop
- Place of Presentation
  Bonn,Germany
- Year and Date
  20070822-24
[Presentation] Statistical parametric speech synthesis2007
- Author(s)
  Alan W.Black, Heiga Zen, Keuchi Tokuda
- Organizer
  2007 IEEE International Conference on Acoustics, Speech, and SignalProcessing(ICASSP 2007)
- Place of Presentation
  Hawaii,USA
- Year and Date
  20070415-20
[Presentation] Acoustic modeling with contextual additive structure for HMM-based speech recognition2007
- Author(s)
  Yoshihiko Nankaku, Kazuhiro Nakamura, Heiga Zen, Keiichi Tokuda
- Organizer
  2008 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2008)
- Place of Presentation
  Las Vegas,Nevada,U.S.A
- Year and Date
  20070330-0404
[Presentation] Recent development of the HMM-based speech synthesis system(HTS)2007
- Author(s)
  Heiga Zen, Keiichiro Oura, Takashi Nose, Junichi Yamagishi, Shinji Sako, TomokiToda, Takashi Masuko, Alan W.Black, Keiichi Tokuda
- Organizer
  電子情報通信学会(音声言語シンポジウム)
- Place of Presentation
  NTTけいはんな
- Year and Date
  2007-12-21
[Presentation] 変分ベイズ法に基づく声質変換2007
- Author(s)
  丸目雅浩, 南角吉彦, 酒向慎司, 徳田恵一, 北村正
- Organizer
  電子情報通信学会(音声言語シンポジウム)
- Place of Presentation
  NTTけいはんな
- Year and Date
  2007-12-21
[Presentation] 声質変換のためのスペクトル・F0の同時モデリング2007
- Author(s)
  宇藤陽介, 南角吉彦, 李晃伸, 徳田恵一
- Organizer
  電子情報通信学会(音声言語シンポジウム)
- Place of Presentation
  NTTけいはんな
- Year and Date
  2007-12-20
[Presentation] 因子分析に基づく固有声モデルを用いたHMM音声合成2007
- Author(s)
  才野慶二郎, 全柄河, 南角吉彦, 李晃伸, 徳田恵一
- Organizer
  日本音響学会2007年秋季研究発表会
- Place of Presentation
  山梨大学
- Year and Date
  2007-09-21
[Presentation] MEL-LSPを用いたHMM音声合成におけるポストフィルタリングの検討2007
- Author(s)
  大浦圭一郎, 全柄河, 南角吉彦, 李晃伸, 徳田恵一
- Organizer
  日本音響学会2007年秋季研究発表会
- Place of Presentation
  山梨大学
- Year and Date
  2007-09-21
[Presentation] Evaluation of parameter optimization methods for millimum generation error based HMM training2007
- Author(s)
  Yi-Jian Wu, Heiga Zen, 「Yoshihiko Nankaku, Keiichi Tokuda
- Organizer
  日本音響学会2007年秋季研究発表会
- Place of Presentation
  山梨大学
- Year and Date
  2007-09-21
[Presentation] モデル鵬尤線形回腿つぐトラジェクトリmmの話櫛2007
- Author(s)
  全柄河, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2007年秋季研究発表会
- Place of Presentation
  山梨大学
- Year and Date
  2007-09-20
[Presentation] Speaker-independent HMM-based speech synthesis system-HTS-2007 system for the Blizzard Challenge 20072007
- Author(s)
  Junichi Yamagishi, Heiga Zen, Tomoki Toda, Keiichi Tokuda
- Organizer
  Proc.of Blizzard, Challenge 2007
- Place of Presentation
  Bonn,Germany
- Year and Date
  2007-08-25

2007 Fiscal Year Annual Research Report

ディジタルメディアコンテンツ制作のための多様な音声の合成技術

Principal Investigator

徳田 恵一 Nagoya Institute of Technology, 大学院・工学研究科, 教授 (20217483)

Research Products

[Journal Article] Statistical mapping between articulatory movements and acqustic spectrum with a Gaussian mixture model2008

Author(s)

Journal Title

[Journal Article] Voice conversion based on maximum likelihood estimation ofspeech parameter trajectory2007

Author(s)

Journal Title

[Journal Article] Hidden SemiMarkov Model Based Speech Synthesis System2007

Author(s)

Journal Title

[Journal Article] Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis2007

Author(s)

Journal Title

[Journal Article] An HMM-based Brazilian Portuguese speech synthesizer andits characteristics2006

Author(s)

Journal Title

[Presentation] Performance evaluation of the speaker-independent HMM-based speech synthesis system HTS-2007 for the Blizzard Challenge 20072008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ON the state definition for a trainable excitation model in HMM-based speech synthesis2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Minimum generation error criterion considering globalllocal v.ariance for HMM-based speech synthesis2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 声質変換のためのスペクトルおよびF0の同時モデリング2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 時系列マッチングを含む統計モデルに基づいた継続長およびスペクトルの同時変換2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Blizzar Challenge 2007のための平均声に基づくHMM音声合成システムの評価2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 英語音声合成における韻律推定モデルと音響モデルの同時学習2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] State clustering on an excitiation model for HMM-based speech synthesis2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 声質と歌唱スタイルを自動学習可能な歌声合成システム2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Model-Space MLLR for Trajectoty HMMs2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] A trainable excitation model for HMM-based speech synthesis2007

Author(s)

Organizer

Place of Presentation

Year and Date

徳田恵一 Nagoya Institute of Technology, 大学院・工学研究科, 教授 (20217483)