2006 年度実績報告書

生成過程モデルに基づくコーパスベース韻律生成による任意の発話スタイルの音声合成

研究課題

研究課題/領域番号	17300055
研究機関	東京大学
研究代表者	広瀬啓吉東京大学, 大学院情報理工学系研究科, 教授 (50111472)
研究分担者	峯松信明東京大学, 大学院新領域創成科学研究科, 助教授 (90273333)
キーワード	生成過程モデル / 基本周波数パターン / 感情の程度 / 統計的手法 / HMM音声合成 / 2段階手法 / 焦点制御 / 音声コーパス
研究概要	主として種々の発話スタイルの実現を目的として研究を進め、下記の成果を達成した。 1.生成過程モデルの指令の位置と大きさを、2分木により推定することにより、テキストから基本周波数パターンを生成する手法を開発したが、その中で、感情に関する要因は指令の大きさのみに関与することを明らかにした。 2.感情音声について、文節毎に、そこに込められた感情の程度を2段階でラベリングした音声コーパスを作成した。この感情の程度を上記1の手法の入力項目とすることにより、より適切な感情表現の合成音声が得られることを示した。 3.テキストから、休止の位置と長さ、各音素長、基本周波数パターンを、この順で生成し、HMM音声合成により合成音声を得る、テキスト音声合成の統計的手法を構築した。基本周波数パターンの生成は、上記1に基づくが、その際、推定した休止の位置と長さを入力項目とする必要があることを示した。また、アクセント指令の時点に制約を加えることで、推定精度が向上することを示した。 4.中国語を対象とした基本周波数パターン生成手法を開発し、その有効性を合成音声の聴取実験により確認した。この手法は、フレーズ成分をルールベースで、アクセント成分をコーパスベースで生成するものである。アクセント成分については、音調核という安定した基本周波数パターン部のみを2分木により推定し、それらを直線補間することで、少ないコーパスでも良好な推定が可能なことを示した。 5.上記の手法において、アクセント成分の推定の入力項目として、生成したフレーズ成分の情報を加える2段階手法を提案し、その有効性を合成音声の聴取により示した。また、この手法によれば、強調したい語の直前にフレーズ指令を置くことにより、合成音声において適切な焦点制御が実現できることを示した。これは、生成過程モデルにより、柔軟な韻律制御が可能なことを示す具体例である。

研究成果
(12件)

すべて 2007 2006

すべて雑誌論文 (12件)

[雑誌論文] Synthesis of fundamental frequency contours for Standard Chinese based on superpositional and tone nucleus models2007
- 著者名/発表者名
  Keikichi Hirose
- 雑誌名
  
  Archives of Acoustics 32・1
  
  ページ: 41-50
[雑誌論文] Prosody in spoken language technologies(Special Lecture)2007
- 著者名/発表者名
  Keikichi Hirose
- 雑誌名
  
  Proceedings of International Workshop on Nonlinear Circuits and Signal Processing(NCSP2007) CD-ROM
  
  ページ: 615-622
[雑誌論文] Generation of sentence F0 contours for Mandarin speech synthesis by superposing tone components on phrase components2007
- 著者名/発表者名
  Qinghua Sun
- 雑誌名
  
  Proceedings of International Workshop on Nonlinear Circuits and Signal Processing(NCSP2007) CD-ROM
  
  ページ: 317-320
[雑誌論文] 日本語音声合成を目的としたアクセント処理のための規則と統計的学習2007
- 著者名/発表者名
  黒岩龍
- 雑誌名
  
  日本音響学会講演論文集 CD-ROM
  
  ページ: 301-302
[雑誌論文] 道案内対話システムにおける応答音声の評価2007
- 著者名/発表者名
  八木裕司
- 雑誌名
  
  日本音響学会講演論文集 CD-ROM
  
  ページ: 9-10
[雑誌論文] 基本周波数パターン生成過程モデルを用いたテキストからのコーパスベース韻律生成とその評価2007
- 著者名/発表者名
  越智景子
- 雑誌名
  
  日本音響学会講演論文集 CD-ROM
  
  ページ: 231-232
[雑誌論文] Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin2006
- 著者名/発表者名
  Jinfu Ni
- 雑誌名
  
  Speech Communication 62・5
  
  ページ: 370-378
[雑誌論文] Modeling the effects of emphasis and question on fundamental frequency contours of Cantonese utterances2006
- 著者名/発表者名
  Wentao Gu
- 雑誌名
  
  IEEE Transactions on Speech and Audio Processing 14・4
  
  ページ: 1155-1170
[雑誌論文] Corpus-based synthesis of fundamental frequency contours using generation process model and automatic preparation of training corpora(Plenary)2006
- 著者名/発表者名
  Keikichi Hirose
- 雑誌名
  
  Conference Guide for Oriental COCOSDA 2006 - International Conference on Speech Databases and Assessment
  
  ページ: 12
[雑誌論文] Rule-based generation of phrase components in two-step synthesis of fundamental frequency contours of Mandarin2006
- 著者名/発表者名
  Qinghua Sun
- 雑誌名
  
  Proceedings of International Conference on Speech Prosody 2
  
  ページ: 561-564
[雑誌論文] Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses2006
- 著者名/発表者名
  Keikichi Hirose
- 雑誌名
  
  Proceedings of Interspeech 2006-ICSLP CD-ROM
  
  ページ: 305-308
[雑誌論文] 生成過程モデルによるコーパスベース韻律制御のための休止推定2006
- 著者名/発表者名
  越智景子
- 雑誌名
  
  日本音響学会講演論文集 CD-ROM
  
  ページ: 269-270

2006 年度 実績報告書

生成過程モデルに基づくコーパスベース韻律生成による任意の発話スタイルの音声合成

研究代表者

広瀬 啓吉 東京大学, 大学院情報理工学系研究科, 教授 (50111472)

研究成果

[雑誌論文] Synthesis of fundamental frequency contours for Standard Chinese based on superpositional and tone nucleus models2007

著者名/発表者名

雑誌名

[雑誌論文] Prosody in spoken language technologies(Special Lecture)2007

著者名/発表者名

雑誌名

[雑誌論文] Generation of sentence F0 contours for Mandarin speech synthesis by superposing tone components on phrase components2007

著者名/発表者名

雑誌名

[雑誌論文] 日本語音声合成を目的としたアクセント処理のための規則と統計的学習2007

著者名/発表者名

雑誌名

[雑誌論文] 道案内対話システムにおける応答音声の評価2007

著者名/発表者名

雑誌名

[雑誌論文] 基本周波数パターン生成過程モデルを用いたテキストからのコーパスベース韻律生成とその評価2007

著者名/発表者名

雑誌名

[雑誌論文] Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin2006

著者名/発表者名

雑誌名

[雑誌論文] Modeling the effects of emphasis and question on fundamental frequency contours of Cantonese utterances2006

著者名/発表者名

雑誌名

[雑誌論文] Corpus-based synthesis of fundamental frequency contours using generation process model and automatic preparation of training corpora(Plenary)2006

著者名/発表者名

雑誌名

[雑誌論文] Rule-based generation of phrase components in two-step synthesis of fundamental frequency contours of Mandarin2006

著者名/発表者名

雑誌名

[雑誌論文] Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses2006

著者名/発表者名

雑誌名

[雑誌論文] 生成過程モデルによるコーパスベース韻律制御のための休止推定2006

著者名/発表者名

雑誌名

2006 年度実績報告書

広瀬啓吉東京大学, 大学院情報理工学系研究科, 教授 (50111472)