統計的音声認識・合成のための次世代音響モデリング

Research Project

Project/Area Number	18800019
Research Category	Grant-in-Aid for Young Scientists (Start-up)
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	Nagoya Institute of Technology
Principal Investigator	全炳河 Nagoya Institute of Technology, 工学研究科, 研究員 (60432291)
Project Period (FY)	2006 – 2007
Project Status	Completed (Fiscal Year 2007)
Budget Amount *help	¥2,400,000 (Direct Cost: ¥2,400,000) Fiscal Year 2007: ¥1,200,000 (Direct Cost: ¥1,200,000) Fiscal Year 2006: ¥1,200,000 (Direct Cost: ¥1,200,000)
Keywords	音声認識 / 音声合成 / 音響モデル / 隠れマルコフモデル / トラジェクトリモデル / ベイズ学習 / 話者適応 / セミマルコフモデル
Research Abstract	統計的音声認識や音声合成における音響モデルとして広く用いられている隠れマルコフモデル(HMM)の本質的限界としては,以下の3点がある。(1)状態内で統計量が一定であり,状態内での時間変化をモデル化できない。(2)観測ベクトル間の時間的独立性を仮定しており,時間的な依存関係を表現できない。(3)状態持続時間長確率分布が幾何分布であり,実際の音声の持続時間特性をモデル化できない。これらの問題は,HMMにおける仮定である,状態内での統計的定常性、観測ベクトル間の条件付無相関性、1次のマルコフ過程,に関するものである。HMMの取り扱いの容易さと実装可能性は,これらの仮定によっているものの,実際の音声ラメータ列では成り立たないものである。そこで,本年は,一つ目及びニつ目の問題を同時に解決可能な新しい統計モデリング手法であるトラジェクトリHMMを導入し,これを用いて音声認識、合成を行うための各種アルゴリズムを整備した。具体的には,モンテカルロEMに基づく学習アルゴリズム及び特徴量空間、モデル空間における線形変換を用いた話者適応アルゴリズムを導出した。また,これらのアルゴリズムを実際に音声認識に適用、評価し,国際会議等で発表した。三つ目の問題を解決できる隠れセミマルコフモデル学習のための各種ツールの整備を行い,最新の音声合成システムに組み込み,音声合成システムの国際的な評価会に参加し,優秀な成績を収めた。

Report

(2 results)

2007 Annual Research Report
2006 Annual Research Report

Research Products

(6 results)

All 2007

All Journal Article (3 results) (of which Peer Reviewed: 1 results) Presentation (3 results)

[Journal Article] A hidden semi-Markov model-based speech synthesis system2007
- Author(s)
  Heiga Zen
- Journal Title
  
  IEICE TRANSACTIONS on Information and Svstems E90-D No. 5
  
  Pages: 825-834
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences2007
- Author(s)
  Heiga Zen
- Journal Title
  
  Computer Speech and Language Vol.21 No.1
  
  Pages: 153-173
- Related Report
  2006 Annual Research Report
[Journal Article] Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 20052007
- Author(s)
  Heiga Zen
- Journal Title
  
  IEICE TRANSACTIONS on Information and Systems Vol.E90-D No.1
  
  Pages: 325-333
- Related Report
  2006 Annual Research Report
[Presentation] モデル空間最尤線形回帰に基つくトラジェクトリHMMの話者適応2007
- Author(s)
  全〓河
- Organizer
  日本音響学会
- Place of Presentation
  山梨大学
- Related Report
  2007 Annual Research Report
[Presentation] Model-space MLLR for tralectory HMMs2007
- Author(s)
  Heiga Zen
- Organizer
  Interspeech2008
- Place of Presentation
  アントワープ
- Related Report
  2007 Annual Research Report
[Presentation] The HMM-based speech synthesis system version 2.02007
- Author(s)
  Heiga Zen
- Organizer
  ISCA Speech Synthesis Workshop
- Place of Presentation
  ボン
- Related Report
  2007 Annual Research Report

統計的音声認識・合成のための次世代音響モデリング

Principal Investigator

全 炳河 Nagoya Institute of Technology, 工学研究科, 研究員 (60432291)

¥2,400,000 (Direct Cost: ¥2,400,000)

Report

Research Products

[Journal Article] A hidden semi-Markov model-based speech synthesis system2007

Author(s)

Journal Title

Related Report

[Journal Article] Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences2007

Author(s)

Journal Title

Related Report

[Journal Article] Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 20052007

Author(s)

Journal Title

Related Report

[Presentation] モデル空間最尤線形回帰に基つくトラジェクトリHMMの話者適応2007

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] Model-space MLLR for tralectory HMMs2007

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] The HMM-based speech synthesis system version 2.02007

Author(s)

Organizer

Place of Presentation

Related Report

全炳河 Nagoya Institute of Technology, 工学研究科, 研究員 (60432291)