2008 Fiscal Year Annual Research Report

シングルマイクロフォンによる発話区間検出及び音源方向推定の研究

Research Project

Project/Area Number	19700170
Research Institution	Kobe University
Principal Investigator	滝口哲也 Kobe University, 自然科学系先端融合研究環都市安全研究センター, 講師 (40397815)
Keywords	音声認識 / 音源方向推定 / 雑音除去
Research Abstract	マイクスイッチレスな音声認識の実現を目指し, 1. 未知な突発性雑音の除去手法, 2. 雑音に頑健な音声特徴量抽出手法, 3. シングルチャネルでの複数話者方向推定法の提案を行った. 1. 「音声の動的特徴のモデルを用いた突発性雑音除去手法」 H19年度に提案したEMアルゴリズムとニュートン法に基づく雑音除去手法を更に改善し, 音声の動的特徴のモデル化を新たに組み合わせた手法を提案した. 提案手法により, 500単語認識実験にて, 77.7%(H19年度の提案手法)から87.9%まで認識率が改善された. 2. 「雑音に頑健な非負行列因子分解を用いた音声特徴抽出法の提案」各音素の部分空間の統合方法として, 高次空間から低次空間への写像が実現可能であるNMF特徴量抽出法を提案した. データの相関を利用したNMF初期化手法を組み込むことにより, 残響下にて従来手法よりも認識率の改善が得られた. また, ランダムプロジェクションによる新しい特徴量変換手法も提案し, 雑音下音声認識において有効性を示した. 3. 「モデル合成法によるシングルマイクロフォン音源方向推定の提案」これまでに提案されてきた音源位置の推定方法は, マイクロフォンアレーにおける各観測信号の時間差を用いた手法が多く, 複数のマイクロフォンが必要であった. シングルマイクロフォンによる音源位置推定を可能にするため, H19年度では, 音声の統計モデルであるGMMを用いて尤度最大化基準により音声と音響伝達特性を分離することにより実現した. ただし一つの音源のみが存在する環境のみ対応出来るものであったため, H20年度では, 複数音源に対応出来るようにモデル合成法を用いた音源方向推定法を提案し, 二音源に対する同時方向推定実験により有効性を確認した.

Research Products
(19 results)

All 2009 2008

All Journal Article (7 results) (of which Peer Reviewed: 7 results) Presentation (11 results) Book (1 results)

[Journal Article] Speaker Independent Phoneme Recognition Based on Fisher Weight Map2008
- Author(s)
  Takashi Muroi
- Journal Title
  
  International Journal of Hybrid Information Technology Vol.1, No.3
  
  Pages: 61-70
- Peer Reviewed
[Journal Article] Human-Robot Interface Using System, Request Utterance Detection Based on Acoustic Features2008
- Author(s)
  Tetsuya Takiguchi
- Journal Title
  
  International Journal of Hybrid Information Technology Vol.1, No.3
  
  Pages: 81-90
- Peer Reviewed
[Journal Article] Integration of Metamodel and Acoustic Model for Speech Recognition2008
- Author(s)
  Hironori Matsumasa
- Journal Title
  
  Interspeech
  
  Pages: 2234-2237
- Peer Reviewed
[Journal Article] Sudden Noise Reduction Based on GMM with Noise Power Estimation2008
- Author(s)
  Nobuyuki Miyake
- Journal Title
  
  Interspeech
  
  Pages: 403-406
- Peer Reviewed
[Journal Article] Integration of Phoneme-subspaces Using ICA for Speech Feature Extraction and Recognition2008
- Author(s)
  Hyunsin Park
- Journal Title
  
  Joint workshop on Hands-free Speech Communication and Microphone Arrays
  
  Pages: 148-151
- Peer Reviewed
[Journal Article] Active Microphone with Parabolic Reflection Board for Estimation of Sound Source Direction2008
- Author(s)
  Tetsuya Takiguchi
- Journal Title
  
  Joint workshop on Hands-free Speech Communication and Microphone Arrays
  
  Pages: 65-68
- Peer Reviewed
[Journal Article] Estimation of Sound Source Direction Using Parabolic Reflection Board2008
- Author(s)
  Tetsuya Takiguchi
- Journal Title
  
  International Workshop on Nonlinear Circuits and Signal Processing
  
  Pages: 9-12
- Peer Reviewed
[Presentation] パラボラ反射板による音響伝達特性の変化を用いたシングルチャネル音源方向推定2009
- Author(s)
  高島遼一
- Organizer
  日本音響学会2009年春季研究発表会
- Place of Presentation
  東京工業大学
- Year and Date
  2009-03-19
[Presentation] 音響伝達特性モデルを用いたシングルチャネル音源位置推定の検討2009
- Author(s)
  高島遼一
- Organizer
  日本音響学会2009年春季研究発表会
- Place of Presentation
  東京工業大学
- Year and Date
  2009-03-18
[Presentation] 勾配ヒストグラムに基づく時間-周波数特徴を用いた単語認識2009
- Author(s)
  室井貴司
- Organizer
  日本音響学会2009年春季研究発表会
- Place of Presentation
  東京工業大学
- Year and Date
  2009-03-17
[Presentation] ランダムプロジェクションを用いた音声特徴量抽出2009
- Author(s)
  吉井麻里子
- Organizer
  日本音響学会2009年春季研究発表会
- Place of Presentation
  東京工業大学
- Year and Date
  2009-03-17
[Presentation] スペクトル平面における勾配ヒストグラムに基づく音声特徴量の検討2008
- Author(s)
  室井貴司
- Organizer
  第10回音声言語シンポジウム
- Place of Presentation
  早稲田大学
- Year and Date
  2008-12-10
[Presentation] 音声の動的特徴のモデルを使った突発性雑音の除去2008
- Author(s)
  三宅信之
- Organizer
  第10回音声言語シンポジウム
- Place of Presentation
  早稲田大学
- Year and Date
  2008-12-10
[Presentation] 制約付き非負行列因子分解を用いた音声特徴抽出の検討2008
- Author(s)
  朴玄信
- Organizer
  第10回音声言語シンポジウム
- Place of Presentation
  早稲田大学
- Year and Date
  2008-12-09
[Presentation] GMMに基づく音声特徴量の時間変動を考慮した突発性雑音の除去2008
- Author(s)
  三宅信之
- Organizer
  日本音響学会2008年秋季研究発表会
- Place of Presentation
  九州大学
- Year and Date
  2008-09-12
[Presentation] アクティブマイクロフォンによる音響伝達特性を用いたシングルチャネル音源方向推定2008
- Author(s)
  高島遼一
- Organizer
  日本音響学会2008年秋季研究発表会
- Place of Presentation
  九州大学
- Year and Date
  2008-09-12
[Presentation] MDL基準とICAを用いた統合音素部分空間による音声特徴量抽出の検討2008
- Author(s)
  朴玄信
- Organizer
  日本音響学会2008年秋季研究発表会
- Place of Presentation
  九州大学
- Year and Date
  2008-09-11
[Presentation] 勾配に基づく特徴量を用いた音声認識の検討2008
- Author(s)
  室井貴司
- Organizer
  日本音響学会2008年秋季研究発表会
- Place of Presentation
  九州大学
- Year and Date
  2008-09-11
[Book] System Request Utterance Detection Based on Acoustic and Linguistic Features2008
- Author(s)
  Tetsuya Takiguchi
- Total Pages
  539-550
- Publisher
  I-Tech Education and Publishing

2008 Fiscal Year Annual Research Report

シングルマイクロフォンによる発話区間検出及び音源方向推定の研究

Principal Investigator

滝口 哲也 Kobe University, 自然科学系先端融合研究環都市安全研究センター, 講師 (40397815)

Research Products

[Journal Article] Speaker Independent Phoneme Recognition Based on Fisher Weight Map2008

Author(s)

Journal Title

[Journal Article] Human-Robot Interface Using System, Request Utterance Detection Based on Acoustic Features2008

Author(s)

Journal Title

[Journal Article] Integration of Metamodel and Acoustic Model for Speech Recognition2008

Author(s)

Journal Title

[Journal Article] Sudden Noise Reduction Based on GMM with Noise Power Estimation2008

Author(s)

Journal Title

[Journal Article] Integration of Phoneme-subspaces Using ICA for Speech Feature Extraction and Recognition2008

Author(s)

Journal Title

[Journal Article] Active Microphone with Parabolic Reflection Board for Estimation of Sound Source Direction2008

Author(s)

Journal Title

[Journal Article] Estimation of Sound Source Direction Using Parabolic Reflection Board2008

Author(s)

Journal Title

[Presentation] パラボラ反射板による音響伝達特性の変化を用いたシングルチャネル音源方向推定2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 音響伝達特性モデルを用いたシングルチャネル音源位置推定の検討2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 勾配ヒストグラムに基づく時間-周波数特徴を用いた単語認識2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ランダムプロジェクションを用いた音声特徴量抽出2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] スペクトル平面における勾配ヒストグラムに基づく音声特徴量の検討2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 音声の動的特徴のモデルを使った突発性雑音の除去2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 制約付き非負行列因子分解を用いた音声特徴抽出の検討2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] GMMに基づく音声特徴量の時間変動を考慮した突発性雑音の除去2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] アクティブマイクロフォンによる音響伝達特性を用いたシングルチャネル音源方向推定2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] MDL基準とICAを用いた統合音素部分空間による音声特徴量抽出の検討2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 勾配に基づく特徴量を用いた音声認識の検討2008

Author(s)

Organizer

Place of Presentation

滝口哲也 Kobe University, 自然科学系先端融合研究環都市安全研究センター, 講師 (40397815)