2009 Fiscal Year Annual Research Report

Analysis of the Speaking Styles of People with Articulation Disorders Caused by Cerebral Palsy, and Research on Hands-Free Communication

Research Project

Project/Area Number	21680054
Research Institution	Kobe University
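Principal Investigator	滝口哲也 (Tetsuya Takiguchi) Kobe University, Organization of Advanced Science and Technology, Research Center for Urban Safety and Security, Associate Professor (40397815)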
Keywords	Speech recognition / Cerebral palsy / Articulation disorder
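Research Abstract	Aiming to enable spoken communication for people with articulation disorders caused by cerebral palsy, we studied (1) the construction of a phoneme system for speakers with articulation disorders, (2) speech recognition for such speakers that integrates lip-region features robust to face orientation with audio features, and (3) speech feature extraction using random projection.
1. "Constructing a phoneme system for speakers with articulation disorders using PLSA": Conventional speech recognition for speakers with articulation disorders is built on the phoneme system of non-disabled speakers, but the two groups articulate differently and their phoneme systems do not match. We therefore investigated a method that automatically generates phoneme models using PLSA (Probabilistic Latent Semantic Analysis) and performs speech recognition with those models. We showed that utterances consisting only of vowels can be recognized with 100% accuracy.
2. "Speech recognition for speakers with articulation disorders by integrating lip-region features robust to face orientation with audio features": For speakers with athetoid-type articulation disorders, muscle tension tends to make speech unstable, and the head may move during speech. To handle the unstable speech, we use segment features of delta cepstrum coefficients as the audio features. To handle head movement during speech, we use an Active Appearance Model (AAM) to extract lip-region features from images that are robust to face orientation. By using these together with the audio features, we proposed a multimodal speech recognition method that is unaffected by noise and accounts for utterance variation, and demonstrated its effectiveness.
3. "Speech feature extraction using random projection": We proposed a method that mechanically transforms the speech features using multiple random matrices, votes over the speech recognition results obtained for each random projection, and selects the best recognition result, and demonstrated its effectiveness.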
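The voting scheme in item 3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the method as implemented in the project: a nearest-centroid classifier stands in for the speech recognizer, the random matrices are Gaussian, and the names `fit_centroids`, `predict_nearest`, `random_projection_vote`, `n_projections`, and `out_dim` are hypothetical.

```python
import numpy as np
from collections import Counter


def fit_centroids(X, y):
    """Toy stand-in for an acoustic model: one mean vector per class."""
    labels = np.unique(y)
    centroids = np.stack([X[y == c].mean(axis=0) for c in labels])
    return labels, centroids


def predict_nearest(X, labels, centroids):
    """Assign each row of X to the class with the closest centroid."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return labels[dists.argmin(axis=1)]


def random_projection_vote(X_train, y_train, X_test,
                           n_projections=5, out_dim=None, seed=0):
    """Classify X_test under several random projections and majority-vote."""
    rng = np.random.default_rng(seed)
    dim = X_train.shape[1]
    out_dim = out_dim or dim  # assumption: keep the dimension by default
    votes = []
    for _ in range(n_projections):
        # Assumption: a Gaussian random matrix, scaled by 1/sqrt(out_dim).
        R = rng.standard_normal((dim, out_dim)) / np.sqrt(out_dim)
        labels, centroids = fit_centroids(X_train @ R, y_train)
        votes.append(predict_nearest(X_test @ R, labels, centroids))
    votes = np.stack(votes)  # shape: (n_projections, n_test)
    # Majority vote per test sample across all projections.
    return np.array([Counter(col.tolist()).most_common(1)[0][0]
                     for col in votes.T])
```

Each projection yields its own recognition result, and the final label for a sample is the one chosen most often. In a real system the per-projection classifier would be a full speech recognizer trained on the projected features.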

Research Products
(18 results)

Journal Article (8 results, of which Peer Reviewed: 8 results) / Presentation (10 results)

[Journal Article] Gradient-Based Acoustic Features for Speech Recognition (2009)
- Author(s)
  Takashi Muroi
- Journal Title
  ISPACS
  Pages: 445-448
- Peer Reviewed
[Journal Article] System Request Detection in Human Conversation Based on Multi-Resolution Gabor Wavelet Features (2009)
- Author(s)
  Tomoyuki Yamagata
- Journal Title
  Interspeech
  Pages: 256-259
- Peer Reviewed
[Journal Article] Single-Channel Multi-Talker-Localization Based on Maximum Likelihood (2009)
- Author(s)
  Ryoichi Takashima
- Journal Title
  IEEE Statistical Signal Processing Workshop
  Pages: 769-772
- Peer Reviewed
[Journal Article] Generic Object Recognition using CRF by Incorporating BoF as Global Features (2009)
- Author(s)
  Takeshi Okumura
- Journal Title
  International Conference on Multimedia, Information Technology and its Applications
  Pages: 49-52
- Peer Reviewed
[Journal Article] Speech Feature Extraction Using Weighted Higher-Order Local Auto-Correlation (2009)
- Author(s)
  Yasuo Ariki
- Journal Title
  Far East Journal of Electronics and Communications, Volume 3, Issue 2
  Pages: 125-140
- Peer Reviewed
[Journal Article] Integration of Metamodel and Acoustic Model for Dysarthric Speech Recognition (2009)
- Author(s)
  Hironori Matsumasa
- Journal Title
  Journal of Multimedia, Volume 4, Issue 4
  Pages: 254-261
- Peer Reviewed
[Journal Article] Graph Cuts Segmentation by Using Local Texture Features of Multiresolution Analysis (2009)
- Author(s)
  Keita Fukuda
- Journal Title
  IEICE Transactions on Information and Systems, Vol.E92-D, No.7
  Pages: 1452-1462
- Peer Reviewed
[Journal Article] Pose Robust and Person Independent Facial Expressions Recognition Using AAM Selection (2009)
- Author(s)
  Tomoko Okada
- Journal Title
  ISCE
  Pages: 637-638
- Peer Reviewed
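[Presentation] Linear Transformation of Acoustic Models Using Random Projection (2010)
- Author(s)
  吉井麻里子
- Organizer
  Acoustical Society of Japan, 2010 Spring Meeting
- Place of Presentation
  The University of Electro-Communications (Tokyo)
- Year and Date
  2010-03-09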
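[Presentation] A Study of Single-Channel Sound Source Localization Using Reverberation Adaptation Parameters (2010)
- Author(s)
  高島遼一
- Organizer
  Acoustical Society of Japan, 2010 Spring Meeting
- Place of Presentation
  The University of Electro-Communications (Tokyo)
- Year and Date
  2010-03-09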
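[Presentation] A Study of Constructing a Phoneme System for Speakers with Articulation Disorders Using PLSA (2010)
- Author(s)
  高塚智敬
- Organizer
  Acoustical Society of Japan, 2010 Spring Meeting
- Place of Presentation
  The University of Electro-Communications (Tokyo)
- Year and Date
  2010-03-09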
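[Presentation] Speech Feature Extraction Using a Bilateral Filter for Speech Recognition in Noisy Environments (2010)
- Author(s)
  山田馨土朗
- Organizer
  Acoustical Society of Japan, 2010 Spring Meeting
- Place of Presentation
  The University of Electro-Communications (Tokyo)
- Year and Date
  2010-03-09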
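[Presentation] A Study of Independence Test Methods for Structure Construction of Buried Markov Models (2010)
- Author(s)
  山本隆之
- Organizer
  Acoustical Society of Japan, 2010 Spring Meeting
- Place of Presentation
  The University of Electro-Communications (Tokyo)
- Year and Date
  2010-03-09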
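[Presentation] Speech Recognition Error Correction on Confusion Networks Based on a Discriminative Language Model (2010)
- Author(s)
  松本智彦
- Organizer
  Acoustical Society of Japan, 2010 Spring Meeting
- Place of Presentation
  The University of Electro-Communications (Tokyo)
- Year and Date
  2010-03-08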
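[Presentation] Speech Recognition for Speakers with Articulation Disorders Using AAM-Based Lip-Region Feature Extraction Robust to Face Orientation and Audio Features (2009)
- Author(s)
  宮本千琴
- Organizer
  11th Spoken Language Symposium
- Place of Presentation
  The University of Tokyo (Tokyo)
- Year and Date
  2009-12-22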
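[Presentation] A Consideration of Dynamic Features in Speech Recognition for Speakers with Articulation Disorders (2009)
- Author(s)
  宮本千琴
- Organizer
  IEICE Technical Report, SP2009-55
- Place of Presentation
  Aomori Prefecture Tourist Center ASPAM (Aomori Prefecture)
- Year and Date
  2009-10-29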
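[Presentation] A Study of Speech Recognition for Speakers with Articulation Disorders Using Local Features (2009)
- Author(s)
  宮本千琴
- Organizer
  Acoustical Society of Japan, 2009 Autumn Meeting
- Place of Presentation
  Nihon University (Fukushima Prefecture)
- Year and Date
  2009-09-15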
[Presentation] Integration of Random Matrices in Speech Feature Extraction Using Random Projection (2009)
- Author(s)
  吉井麻里子
- Organizer
  Acoustical Society of Japan, 2009 Autumn Meeting
- Place of Presentation
  Nihon University (Fukushima Prefecture)
- Year and Date
  2009-09-15
