話者認識のための局所話者識別的特徴処理の研究

Research Project

Project/Area Number	09F09805
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	外国
Research Field	Intelligent informatics
Research Institution	Tokyo Institute of Technology
Principal Investigator	古井貞熙東京工業大学, 大学院・情報理工学研究科, 教授
Co-Investigator(Kenkyū-buntansha)	FERRAS FONT Marc 東京工業大学, 大学院・情報理工学研究科, 外国人特別研究員 FERRAS FONT Marc 東京工業大学, 大学院・情報理工学研究科, 外国人特別研究員
Project Period (FY)	2009 – 2010
Project Status	Completed (Fiscal Year 2010)
Budget Amount *help	¥1,200,000 (Direct Cost: ¥1,200,000) Fiscal Year 2010: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2009: ¥400,000 (Direct Cost: ¥400,000)
Keywords	話者認識 / 識別的特徴
Research Abstract	(1)構造的MAP適応法の研究 GMM (Gaussian-mixture model)とSVM (Support Vector Machine)を組み合わせて用いる話者認識法に、構造的MAP適応法(Structural MAP adaptation)を適用する方法について研究した。構造的MAP適応法を用いることにより、限られた量の学習データを効果的に用いて、頑健な話者モデルを推定することができる。種々の構成の木構造を用いて実験を行った結果、最適な構造を用いた場合、10-12%の話者認識性能の向上が達成できることが確認された。この方法に、特徴量空間の線形変換に基づくCMLLR法による話者適応学習(SAT : Speaker Adaptive Training)を組み合わせることにより、最適な木構造の選択が容易になることが確認できた。これらの成果は、国際会議論文として採録決定済みである。 (2)個人差で重みづけられたMAP適応法の研究 GMMとSVMを組み合わせる話者認識法において、話者識別基準(speaker-discriminative criterion)を導入することにより、認識性能の安定性を高める研究を行った。適応における重み係数を調整するためのデータ占有数に、Fisher discriminative ratioすなわち話者間・話者内分散比を用いることにより、7%の話者認識性能の向上が確認できた。この成果は、情報処理学会の音声言語処理シンポジウムで発表した。

Report

(2 results)

2010 Annual Research Report
2009 Annual Research Report

Research Products

(2 results)

All 2010

All Presentation (2 results)

[Presentation] Optimal use of trees in structural MAP adaptation for speaker verification2010
- Author(s)
  S.Biswas, M.Ferras, K.Shinoda, S.Furui
- Organizer
  音声言語処理研究会
- Place of Presentation
  東京都
- Year and Date
  2010-12-21
- Related Report
  2010 Annual Research Report
[Presentation] Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition2010
- Author(s)
  M.Ferras, K.Shinoda, S.Furui
- Organizer
  音声言語処理研究会
- Place of Presentation
  東京都
- Year and Date
  2010-12-21
- Related Report
  2010 Annual Research Report

話者認識のための局所話者識別的特徴処理の研究

Principal Investigator

古井 貞熙 東京工業大学, 大学院・情報理工学研究科, 教授

¥1,200,000 (Direct Cost: ¥1,200,000)

Report

Research Products

[Presentation] Optimal use of trees in structural MAP adaptation for speaker verification2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition2010

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

古井貞熙東京工業大学, 大学院・情報理工学研究科, 教授