FY2012 Research Results Report

Hands-Free Speech Recognition Robust to Utterances from Multiple Speakers Using the Multi-Channel Least Mean Squares Algorithm

Research Project

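Project/Area Number	22700169
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Single-year Grants
Research Field	Perception information processing / Intelligent robotics
Research Institution	Nagaoka University of Technology (2012); Shizuoka University (2010-2011)
Principal Investigator	Longbiao Wang (王龍標), Nagaoka University of Technology, Top Runner Incubation Center for Academia-Industry Fusion, Specially Appointed Associate Professor for Academia-Industry Fusion (30510458)
Project Period (FY)	2010 – 2012
Keywords	generalized spectral subtraction / hands-free speech recognition / missing feature theory / multi-channel LMS / blind dereverberation
Outline of Research	We formulated sound generation in distant-talking environments and automatically estimated the transfer characteristics of the transmission channel. On this basis we performed dereverberation that is robust across a variety of reverberant environments, followed by post-processing that uses the reliability of the dereverberation, and thereby achieved high-accuracy reverberation processing. We also proposed a blind dereverberation method that uses generalized spectral subtraction (GSS) in place of power spectral subtraction (SS); it substantially reduced the error rate compared with the blind dereverberation method based on power SS. Furthermore, we recorded reverberant speech in a real environment (a meeting room) and used it for evaluation, achieving an error reduction rate comparable to that obtained on artificially reverberated speech. Finally, for reverberant speech containing music, which is a non-stationary noise, we proposed a method that combines the proposed blind dereverberation based on GSS with the multi-channel least mean squares (LMS) algorithm and blind source separation based on independent component analysis (ICA).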
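
The difference between power SS and generalized SS mentioned in the outline can be illustrated with a minimal sketch. It assumes that an estimate of the late-reverberation magnitude spectrum is already available for each frequency bin; how that estimate is obtained from the multi-channel LMS channel estimate is outside this sketch. The function name and the parameters `n`, `alpha` and `beta` are hypothetical and are not taken from the report. Setting `n = 1` gives power SS; other values of `n` give the generalized form.

```python
def generalized_spectral_subtraction(observed_mag, reverb_mag,
                                     n=0.5, alpha=1.0, beta=0.1):
    """Subtract an estimated reverberation spectrum in the |.|^(2n) domain.

    observed_mag : magnitudes |X(f)| of one frame of reverberant speech
    reverb_mag   : estimated late-reverberation magnitudes |R(f)| (assumed given)
    n            : exponent parameter (n = 1 is power SS; hypothetical default)
    alpha        : over-subtraction factor (illustrative assumption)
    beta         : spectral floor factor that keeps the result non-negative
    """
    if len(observed_mag) != len(reverb_mag):
        raise ValueError("spectra must have the same number of bins")
    exponent = 2.0 * n
    cleaned = []
    for x, r in zip(observed_mag, reverb_mag):
        x_pow = x ** exponent
        diff = x_pow - alpha * (r ** exponent)   # subtraction in the 2n domain
        floor = beta * x_pow                     # flooring avoids negative values
        cleaned.append(max(diff, floor) ** (1.0 / exponent))
    return cleaned


# Power SS (n = 1) on a single bin: sqrt(5^2 - 3^2)
print(generalized_spectral_subtraction([5.0], [3.0], n=1.0))
```

The flooring term is one common way to handle bins where the reverberation estimate exceeds the observation; the report does not state which rule was used.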

Research Products
(17 results)


Journal Article (5 results) / Presentation (9 results) / Book (2 results) / Remarks (1 result)

[Journal Article] Speaker identification and verification by combining MFCC and phase information (2012)
- Authors/Presenters
  S. Nakagawa, L. Wang and S. Ohtsuka
- Journal
  IEEE Transactions on Audio, Speech, and Language Processing
  Volume: Vol.20, No.4 Pages: 1085-1095
- DOI
  DOI:10.1109/TASL.2011.2172422
[Journal Article] Dereverberation and Denoising Based on Generalized Spectral Subtraction by Multi-channel LMS Algorithm Using a Small-scale Microphone Array (2012)
- Authors/Presenters
  L. Wang, K. Odani and A. Kai
- Journal
  EURASIP Journal on Advances in Signal Processing
  Volume: 2012
- DOI
  DOI:10.1186/1687-6180-2012-12
[Journal Article] Identification of a distant speaker and its robustness (2011)
- Authors/Presenters
  Y. Jiang, Z. Tang and L. Wang
- Journal
  Chinese Journal of Electronics
  Volume: Vol.20, No.2 Pages: 278-282
- URL
  http://www.ejournal.org.cn/Jweb_cje/EN/abstract/abstract1109.shtml
[Journal Article] Distant-talking speech recognition based on spectral subtraction by multi-channel LMS algorithm (2011)
- Authors/Presenters
  L. Wang, N. Kitaoka and S. Nakagawa
- Journal
  IEICE Trans. on Information and Systems
  Volume: Vol.E94-D, No.3 Pages: 659-667
- URL
  http://search.ieice.org/bin/summary.php?id=e94-d_3_659
[Journal Article] Speaker recognition by combining MFCC and phase information in noisy conditions (2010)
- Authors/Presenters
  L. Wang, K. Minami, K. Yamamoto and S. Nakagawa
- Journal
  IEICE Trans. on Information and Systems
  Volume: Vol.E93-D, No.9 Pages: 2397-2406
- URL
  http://search.ieice.org/bin/summary.php?id=e93-d_9_2397
[Presentation] Single-sided Approach to Discriminative PLDA Training for Text-Independent Speaker Verification (2013)
- Authors/Presenters
  Zhaofeng Zhang, Lee Kong Aik, Longbiao Wang, Atsuhiko Kai, Ma Bin
- Conference
  Proc. of the 2013 Spring Meeting of the ASJ
- Date
  2013-03
[Presentation] Distant-talking speaker identification using a reverberation model with various artificial room impulse responses (2012)
- Authors/Presenters
  L. Wang, Z. Zhang, A. Kai and Y. Kishi
- Conference
  Proc. of APSIPA ASC 2012
- Date
  2012-12
[Presentation] Dereverberation based on Generalized Spectral Subtraction for Distant-talking Speaker Recognition (2012)
- Authors/Presenters
  Z. Zhang, L. Wang and A. Kai
- Conference
  Proc. of APSIPA ASC 2012
- Date
  2012-12
[Presentation] On the Use of Phase Information-based Joint Factor Analysis for Speaker Verification under Channel Mismatch Condition (2012)
- Authors/Presenters
  Y. Hirano, L. Wang, A. Kai and S. Nakagawa
- Conference
  Proc. of APSIPA ASC 2012
- Date
  2012-12
[Presentation] Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment (2012)
- Authors/Presenters
  K. Odani, L. Wang and A. Kai
- Conference
  Proc. of Interspeech 2012
- Date
  2012-09
[Presentation] Blind Dereverberation Based on Generalized Spectral Subtraction by Multi-channel LMS Algorithm (2011)
- Authors/Presenters
  Kyohei Odani, Longbiao Wang and Atsuhiko Kai
- Conference
  Proc. of APSIPA ASC 2011
- Date
  2011-10
[Presentation] Evaluation of Hands-free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm (2011)
- Authors/Presenters
  Longbiao Wang, Kyohei Odani and Atsuhiko Kai
- Conference
  Proc. of Text, Speech and Dialogue
- Date
  2011-09
[Presentation] Multimodal interface with N-best display including candidates of spoken word fragments (2010)
- Authors/Presenters
  Y. Jang, A. Kai and L. Wang
- Conference
  Proc. of APSIPA ASC 2010
- Date
  2010-12
[Presentation] Compensation approaches for distant speaker identification under reverberant environments (2010)
- Authors/Presenters
  Y. Jiang, Z. Tang and L. Wang
- Conference
  Proc. of CCPR 2010
- Date
  2010-10
[Book] Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm for Hands-free Speech Recognition (2012)
- Authors/Presenters
  Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Norihide Kitaoka and Seiichi Nakagawa
- Pages
  155-174
- Publisher
  Chapter in Modern Speech Recognition Approaches with Case Studies, S. Ramakrishnan (Ed.), IN-TECH
[Book] Evaluation of hands-free large vocabulary continuous speech recognition by blind dereverberation based on spectral subtraction by multi-channel LMS algorithm (2011)
- Authors/Presenters
  Longbiao Wang, Kyohei Odani and Atsuhiko Kai
- Pages
  131-138
- Publisher
  Ivan Habernal, Vaclav Matousek (Eds.), Lecture Notes in Artificial Intelligence, Springer, LNAI 6836
[Remarks]
- URL
  http://sip.nagaokaut.ac.jp/wang-j.html
