2012 Fiscal Year Final Research Report

Distant-talking speech recognition based on spectral subtraction by multi-channel least mean square approach

Research Project

Project/Area Number 22700169
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation Type Single-year Grants
Research Field Perception information processing/Intelligent robotics
Research Institution Nagaoka University of Technology (2012)
Shizuoka University (2010-2011)

Principal Investigator

WANG Longbiao  Nagaoka University of Technology, Top Runner Incubation Center for Academia-Industry Fusion, Specially Appointed Associate Professor (30510458)

Project Period (FY) 2010 – 2012
Keywords generalized spectral subtraction / hands-free speech recognition / missing feature theory / multi-channel LMS / blind dereverberation
Research Abstract

We proposed a blind dereverberation method based on spectral subtraction using a multi-channel least mean square (MCLMS) algorithm. This method was evaluated in simulated and real noisy reverberant environments with stationary noise. In this study, we also evaluated it in a noisy reverberant environment with non-stationary noise such as music. The music is first suppressed by blind source separation based on the Efficient FastICA (independent component analysis) algorithm, and the spectral-subtraction-based dereverberation method is then applied to reduce late reverberation. In a real environment, the proposed method achieved average relative word error reduction rates of 41.9% and 7.9% compared with the baseline method and the state-of-the-art multi-step linear prediction (MSLP) based dereverberation, respectively.
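
As a rough illustration of the spectral-subtraction step described in the abstract, the following Python sketch subtracts an estimated late-reverberation power from the observed power spectrogram and applies a spectral floor. It is a minimal sketch under assumptions that are not stated in the report: the function name `suppress_late_reverb`, the parameters `delay`, `alpha`, and `floor`, and all numeric values are hypothetical. The late-reverberation weights, which in the project would be derived from the MCLMS channel estimate, are simply supplied here as made-up numbers.

```python
import numpy as np


def suppress_late_reverb(power, late_weights, delay, alpha=1.0, floor=0.1):
    """Power spectral subtraction of an estimated late-reverberation term.

    power        : (frames, bins) power spectrogram of the reverberant signal
    late_weights : (taps, bins) hypothetical per-bin weights for the late part
                   of the room response (assumed to be given by some estimator)
    delay        : number of frames treated as early reflections and kept
    alpha        : over-subtraction factor (assumed parameter)
    floor        : spectral floor as a fraction of the observed power
    """
    power = np.asarray(power, dtype=float)
    late_weights = np.asarray(late_weights, dtype=float)
    frames = power.shape[0]

    # Model the late reverberation in frame t as a weighted sum of the
    # observed power in frames t - delay, t - delay - 1, ...
    late = np.zeros_like(power)
    for d, w in enumerate(late_weights, start=delay):
        if d >= frames:
            break
        late[d:] += w * power[: frames - d]

    cleaned = power - alpha * late
    # Flooring keeps the estimate non-negative after subtraction.
    return np.maximum(cleaned, floor * power)


# Toy input: constant power, one late tap carrying half the energy of the
# frame one step earlier. Frame 0 has no history, so it is left untouched.
power_demo = np.ones((4, 1))
weights_demo = np.array([[0.5]])
cleaned_demo = suppress_late_reverb(power_demo, weights_demo, delay=1)
print(cleaned_demo.ravel())
```

The floor is applied relative to the observed power so that bins where the late-reverberation estimate exceeds the observation are attenuated rather than set to zero or to a negative value.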

  • Research Products

    (17 results)

All Journal Article (5 results) Presentation (9 results) Book (2 results) Remarks (1 results)

  • [Journal Article] Speaker identification and verification by combining MFCC and phase information (2012)

    • Author(s)
      S. Nakagawa, L. Wang and S. Ohtsuka
    • Journal Title

      IEEE Transactions on Audio, Speech and Language Processing

      Volume: Vol.20, No.4 Pages: 1085-1095

    • DOI

      DOI:10.1109/TASL.2011.2172422

  • [Journal Article] Dereverberation and Denoising Based on Generalized Spectral Subtraction by Multi-channel LMS Algorithm Using a Small-scale Microphone Array (2012)

    • Author(s)
      L. Wang, K. Odani and A. Kai
    • Journal Title

      EURASIP Journal on Advances in Signal Processing

      Volume: 2012

    • DOI

      DOI:10.1186/1687-6180-2012-12

  • [Journal Article] Identification of a distant speaker and its robustness (2011)

    • Author(s)
      Y. Jiang, Z. Tang and L. Wang
    • Journal Title

      Chinese Journal of Electronics

      Volume: Vol.20, No.2 Pages: 278-282

    • URL

      http://www.ejournal.org.cn/Jweb_cje/EN/abstract/abstract1109.shtml

  • [Journal Article] Distant-talking speech recognition based on spectral subtraction by multi-channel LMS algorithm (2011)

    • Author(s)
      L. Wang, N. Kitaoka, S. Nakagawa
    • Journal Title

      IEICE Trans. on Information and Systems

      Volume: Vol.E94-D, No.3 Pages: 659-667

    • URL

      http://search.ieice.org/bin/summary.php?id=e94-d_3_659

  • [Journal Article] Speaker recognition by combining MFCC and phase information in noisy conditions (2010)

    • Author(s)
      L. Wang, K. Minami, K. Yamamoto, S. Nakagawa
    • Journal Title

      IEICE Trans. on Information and Systems

      Volume: Vol.E93-D, No.9 Pages: 2397-2406

    • URL

      http://search.ieice.org/bin/summary.php?id=e93-d_9_2397

  • [Presentation] Single-sided Approach to Discriminative PLDA Training for Text-Independent Speaker Verification (2013)

    • Author(s)
      Zhaofeng Zhang, Lee Kong Aik, Longbiao Wang, Atsuhiko Kai, Ma Bin
    • Organizer
      Proc. of the 2013 Spring Meeting of the ASJ
    • Year and Date
      20130300
  • [Presentation] Distant-talking speaker identification using a reverberation model with various artificial room impulse responses (2012)

    • Author(s)
      L. Wang, Z. Zhang, A. Kai and Y. Kishi
    • Organizer
      Proc. of APSIPA ASC 2012
    • Year and Date
      20121200
  • [Presentation] Dereverberation based on Generalized Spectral Subtraction for Distant-talking Speaker Recognition (2012)

    • Author(s)
      Z. Zhang, L. Wang and A. Kai
    • Organizer
      Proc. of APSIPA ASC 2012
    • Year and Date
      20121200
  • [Presentation] On the Use of Phase Information-based Joint Factor Analysis for Speaker Verification under Channel Mismatch Condition (2012)

    • Author(s)
      Y. Hirano, L. Wang, A. Kai and S. Nakagawa
    • Organizer
      Proc. of APSIPA ASC 2012
    • Year and Date
      20121200
  • [Presentation] Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment (2012)

    • Author(s)
      K. Odani, L. Wang and A. Kai
    • Organizer
      Proc. of Interspeech 2012
    • Year and Date
      20120900
  • [Presentation] Blind Dereverberation Based on Generalized Spectral Subtraction by Multi-channel LMS Algorithm (2011)

    • Author(s)
      Kyohei Odani, Longbiao Wang and Atsuhiko Kai
    • Organizer
      Proc. of APSIPA ASC 2011
    • Year and Date
      20111000
  • [Presentation] Evaluation of Hands-free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm (2011)

    • Author(s)
      Longbiao Wang, Kyohei Odani and Atsuhiko Kai
    • Organizer
      Proc. of Text, Speech and Dialogue
    • Year and Date
      20110900
  • [Presentation] Multimodal interface with N-best display including candidates of spoken word fragments (2010)

    • Author(s)
      Y. Jang, A. Kai and L. Wang
    • Organizer
      Proc. of APSIPA ASC 2010
    • Year and Date
      20101200
  • [Presentation] Compensation approaches for distant speaker identification under reverberant environments (2010)

    • Author(s)
      Y. Jiang, Z. Tang and L. Wang
    • Organizer
      Proc. of CCPR 2010
    • Year and Date
      20101000
  • [Book] Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm for Hands-free Speech Recognition (2012)

    • Author(s)
      Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Norihide Kitaoka and Seiichi Nakagawa
    • Total Pages
      155-174
    • Publisher
      Chapter in Modern Speech Recognition Approaches with Case Studies, S. Ramakrishnan (Eds.), IN-TECH
  • [Book] Evaluation of hands-free large vocabulary continuous speech recognition by blind dereverberation based on spectral subtraction by multi-channel LMS algorithm (2011)

    • Author(s)
      Longbiao Wang, Kyohei Odani and Atsuhiko Kai
    • Total Pages
      131-138
    • Publisher
      Ivan Habernal, Vaclav Matousek (Eds.), Lecture Notes in Artificial Intelligence, Springer LNAI6836
  • [Remarks]

    • URL

      http://sip.nagaokaut.ac.jp/wang-j.html

Published: 2014-08-29  
