Development of Noise Robust Speech Recognition and Its Application on Mobile Environment

Research Project

Project/Area Number 16500097
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Perception information processing/Intelligent robotics
Research Institution Yamagata University

Principal Investigator

KOSAKA Tetsuo  Yamagata University, Faculty of Engineering, Associate Professor (50359569)

Co-Investigator (Kenkyū-buntansha) KOHDA Masaki  Yamagata University, Faculty of Engineering, Professor (00205337)
KATOH Masaharu  Yamagata University, Faculty of Engineering, Research Assistant (10250953)
Project Period (FY) 2004 – 2006
Project Status Completed (Fiscal Year 2006)
Budget Amount
¥2,900,000 (Direct Cost: ¥2,900,000)
Fiscal Year 2006: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 2005: ¥1,000,000 (Direct Cost: ¥1,000,000)
Fiscal Year 2004: ¥1,000,000 (Direct Cost: ¥1,000,000)
Keywords speech recognition / noise / acoustic model / hidden Markov model / discrete HMM / MAP estimation / codebook normalization / histogram equalization / noise robustness / discrete-mixture HMM / cepstrum / codebook / distributed speech recognition / mobile environment
Research Abstract

1) Noisy speech recognition using DMHMMs
We have proposed new methods for robust speech recognition using discrete-mixture HMMs (DMHMMs). The aim of this work is to develop speech recognition that remains robust in adverse conditions containing both stationary and non-stationary noise. In particular, we focus on impulsive noise, which is a major problem in practical speech recognition systems. To address this problem, we have proposed two methods. The first is a MAP-based method for estimating DMHMM parameters, which aims to improve trainability. The second compensates the observation probabilities of DMHMMs with a threshold to reduce the adverse effect of outlier values. Experimental evaluations on Japanese LVCSR of read newspaper speech showed that the proposed method achieved an average error rate reduction of 28.1% in adverse conditions containing both stationary and impulsive noise.
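The two ideas in this part can be illustrated with a minimal sketch. It is not the project's implementation: the abstract gives no formulas, so the sketch assumes a prior-weighted (Dirichlet-style) count interpolation for the MAP estimate, and it assumes that threshold compensation means flooring each per-subvector discrete probability before the probabilities are multiplied. The names `map_discrete`, `floored_log_prob`, `tau`, and `threshold` are hypothetical.

```python
import math

def map_discrete(counts, prior, tau=10.0):
    """MAP-style estimate of a discrete output distribution.

    counts: observed codeword counts for one distribution
    prior:  prior probabilities for the same codewords (assumed given)
    tau:    hypothetical prior weight; larger values trust the prior more
    """
    total = sum(counts)
    return [(n + tau * p) / (total + tau) for n, p in zip(counts, prior)]

def floored_log_prob(subvector_probs, threshold=1e-3):
    """Log of the product of per-subvector probabilities, each floored at threshold.

    Flooring keeps a single outlier subvector (for example one hit by
    impulsive noise) from driving the whole product towards zero.
    """
    return sum(math.log(max(p, threshold)) for p in subvector_probs)

# Sparse training counts: two codewords were never observed.
counts = [8, 2, 0, 0]
prior = [0.25, 0.25, 0.25, 0.25]
estimate = map_discrete(counts, prior, tau=4.0)

# One subvector probability is an extreme outlier.
clean = floored_log_prob([0.4, 0.3, 0.5])
outlier = floored_log_prob([0.4, 1e-12, 0.5])
```

With the prior in place, unseen codewords keep a non-zero probability, and with the floor in place the outlier frame is penalized by at most `log(threshold)` per subvector rather than by an unbounded amount.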
2) Model Based Histogram Equalization for Noise Robust Speech Recognition by Using DMHMMs
Towards further improvement of noisy speech recognition, we have proposed a novel normalization method for the codebooks of DMHMMs. The codebook normalization method is based on histogram equalization (HEQ) and can compensate for the non-linear effects of additive noise in the model space. The proposed method was compared with both conventional continuous-mixture HMMs (CHMMs) and DMHMMs. It obtained the best performance, with an average relative improvement of 29.2% over the CHMM baseline.
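As a rough illustration of histogram equalization applied in the model space, the sketch below maps scalar codebook centroids from the clean-speech distribution onto the noisy-speech distribution by matching empirical quantiles. This is a generic HEQ sketch under stated assumptions, not the method of the project: it assumes one-dimensional centroids, assumes samples of both clean and noisy features are available, and models additive noise as a log-domain sum purely to generate example data. The function name `equalize_codebook` is hypothetical.

```python
import math
from bisect import bisect_right

def equalize_codebook(centroids, clean_samples, noisy_samples):
    """Map each centroid to the noisy-domain value at the same empirical quantile."""
    clean_sorted = sorted(clean_samples)
    noisy_sorted = sorted(noisy_samples)
    n, m = len(clean_sorted), len(noisy_sorted)
    mapped = []
    for c in centroids:
        # Number of clean samples less than or equal to the centroid.
        rank = bisect_right(clean_sorted, c)
        # Same quantile position in the noisy data: ceil(rank * m / n) - 1,
        # computed with integers to avoid floating-point rounding.
        index = (rank * m + n - 1) // n - 1
        index = min(m - 1, max(0, index))
        mapped.append(noisy_sorted[index])
    return mapped

# Example data (assumed): log-domain features with a noise floor at 3.0.
clean = [float(x) for x in range(10)]
noisy = [math.log(math.exp(x) + math.exp(3.0)) for x in clean]

centroids = [1.0, 5.0, 8.0]
mapped = equalize_codebook(centroids, clean, noisy)
```

In this example the low-energy centroid is pushed up to the noise floor while the high-energy centroid barely moves, which is the kind of non-linear shift a single linear offset could not reproduce.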

Report

(4 results)
  • 2006 Annual Research Report
  • Final Research Report Summary
  • 2005 Annual Research Report
  • 2004 Annual Research Report

Research Products

(32 results)

Journal Article (31 results), Book (1 result)

  • [Journal Article] Speech Recognition and Synthesis (2007)

    • Author(s)
      Vedran Kordic (Editor)
    • Journal Title

      International Journal of Advanced Robotic Systems (in press)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Lecture Speech Recognition Using Pronunciation Variant Modeling (2006)

    • Author(s)
      堤怜介, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      IEICE Transactions D Vol. J89-D, No.2

      Pages: 305-313

    • NAID

      110004669949

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Noisy Speech Recognition Based on Codebook Normalization of Discrete-Mixture HMMs (2006)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      ASA/ASJ Fourth Joint Meeting 1pSC27

      Pages: 3041-3041

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Lecture Speech Recognition Using Pronunciation Variant Modeling (2006)

    • Author(s)
      R.Tsutsumi, M.Katoh, T.Kosaka, M.Kohda
    • Journal Title

      IEICE Transactions D Vol.J89-D, No.2

      Pages: 305-313

    • NAID

      110004669949

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Noisy Speech Recognition Based on Codebook Normalization of Discrete-Mixture HMMs (2006)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      ASA/ASJ Fourth Joint Meeting 1pSC27

      Pages: 3041-3041

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Noisy speech recognition based on codebook normalization of discrete-mixture HMMs (2006)

    • Author(s)
      Tetsuo Kosaka, Masaharu Katoh, Masaki Kohda
    • Journal Title

      ASA/ASJ Fourth Joint Meeting 1pSC27

      Pages: 3041-3041

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Codebook Normalization of Discrete-Mixture HMMs Using Histogram Equalization (2006)

    • Author(s)
      小坂哲夫, 加藤正治, 好田正紀
    • Journal Title

      IEICE Technical Report SP2006-25

      Pages: 25-30

    • NAID

      110004750981

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Lecture Speech Recognition with Discrete-Mixture HMMs Using Codebook Adaptation (2006)

    • Author(s)
      山本明祥, 熊倉拓哉, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      SIG Technical Report on Spoken Language Processing 2006-SLP-62

      Pages: 25-30

    • NAID

      110004849727

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Lecture Speech Recognition with Discrete-Mixture HMMs Using Codebook Adaptation (2006)

    • Author(s)
      山本明祥, 熊倉拓哉, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 2-2-9

      Pages: 69-70

    • NAID

      110004849727

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A Study of Acoustic Models for Speaker Identification Using Speaker Vectors (2006)

    • Author(s)
      赤津達也, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 2-P-10

      Pages: 113-114

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Language Model Adaptation for House of Councillors Meeting Speech (2006)

    • Author(s)
      加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 2-P-29

      Pages: 151-152

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A Study of Speaker Identification Based on Speaker Vectors Using Phoneme Models (2006)

    • Author(s)
      赤津達也, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      IEICE Technical Report SP2006-101

      Pages: 95-99

    • NAID

      110006164204

    • Related Report
      2006 Annual Research Report
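  • [Journal Article] Effect of Dimensionality Reduction in Speaker Identification Using Speaker Vectors (2006)

    • Author(s)
      赤津達也, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 1-P-18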

      Pages: 159-160

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Lecture Speech Recognition Using Pronunciation Variant Modeling (2006)

    • Author(s)
      堤怜介, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      IEICE Transactions D Vol. J89-D, No.2

      Pages: 305-313

    • NAID

      110004669949

    • Related Report
      2005 Annual Research Report
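  • [Journal Article] A Study of Language Model Construction Using Transcriptions and Lecture Records (2006)

    • Author(s)
      加藤正治, 梅本真模, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 3-1-7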

      Pages: 1203-1204

    • Related Report
      2005 Annual Research Report
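  • [Journal Article] Improving Lecture Speech Recognition by Unsupervised Adaptation Using the Corpus of Spontaneous Japanese (2006)

    • Author(s)
      阿部拓也, 草間隆, 武田千春, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 3-1-8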

      Pages: 1205-1206

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Noisy Speech Recognition Based on Codebook Normalization of Discrete-Mixture HMMs (2006)

    • Author(s)
      遠藤大悟, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 3-1-16

      Pages: 139-140

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Robust Speech Recognition Using Discrete-Mixture HMMs (2005)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      IEICE Transactions on Information and Systems Vol. E88-D No.12

      Pages: 2811-2818

    • NAID

      110004019504

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Robust Speech Recognition under Non-Stationary Noise Using Discrete-Mixture HMMs (2005)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      Proc. of 2005 RISP International Workshop on Nonlinear Circuits and Signal Processing

      Pages: 347-350

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Robust Speech Recognition Using Discrete-Mixture HMMs (2005)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      IEICE Transactions on Information and Systems Vol.E88-D, No.12

      Pages: 2811-2818

    • NAID

      110004019504

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Robust Speech Recognition under Non-Stationary Noise Using Discrete-Mixture HMMs (2005)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      Proc. of 2005 RISP International Workshop on Nonlinear Circuits and Signal Processing

      Pages: 347-350

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2006 Final Research Report Summary, 2004 Annual Research Report
  • [Journal Article] Robust Speech Recognition Using Discrete-Mixture HMMs (2005)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      IEICE Transactions on Information and Systems Vol.E88-D No.12

      Pages: 2811-2818

    • NAID

      110004019504

    • Related Report
      2005 Annual Research Report
  • [Journal Article] A Study of Lecture Speech Recognition Using Discrete-Mixture HMMs (2005)

    • Author(s)
      小坂哲夫, 山本明祥, 加藤正治, 好田正紀
    • Journal Title

      IEICE Technical Report SP2005-25

      Pages: 31-36

    • Related Report
      2005 Annual Research Report
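  • [Journal Article] Performance Evaluation of Lecture Speech Recognition with Pronunciation Variant Models Using the Corpus of Spontaneous Japanese (2005)

    • Author(s)
      阿部拓也, 武田千春, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 2-1-1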

      Pages: 37-38

    • NAID

      110003488493

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Evaluation of Discrete-Mixture HMMs on the Corpus of Spontaneous Japanese (2005)

    • Author(s)
      小坂哲夫, 山本明祥, 加藤正治, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 2-7-19

      Pages: 95-96

    • Related Report
      2005 Annual Research Report
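  • [Journal Article] Performance Evaluation of Lecture Speech Recognition with Pronunciation Variant Models Using the Corpus of Spontaneous Japanese (2005)

    • Author(s)
      阿部拓也, 草間隆, 武田千春, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      IEICE Technical Report SP2005-94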

      Pages: 25-30

    • Related Report
      2005 Annual Research Report
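  • [Journal Article] Evaluation with MFCCs of Noisy Speech Recognition Using Noise-Mixture Output Distribution HMMs (2005)

    • Author(s)
      小坂哲夫, 加藤正治, 好田正紀
    • Journal Title

      Proceedings of the Acoustical Society of Japan Meeting 3-5-II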

      Pages: 97-98

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Noisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation (2004)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      Proc. of The 18th International Congress on Acoustics Vol II

      Pages: 1691-1694

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Noisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation (2004)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      Proc. of the 18th International Congress on Acoustics Vol.II

      Pages: 1691-1694

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Noisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation (2004)

    • Author(s)
      T.Kosaka, M.Katoh, M.Kohda
    • Journal Title

      Proc. of The 18th International Congress on Acoustics Vol II

      Pages: 1691-1694

    • Related Report
      2004 Annual Research Report
  • [Journal Article] A Study of Noisy Speech Recognition Using the ETSI Standard Front-End (2004)

    • Author(s)
      福士なな子, 加藤正治, 小坂哲夫, 好田正紀
    • Journal Title

      IEICE Technical Report SP2004-II

      Pages: 7-12

    • NAID

      110003295894

    • Related Report
      2004 Annual Research Report
  • [Book] Speech Recognition and Synthesis (2007)

    • Author(s)
      Vedran Kordic, Editor
    • Publisher
      International Journal of Advanced Robotic Systems (undetermined) (in press)
    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2006 Final Research Report Summary

Published: 2004-04-01   Modified: 2016-04-21  