Development of Noise Robust Speech Recognition and Its Application on Mobile Environment

Research Project

Project/Area Number	16500097
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Perception information processing/Intelligent robotics
Research Institution	Yamagata University
Principal Investigator	KOSAKA Tetsuo Yamagata University, Faculty of Engineering, Associate Professor (50359569)
Co-Investigator(Kenkyū-buntansha)	KOHDA Masaki Yamagata University, Faculty of Engineering, Professor (00205337); KATOH Masaharu Yamagata University, Faculty of Engineering, Research Assistant (10250953)
Project Period (FY)	2004 – 2006
Project Status	Completed (Fiscal Year 2006)
Budget Amount	¥2,900,000 (Direct Cost: ¥2,900,000); Fiscal Year 2006: ¥900,000 (Direct Cost: ¥900,000); Fiscal Year 2005: ¥1,000,000 (Direct Cost: ¥1,000,000); Fiscal Year 2004: ¥1,000,000 (Direct Cost: ¥1,000,000)
Keywords	speech recognition / noise / acoustic model / hidden Markov model / discrete HMM / MAP estimation / codebook normalization / histogram equalization / noise robustness / discrete-mixture HMM / cepstrum / codebook / distributed speech recognition / mobile environment
Research Abstract	1) Noisy speech recognition using DMHMMs. We proposed new methods for robust speech recognition using discrete-mixture HMMs (DMHMMs). The aim of this work is to develop speech recognition that remains robust in adverse conditions containing both stationary and non-stationary noise. In particular, we focus on impulsive noise, which is a major problem in practical speech recognition systems. To address it, we proposed two methods. The first is a MAP-based method for estimating DMHMM parameters, intended to improve trainability. The second compensates the observation probabilities of DMHMMs with a threshold to reduce the adverse effect of outlier values. Experimental evaluations on Japanese LVCSR for read newspaper speech showed that the proposed method achieved an average error rate reduction of 28.1% in adverse conditions containing both stationary and impulsive noise.
2) Model-based histogram equalization for noise robust speech recognition using DMHMMs. Toward further improvement of noisy speech recognition, we proposed a novel normalization method for the codebooks of DMHMMs. The codebook normalization method is based on histogram equalization (HEQ) and can compensate for the non-linear effects of additive noise in the model space. The proposed method was compared with both conventional continuous-mixture HMMs (CHMMs) and DMHMMs. It gave the best performance, with an average relative improvement of 29.2% over the CHMM baseline.
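The threshold compensation mentioned in the abstract can be illustrated with a minimal sketch. It is not the project's implementation: it assumes that a DMHMM state scores a frame as a weighted sum over mixtures of per-subvector codeword probabilities, and that the compensation takes the form of a floor on those discrete probabilities. The function name `dmhmm_log_likelihood`, the `floor` parameter, and the toy numbers are all hypothetical.

```python
import math


def dmhmm_log_likelihood(weights, tables, codes, floor=0.0):
    """Log output probability of one DMHMM state for one frame.

    weights: mixture weights, one per mixture component.
    tables:  tables[m][s][k] is the probability of codeword k in
             subvector stream s under mixture m.
    codes:   codes[s] is the codeword index observed in stream s.
    floor:   hypothetical threshold; any discrete probability below it
             is raised to it (an assumed form of the compensation).
    """
    total = 0.0
    for weight, streams in zip(weights, tables):
        prob = weight
        for s, k in enumerate(codes):
            # Flooring keeps one outlier stream from zeroing the product.
            prob *= max(streams[s][k], floor)
        total += prob
    return math.log(total) if total > 0.0 else float("-inf")


# Toy state: 2 mixtures, 2 streams, 2 codewords per stream.
weights = [0.5, 0.5]
tables = [
    [[0.9, 0.1], [1.0, 0.0]],
    [[0.2, 0.8], [1.0, 0.0]],
]
# Stream 1 observes a codeword that was never seen in training,
# as an impulsive-noise frame might.
codes = [0, 1]

unfloored = dmhmm_log_likelihood(weights, tables, codes)
floored = dmhmm_log_likelihood(weights, tables, codes, floor=0.01)
```

Without the floor, the unseen codeword drives the state likelihood to zero and the hypothesis is eliminated by a single frame. With the floor, the frame is penalized but the remaining streams still contribute to the score.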

Report

(4 results)

2006 Annual Research Report, Final Research Report Summary
2005 Annual Research Report
2004 Annual Research Report

Research Products
(32 results)

Journal Article (31 results), Book (1 result)

[Journal Article] Speech Recognition and Synthesis (2007)
- Author(s)
  Vedran Kordic (Editor)
- Journal Title
  
  International Journal of Advanced Robotic Systems (in press)
- Description
  From the "Summary of Research Results Report (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Lecture Speech Recognition Using Pronunciation Variant Modeling (2006)
- Author(s)
  堤怜介, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  電子情報通信学会論文誌D Vol. J89-D, No.2
  
  Pages: 305-313
- NAID
  110004669949
- Description
  From the "Summary of Research Results Report (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Noisy Speech Recognition Based on Codebook Normalization of Discrete-Mixture HMMs (2006)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  ASA/ASJ Fourth Joint Meeting 1pSC27
  
  Pages: 3041-3041
- Description
  From the "Summary of Research Results Report (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Lecture Speech Recognition Using Pronunciation Variant Modeling (2006)
- Author(s)
  R.Tsutsumi, M.Katoh, T.Kosaka, M.Kohda
- Journal Title
  
  IEICE Transactions D Vol.J89-D, No.2
  
  Pages: 305-313
- NAID
  110004669949
- Description
  From the "Summary of Research Results Report (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Noisy Speech Recognition Based on Codebook Normalization of Discrete-Mixture HMMs (2006)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  ASA/ASJ Fourth Joint Meeting 1pSC27
  
  Pages: 3041-3041
- Description
  From the "Summary of Research Results Report (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Noisy speech recognition based on codebook normalization of discrete-mixture HMMs (2006)
- Author(s)
  Tetsuo Kosaka, Masaharu Katoh, Masaki Kohda
- Journal Title
  
  ASA/ASJ Fourth Joint Meeting 1pSC27
  
  Pages: 3041-3041
- Related Report
  2006 Annual Research Report
[Journal Article] Codebook Normalization of Discrete-Mixture HMMs Using Histogram Equalization (2006)
- Author(s)
  小坂哲夫, 加藤正治, 好田正紀
- Journal Title
  
  電子情報通信学会技術研究報告 SP2006-25
  
  Pages: 25-30
- NAID
  110004750981
- Related Report
  2006 Annual Research Report
[Journal Article] Lecture Speech Recognition with Discrete-Mixture HMMs Using Codebook Adaptation (2006)
- Author(s)
  山本明祥, 熊倉拓哉, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  音声言語情報処理研究報告 2006-SLP-62
  
  Pages: 25-30
- NAID
  110004849727
- Related Report
  2006 Annual Research Report
[Journal Article] Lecture Speech Recognition with Discrete-Mixture HMMs Using Codebook Adaptation (2006)
- Author(s)
  山本明祥, 熊倉拓哉, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 2-2-9
  
  Pages: 69-70
- NAID
  110004849727
- Related Report
  2006 Annual Research Report
[Journal Article] A Study of Acoustic Models for Speaker Identification Using Speaker Vectors (2006)
- Author(s)
  赤津達也, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 2-P-10
  
  Pages: 113-114
- Related Report
  2006 Annual Research Report
[Journal Article] Language Model Adaptation for House of Councillors Meeting Speech (2006)
- Author(s)
  加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 2-P-29
  
  Pages: 151-152
- Related Report
  2006 Annual Research Report
[Journal Article] A Study of Speaker Identification Based on Speaker Vectors Using Phoneme Models (2006)
- Author(s)
  赤津達也, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  電子情報通信学会技術研究報告 SP2006-101
  
  Pages: 95-99
- NAID
  110006164204
- Related Report
  2006 Annual Research Report
[Journal Article] Effect of Dimensionality Reduction in Speaker Identification Using Speaker Vectors (2006)
- Author(s)
  赤津達也, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 1-P-18
  
  Pages: 159-160
- Related Report
  2006 Annual Research Report
[Journal Article] Lecture Speech Recognition Using Pronunciation Variant Modeling (2006)
- Author(s)
  堤怜介, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  電子情報通信学会論文誌D Vol.J89-D,No.2
  
  Pages: 305-313
- NAID
  110004669949
- Related Report
  2005 Annual Research Report
[Journal Article] A Study of Language Model Construction Using Transcriptions and Lecture Records (2006)
- Author(s)
  加藤正治, 梅本真模, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 3-1-7
  
  Pages: 1203-1204
- Related Report
  2005 Annual Research Report
[Journal Article] Improving Lecture Speech Recognition by Unsupervised Adaptation Using the Corpus of Spontaneous Japanese (2006)
- Author(s)
  阿部拓也, 草間隆, 武田千春, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 3-1-8
  
  Pages: 1205-1206
- Related Report
  2005 Annual Research Report
[Journal Article] Noisy Speech Recognition Based on Codebook Normalization of Discrete-Mixture HMMs (2006)
- Author(s)
  遠藤大悟, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 3-1-16
  
  Pages: 139-140
- Related Report
  2005 Annual Research Report
[Journal Article] Robust Speech Recognition Using Discrete-Mixture HMMs (2005)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  IEICE Transactions on Information and Systems Vol. E88-D No.12
  
  Pages: 2811-2818
- NAID
  110004019504
- Description
  From the "Summary of Research Results Report (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Robust Speech Recognition under Non-Stationary Noise Using Discrete-Mixture HMMs (2005)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  Proc. of 2005 RISP International Workshop on Nonlinear Circuits and Signal Processing
  
  Pages: 347-350
- Description
  From the "Summary of Research Results Report (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Robust Speech Recognition Using Discrete-Mixture HMMs (2005)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  IEICE Transactions on Information and Systems Vol.E88-D, No.12
  
  Pages: 2811-2818
- NAID
  110004019504
- Description
  From the "Summary of Research Results Report (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Robust Speech Recognition under Non-Stationary Noise Using Discrete-Mixture HMMs (2005)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  Proc.of 2005 RISP International Workshop on Nonlinear Circuits and Signal Processing
  
  Pages: 347-350
- Description
  From the "Summary of Research Results Report (English)"
- Related Report
  2006 Final Research Report Summary, 2004 Annual Research Report
[Journal Article] Robust Speech Recognition Using Discrete-Mixture HMMs (2005)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  IEICE Transactions on Information and Systems Vol.E88-D No.12
  
  Pages: 2811-2818
- NAID
  110004019504
- Related Report
  2005 Annual Research Report
[Journal Article] A Study of Lecture Speech Recognition Using Discrete-Mixture HMMs (2005)
- Author(s)
  小坂哲夫, 山本明祥, 加藤正治, 好田正紀
- Journal Title
  
  電子情報通信学会技術研究報告 SP2005-25
  
  Pages: 31-36
- Related Report
  2005 Annual Research Report
[Journal Article] Performance Evaluation of Lecture Speech Recognition with Pronunciation Variant Models Using the Corpus of Spontaneous Japanese (2005)
- Author(s)
  阿部拓也, 武田千春, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 2-1-1
  
  Pages: 37-38
- NAID
  110003488493
- Related Report
  2005 Annual Research Report
[Journal Article] Evaluation of Discrete-Mixture HMMs on the Corpus of Spontaneous Japanese (2005)
- Author(s)
  小坂哲夫, 山本明祥, 加藤正治, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 2-7-19
  
  Pages: 95-96
- Related Report
  2005 Annual Research Report
[Journal Article] Performance Evaluation of Lecture Speech Recognition with Pronunciation Variant Models Using the Corpus of Spontaneous Japanese (2005)
- Author(s)
  阿部拓也, 草間隆, 武田千春, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  電子情報通信学会技術研究報告 SP2005-94
  
  Pages: 25-30
- Related Report
  2005 Annual Research Report
[Journal Article] Evaluation with MFCCs of Noisy Speech Recognition Using Noise-Mixture Output Distribution HMMs (2005)
- Author(s)
  小坂哲夫, 加藤正治, 好田正紀
- Journal Title
  
  日本音響学会講演論文集 3-5-II
  
  Pages: 97-98
- Related Report
  2004 Annual Research Report
[Journal Article] Noisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation (2004)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  Proc. of The 18th International Congress on Acoustics Vol II
  
  Pages: 1691-1694
- Description
  From the "Summary of Research Results Report (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Noisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation (2004)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  Proc.of the 18th International Congress on Acoustics Vol.II
  
  Pages: 1691-1694
- Description
  From the "Summary of Research Results Report (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Noisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation (2004)
- Author(s)
  T.Kosaka, M.Katoh, M.Kohda
- Journal Title
  
  Proc.of The 18th International Congress on Acoustics Vol II
  
  Pages: 1691-1694
- Related Report
  2004 Annual Research Report
[Journal Article] A Study of Noisy Speech Recognition Using the ETSI Standard Front-End (2004)
- Author(s)
  福士なな子, 加藤正治, 小坂哲夫, 好田正紀
- Journal Title
  
  電子情報通信学会技術研究報告 SP2004-II
  
  Pages: 7-12
- NAID
  110003295894
- Related Report
  2004 Annual Research Report
[Book] Speech Recognition and Synthesis (2007)
- Author(s)
  Vedran Kordic, Editor
- Publisher
  International Journal of Advanced Robotic Systems (to be determined) (in press)
- Description
  From the "Summary of Research Results Report (Japanese)"
- Related Report
  2006 Final Research Report Summary
