1989 Fiscal Year Final Research Report Summary

Continuous Speech Recognition System for Noisy Environment Using Artificial Intelligence

Research Project

Project/Area Number	62550269
Research Category	Grant-in-Aid for General Scientific Research (C)
Allocation Type	Single-year Grants
Research Field	Computer engineering
Research Institution	Keio University
Principal Investigator	NAKAGAWA Masao  Keio University, Faculty of Science and Technology, Department of Electrical Engineering, Professor (30051882)
Project Period (FY)	1987 – 1989
Keywords	HMM / Noise / Continuous Speech Recognition / Trigram
Research Abstract	We obtained the following three results. (1) We proposed a frame-synchronous HMM connected-word recognition algorithm that can obtain multiple candidates for every sentence length. Using a trigram model, the algorithm can also obtain an approximate solution for the word sequence with maximum probability. We also introduced beam search into this algorithm to reduce the amount of computation. In experiments with a 100-word vocabulary, the sentence recognition rate was 71.3%, and the rate at which the correct sentence appeared among the 10 best candidates was 88.8%. These are increases of 2.5% and 12.5%, respectively, over the results of an algorithm that obtains a single candidate using the trigram model. Beam search limits the resulting increase in computation. (2) We proposed a speaker-independent, word-based HMM speech recognition system using separate vector quantization (band-division Separate VQ HMM speech recognition). The proposed system can simultaneously reduce the effects of external noise added to the speech and of changes in utterance caused by the noise. In experiments, it achieved a 5-16% higher recognition rate than a conventional HMM speech recognition system. (3) We proposed an isolated-word recognition system using two levels of HMMs, a syllable level and a word level. Compared with a conventional HMM isolated-word recognition system, this system reduces the amount of memory needed for the models, and the larger the vocabulary, the greater this benefit. For 500-word recognition, the system reduces the amount of memory by 61 percent. For 100 words spoken by a female speaker, the word recognition rate is 99.4 percent, equal to that of the conventional system, while the amount of memory for the models is reduced by 44 percent.
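
The beam search mentioned in result (1) can be illustrated with a minimal sketch. This is not the authors' algorithm: it is a generic state-level Viterbi decoder over a toy two-state HMM, processed one frame at a time, with no word models, no trigram scoring, and no multiple-candidate output. The function name beam_viterbi, the toy model, and the beam_width parameter are all assumptions introduced for illustration.

```python
import math

# Toy HMM (hypothetical values chosen only for illustration).
STATES = ["A", "B"]
LOG_INIT = {"A": math.log(0.6), "B": math.log(0.4)}
LOG_TRANS = {
    "A": {"A": math.log(0.7), "B": math.log(0.3)},
    "B": {"A": math.log(0.4), "B": math.log(0.6)},
}
LOG_EMIT = {
    "A": {"x": math.log(0.9), "y": math.log(0.1)},
    "B": {"x": math.log(0.2), "y": math.log(0.8)},
}
OBS = ["x", "x", "y", "y"]


def beam_viterbi(obs, states, log_init, log_trans, log_emit, beam_width):
    """Frame-synchronous Viterbi decoding with beam pruning.

    Before each new frame is processed, hypotheses whose log score is more
    than `beam_width` below the best score of the current frame are dropped,
    so fewer transitions have to be evaluated.
    Returns (best state path, its log probability).
    """
    # active maps each live state to (log score, best path ending there).
    active = {s: (log_init[s] + log_emit[s][obs[0]], [s]) for s in states}

    for symbol in obs[1:]:
        # Prune: keep only hypotheses close enough to the current best.
        best = max(score for score, _ in active.values())
        survivors = {s: v for s, v in active.items() if v[0] >= best - beam_width}

        # Extend every surviving hypothesis by one frame.
        new_active = {}
        for prev, (score, path) in survivors.items():
            for nxt, log_t in log_trans[prev].items():
                cand = score + log_t + log_emit[nxt][symbol]
                if nxt not in new_active or cand > new_active[nxt][0]:
                    new_active[nxt] = (cand, path + [nxt])
        active = new_active

    final = max(active, key=lambda s: active[s][0])
    return active[final][1], active[final][0]


if __name__ == "__main__":
    # An infinite beam disables pruning (plain Viterbi); a narrow beam prunes.
    print(beam_viterbi(OBS, STATES, LOG_INIT, LOG_TRANS, LOG_EMIT, math.inf))
    print(beam_viterbi(OBS, STATES, LOG_INIT, LOG_TRANS, LOG_EMIT, 1.0))
```

A narrower beam evaluates fewer transitions per frame but may discard the hypothesis that would have become the best one later, so the result is in general only an approximation of the full search.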

Research Products
(24 results)

All Publications (24 results)

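[Publications] Jun KATAOKA, Yasuhiro MINAMI, Masao NAKAGAWA: "Band-Division Separate VQ-HMM Speaker-Independent Speech Recognition Considering Background Noise" Transactions of the IEICE.
- Description
  From "Research Report Summary (Japanese)"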
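[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "Frame-Synchronous HMM Continuous Speech Recognition Obtaining Multiple Candidates Using a Trigram Model" Transactions of the IEICE.
- Description
  From "Research Report Summary (Japanese)"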
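[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "Detection and Removal of Abnormal Signals Using ADF and a Production System" IECE Autumn National Convention Record. 1. 129-129 (1987)
- Description
  From "Research Report Summary (Japanese)"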
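[Publications] Mina YONEMOTO, Yasuhiro MINAMI, Masao NAKAGAWA: "HMM Word Speech Recognition System with Reduced Memory Requirements" IECE Autumn National Convention Record. A. 1-16-1-16 (1988)
- Description
  From "Research Report Summary (Japanese)"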
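[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "Speeding Up HMM Continuous Speech Recognition" Proceedings of the Acoustical Society of Japan 1988 (Showa 63) Autumn Meeting. I. 258-259 (1988)
- Description
  From "Research Report Summary (Japanese)"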
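[Publications] Mina YONEMOTO, Yasuhiro MINAMI, Masao NAKAGAWA: "Word Speech Recognition System Using HMMs in Two Stages" The 11th Symposium on Information Theory and Its Application. 865-869 (1988)
- Description
  From "Research Report Summary (Japanese)"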
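[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "A Speed-Up Method for HMM Continuous Speech Recognition Using Beam Search" The 11th Symposium on Information Theory and Its Application. 871-876 (1988)
- Description
  From "Research Report Summary (Japanese)"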
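[Publications] Naoyuki KANDA, Yasuhiro MINAMI, Masao NAKAGAWA: "Automatic Word Classification in Connected Word Speech Recognition Using a Trigram Model" IECE Spring National Convention Record. 1. 1-20-1-20 (1989)
- Description
  From "Research Report Summary (Japanese)"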
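[Publications] Naohisa IWAMOTO, Yasuhiro MINAMI, Kiyoshi MIZUI, Masao NAKAGAWA: "Variable Bit Rate ADPCM-PARCOR Hybrid Music Coding Scheme" The 12th Symposium on Information Theory and Its Application. 761-764 (1989)
- Description
  From "Research Report Summary (Japanese)"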
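[Publications] Jun KATAOKA, Yasuhiro MINAMI, Masao NAKAGAWA: "A Study of the Noise Robustness of HMM Speech Recognition Using Separate Vector Quantization" The 12th Symposium on Information Theory and Its Application. 831-834 (1989)
- Description
  From "Research Report Summary (Japanese)"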
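[Publications] Masatoshi SATOU, Yasuhiro MINAMI, Kiyoshi MIZUI, Masao NAKAGAWA: "Variable Frame Rate PARCOR Vocoder Using Vector Quantization" The 12th Symposium on Information Theory and Its Application. 755-760 (1989)
- Description
  From "Research Report Summary (Japanese)"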
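[Publications] Jun KATAOKA, Yasuhiro MINAMI, Masao NAKAGAWA: "A Study of the Noise Robustness of HMM Speech Recognition Using Separate Vector Quantization" IEICE Spring National Convention Record. A. A-230-A-230 (1989)
- Description
  From "Research Report Summary (Japanese)"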
[Publications] Jun KATAOKA, Yasuhiro MINAMI, Masao NAKAGAWA: "A band-division Separate VQ Speaker-Independent HMM Speech Recognition System Considering External Noise." IEICE of Japan.
- Description
  From "Research Report Summary (English)"
[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "Frame Synchronous HMM connected Word Recognition Obtaining Multiple Candidate Sentences Using Trigram Model." IEICE of Japan.
- Description
  From "Research Report Summary (English)"
[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "A New Abnormal Signal Detection and Cancellation System by Using ADF and Production System." 1988 Autumn National Convention Record, IEICE, 1987, 1-291.
- Description
  From "Research Report Summary (English)"
[Publications] Mina YONEMOTO, Yasuhiro MINAMI, Masao NAKAGAWA: "Memory Reduced Word Recognition System Using HMM." 1988 Autumn National Convention Record, IEICE, 1988, A-1-16.
- Description
  From "Research Report Summary (English)"
[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "Fast Algorithm for HMM Continuous Speech Recognition." Proc. Autumn Meet. Acoust. Soc. Jpn., 1988, pp.257-258.
- Description
  From "Research Report Summary (English)"
[Publications] Mina YONEMOTO, Yasuhiro MINAMI, Masao NAKAGAWA: "A System for Isolated-word Recognition Using Two Level Hidden Markov Models." The 11th Symposium on Information Theory and Its Application (SITA'88), 1988, pp.865-869.
- Description
  From "Research Report Summary (English)"
[Publications] Yasuhiro MINAMI, Masao NAKAGAWA: "A Fast Algorithm Based on Beam Search for HMM Continuous Speech Recognition." The 11th Symposium on Information Theory and Its Application (SITA'88), 1988, pp.871-876.
- Description
  From "Research Report Summary (English)"
[Publications] Naoyuki KANDA, Yasuhiro MINAMI, Masao NAKAGAWA: "Automatic Word Classification in Connected Word Recognition Using Trigram Model." 1989 Spring National Convention Record, IEICE, 1989, A-20.
- Description
  From "Research Report Summary (English)"
[Publications] Naohisa IWAMOTO, Yasuhiro MINAMI, Kiyoshi MIZUI, Masao NAKAGAWA: "Variable Bit Rate ADPCM-PARCOR hybrid Instrumental Sound Coding." The 12th Symposium on Information Theory and Its Application (SITA'89), 1989, pp.761-764.
- Description
  From "Research Report Summary (English)"
[Publications] Jun KATAOKA, Yasuhiro MINAMI, Masao NAKAGAWA: "A Study of Robustness of HMM Speech Recognition System Using Separate Vector Quantization." The 12th Symposium on Information Theory and Its Application (SITA'89), 1989, pp.831-834.
- Description
  From "Research Report Summary (English)"
[Publications] Masatoshi SATOU, Yasuhiro MINAMI, Kiyoshi MIZUI, Masao NAKAGAWA: "Variable Frame Rate PARCOR Vocoder Using Vector Quantization." The 12th Symposium on Information Theory and Its Application (SITA'89), 1989, pp.755-760.
- Description
  From "Research Report Summary (English)"
[Publications] Jun KATAOKA, Yasuhiro MINAMI, Masao NAKAGAWA: "A Study of Robustness of HMM Speech Recognition System Using Separate Vector Quantization." 1990 Spring National Convention Record, IEICE, 1990, A-230.
- Description
  From "Research Report Summary (English)"
