Speech Recognition Based on Intelligent Beam Search Algorithm

Research Project

Project/Area Number	01460254
Research Category	Grant-in-Aid for General Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	Informatics
Research Institution	Yamagata University
Principal Investigator	KOHDA Masaki Yamagata University, Faculty of Engineering, Professor (00205337)
Project Period (FY)	1989 – 1991
Project Status	Completed (Fiscal Year 1991)
Budget Amount	¥6,900,000 (Direct Cost: ¥6,900,000); Fiscal Year 1991: ¥200,000 (Direct Cost: ¥200,000); Fiscal Year 1990: ¥1,500,000 (Direct Cost: ¥1,500,000); Fiscal Year 1989: ¥5,200,000 (Direct Cost: ¥5,200,000)
Keywords	Speech Recognition / Graph Search / A^* Algorithm / Dynamic Time Warping / Beam Search / Vector Quantization / Hidden Markov Model / Best-First Search / DP Matching / Preselection / DP Beam Search / Threshold Function / Pruning / Frame-Synchronous DP Matching
Research Abstract	In large-vocabulary continuous speech recognition, efficient recognition algorithms are essential because the enormous computation required by the matching process must be completed within a realistic CPU time. Conventional recognition algorithms based on dynamic time warping (DTW), hidden Markov models (HMMs), and similar methods are built on an exhaustive search over the possible combinations, and dynamic programming is used to carry out that exhaustive search efficiently. The matching process in DTW-based and HMM-based speech recognition systems can be regarded as the problem of searching for an optimal path through a constrained set of nodes. When graph search algorithms are applied to speech recognition, two kinds are effective: beam search and best-first search. The conventional pruning strategy in beam-search speech recognition is based only on the score from the beginning node to the current node; an estimate of the score from the current node to the terminal node is not used. For best-first search, the A^* algorithm is introduced to speech recognition. This report describes new approaches to DTW-based and HMM-based speech recognition algorithms that model the matching process from the viewpoint of graph search. Chapter I describes DTW-based speech recognition using beam search, Chapter II describes DTW-based speech recognition using best-first search, and Chapter III describes HMM-based speech recognition using best-first search.
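
The three search styles named in the abstract can be illustrated with a minimal Python sketch. It is not the report's implementation: the scalar features, the absolute-difference local cost, the step pattern STEPS, the fixed beam width, and the lower-bound estimate h are all assumptions chosen for brevity. In particular, h is a simple stand-in for a "score estimate from the current node to the terminal node", not the VQ-distortion estimate studied in the project.

```python
import heapq
import math

STEPS = (0, 1, 2)  # assumed step pattern: each input frame advances the reference by 0, 1, or 2


def local_cost(a, b):
    """Assumed local distance between one input frame and one reference frame."""
    return abs(a - b)


def dtw_exhaustive(x, y):
    """Dynamic programming over every grid node (i, j)."""
    n, m = len(x), len(y)
    D = [[math.inf] * m for _ in range(n)]
    D[0][0] = local_cost(x[0], y[0])
    for i in range(1, n):
        for j in range(m):
            best = min(D[i - 1][j - dj] for dj in STEPS if j - dj >= 0)
            D[i][j] = best + local_cost(x[i], y[j])
    return D[n - 1][m - 1]


def dtw_beam(x, y, beam_width):
    """Frame-synchronous beam search: after each input frame, drop nodes whose
    score from the start exceeds the best score in that frame by beam_width."""
    m = len(y)
    active = {0: local_cost(x[0], y[0])}  # reference index -> score from start
    for i in range(1, len(x)):
        nxt = {}
        for j, score in active.items():
            for dj in STEPS:
                j2 = j + dj
                if j2 < m:
                    cand = score + local_cost(x[i], y[j2])
                    if cand < nxt.get(j2, math.inf):
                        nxt[j2] = cand
        threshold = min(nxt.values()) + beam_width
        active = {j: s for j, s in nxt.items() if s <= threshold}
    return active.get(m - 1, math.inf)  # inf if the terminal node was pruned


def dtw_best_first(x, y):
    """A*-style best-first search: expand the node with the smallest g + h."""
    n, m = len(x), len(y)
    # h[i]: lower bound on the cost of frames i+1..n-1 (each remaining frame
    # must match some reference frame, so it costs at least its nearest one).
    h = [0.0] * n
    for i in range(n - 2, -1, -1):
        h[i] = h[i + 1] + min(local_cost(x[i + 1], r) for r in y)
    g0 = local_cost(x[0], y[0])
    heap = [(g0 + h[0], g0, 0, 0)]
    closed = set()
    while heap:
        _, g, i, j = heapq.heappop(heap)
        if (i, j) in closed:
            continue
        closed.add((i, j))
        if i == n - 1 and j == m - 1:
            return g  # first time the terminal node is expanded, g is optimal
        if i + 1 < n:
            for dj in STEPS:
                j2 = j + dj
                if j2 < m and (i + 1, j2) not in closed:
                    g2 = g + local_cost(x[i + 1], y[j2])
                    heapq.heappush(heap, (g2 + h[i + 1], g2, i + 1, j2))
    return math.inf


if __name__ == "__main__":
    x, y = [1, 2, 3, 4, 5], [1, 3, 4, 5]
    print(dtw_exhaustive(x, y), dtw_beam(x, y, 1), dtw_best_first(x, y))
```

Under these assumptions the beam search can miss the optimum when the beam is too narrow, because it prunes on the score from the start alone, whereas the best-first search stays optimal as long as h never overestimates the remaining cost.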

Report

(4 results)

1991 Annual Research Report
1991 Final Research Report Summary
1990 Annual Research Report
1989 Annual Research Report

Research Products
(25 results)

All Publications (25 results)

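[Publications] M. Kohda: "A study on pruning strategies for DP beam search" Trans. IEICE (D-II). J72-D-II. 1248-1255 (1989)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda: "A study on modifying pruning strategies for DP beam search at a present input frame" Trans. IEICE (D-II). J75-D-II. 1-10 (1992)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, M. Katoh: "A study on utilizing a VQ-based preprocessor in DP beam search for speech recognition" IEICE Technical Report (Trans. IEICE). SP91-9. 25-32 (1991)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, M. Katoh, K. Itoh: "A study on the dynamic programming best-first search using a cost estimate based on VQ distortion" IEICE Technical Report (Trans. IEICE). SP91-84. 25-32 (1991)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, M. Katoh: "A study of word spotting with the dynamic programming best-first search algorithm" IPSJ Tohoku Branch Research Meeting (Trans. IEICE). 91-3-6. 1-5 (1992)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, T. Kitamura: "A study on Viterbi best-first search for isolated word recognition" IPSJ Tohoku Branch Research Meeting (Trans. IEICE). 91-3-7. 1-10 (1992)
- Description
  From the Final Research Report Summary (Japanese)
- Related Report
  1991 Final Research Report Summary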
[Publications] M. Kohda: "A study on pruning strategies for DP beam search" Trans. IEICE. J72-D-II, 8. 1248-1255 (1989)
- Description
  From the Final Research Report Summary (English)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda: "A study on modifying pruning strategies for DP beam search at a present input frame" Trans. IEICE. J75-D-II, 1. 1-10 (1992)
- Description
  From the Final Research Report Summary (English)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, M. Katoh: "A study on utilizing a VQ-based preprocessor in DP beam search for speech recognition" IEICE, Technical Report. SP91-9. (1991)
- Description
  From the Final Research Report Summary (English)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, M. Katoh, K. Itoh: "A study on the dynamic programming best-first search using a cost estimate based on VQ distortion" IEICE, Technical Report. SP91-84. (1991)
- Description
  From the Final Research Report Summary (English)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, M. Katoh: "A study of word spotting with the dynamic programming best-first search algorithm" ASJ Meeting. 1-1-21. (1992)
- Description
  From the Final Research Report Summary (English)
- Related Report
  1991 Final Research Report Summary
[Publications] M. Kohda, T. Kitamura: "A study on Viterbi best-first search for isolated word recognition" ASJ Meeting. 1-1-25. (1992)
- Description
  From the Final Research Report Summary (English)
- Related Report
  1991 Final Research Report Summary
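[Publications] M. Kohda, M. Katoh: "A study on utilizing a VQ-based preprocessor in DP beam search for speech recognition" IEICE Technical Report (Trans. IEICE, presentation). SP91-9. 25-32 (1991)
- Related Report
  1991 Annual Research Report
[Publications] M. Kohda, M. Katoh, K. Itoh: "A study on the dynamic programming best-first search using a cost estimate based on VQ distortion" IEICE Technical Report (Trans. IEICE, presentation). SP91-84. 25-32 (1991)
- Related Report
  1991 Annual Research Report
[Publications] M. Kohda: "A study on modifying pruning strategies for DP beam search at a present input frame" Trans. IEICE (D-II). J75-D-II. 1-10 (1992)
- Related Report
  1991 Annual Research Report
[Publications] M. Kohda, M. Katoh: "A study of word spotting with the dynamic programming best-first search algorithm" IPSJ Tohoku Branch Research Meeting (Trans. IEICE, presentation). 91-3-6. 1-5 (1992)
- Related Report
  1991 Annual Research Report
[Publications] M. Kohda, T. Kitamura: "A study on Viterbi best-first search for isolated word recognition" IPSJ Tohoku Branch Research Meeting (Trans. IEICE, presentation). 91-3-7. 1-10 (1992)
- Related Report
  1991 Annual Research Report
[Publications] M. Kohda, M. Katoh, K. Itoh: "A study on methods of setting the estimated cost in DP best-first search" Proceedings of the ASJ Meeting. I. (1992)
- Related Report
  1991 Annual Research Report
[Publications] M. Kohda: "Isolated word recognition using DP beam search" Preprints of the Symposium of the Research Center for Applied Information Sciences, Tohoku University. 16. 19-26 (1990)
- Related Report
  1990 Annual Research Report
[Publications] M. Kohda: "A study on limiting the number of grid points in DP beam search" Proceedings of the 1990 ASJ Autumn Meeting. I. 91-92 (1990)
- Related Report
  1990 Annual Research Report
[Publications] M. Kohda: "A study on utilizing a VQ-based preprocessor in DP beam search for speech recognition" Proceedings of the 1991 ASJ Spring Meeting. I. (1991)
- Related Report
  1990 Annual Research Report
[Publications] M. Kohda: "A study on modifying pruning strategies for DP beam search at a present input frame" Trans. IEICE (D-II). (1991)
- Related Report
  1990 Annual Research Report
[Publications] M. Kohda: "A study on pruning strategies for DP beam search" Trans. IEICE (D-II). J72-D-II. 1248-1255 (1989)
- Related Report
  1989 Annual Research Report
[Publications] M. Kohda: "A study on a method of changing the threshold function of DP beam search partway through the input speech" IEICE Technical Report SP89-13. 89. 9-16 (1989)
- Related Report
  1989 Annual Research Report
[Publications] M. Kohda: "Isolated word recognition using DP beam search" Proceedings of the Symposium of the Research Center for Applied Information Sciences, Tohoku University. (1990)
- Related Report
  1989 Annual Research Report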
