• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Study of Effective Speech Recognition based on the Bi-directional Search Algorithm

Research Project

Project/Area Number 07680401
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionThe University of Tokushima

Principal Investigator

KITA Kenji  The University of Tokushima Information Science and Intelligent Systems Associate Professor, 工学部, 助教授 (10243734)

Project Period (FY) 1995 – 1997
Project Status Completed (Fiscal Year 1997)
Budget Amount *help
¥2,000,000 (Direct Cost: ¥2,000,000)
Fiscal Year 1997: ¥700,000 (Direct Cost: ¥700,000)
Fiscal Year 1996: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 1995: ¥700,000 (Direct Cost: ¥700,000)
Keywordsspeech recognition / bi-directional search / hidden Markov model / one-pass algorithm / acoustic model / language model / finite-state automaton / context-free grammar
Research Abstract

The most widely used search strategy in continuous speech recognition is the left-to-right time-synchronous search. However, the left-to-right strategy often produces erroneous results because it does not utilize a full path score. In this research project, we studied a bi-directional search algorithm consisting of a forward time-synchronous search and a backward time-asynchronous search. In the first pass, the HMM-based one-pass algorithm, guided by a finite-state automaton (FSA), is used time-asynchronously for preparing a partial path map. Here, the partial path map keeps recognition likelihood of all partial paths that lead to any grammar state at every time instant. After the first pass, a new, forward time-synchronous search, guided by a finite-state or context-free grammar (CFG), is performed for finding the best recognition hypothesis. The second pass uses the partial path map to compute the full path likelihood.
We implemented an experimental speech recognition system based on the above-mentioned bi-directional search algorithm. In the system, discrete HMMs were used as acoustic models. We also conducted experiments to compare the accuracy of several kinds of search strategies using the ATR speech database. FSA-based one-directional search attained an accuracy of 77.1%-87.3%, while FSA-backward CGF-forward search attained an accuracy of 88.6%. These results shows the effectiveness of the bi-directional search algorithm.

Report

(4 results)
  • 1997 Annual Research Report   Final Research Report Summary
  • 1996 Annual Research Report
  • 1995 Annual Research Report
  • Research Products

    (28 results)

All Other

All Publications (28 results)

  • [Publications] Kenji Kita et al.: "One-Pass Scarch Algorithm for Continuous Speech Recognition Using Generalized LR Parsing:A CFG-Driven.Frame-Synchronous HMM-Based Approach" Transactions of IPSJ. 36・5. 1252-1259 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita et al.: "VOICEDIC:A Practical Application of Speech Recognition Technology" Symbiosis of Human and Artifact. 535-540 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] 柘植 覚, 北 研二: "Forward-Backward探索に基づく連続音声認識" 電気関係学会四国支部連合大会講演論文集. 258-259 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita: "Mixture Probabilistic Context-Free Grammar:An Improvement of a Probabilistic Context-Free Grammar Using Cluster-Based Language Modeling" Journal of Natural Language Processing. 3・4. 103-113 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] kenji Kita et al.: "Automatic Acquisition of Probabilistic Dialogue Models" Proceedings of ICSLP96. 196-199 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] 小田 裕樹, 北 研二: "単語の位置情報に基づくコーパスからのコロケーションの自動抽出" 自然言語処理. 5・1. 79-99 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] 北 研二 他: "音声言語処理" 森北出版, 169 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita et al.: "One-Pass Search Algorithm for Continuous Speech Recognition Using Generalized LR Parsing : A CFG-Driven, Frame-Synchronous HMM-Based Approach" Transactions of IPSJ. Vol.36, No.5. 1252-1259 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita et al.: "VOICEDIC : A Practical Application of Speech Recognition Technology" Symbiosis of Human and Artifact. 535-540 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Satoru Tsuge, Kenji Kita: "Continuous Speech Recognition Based on Forward-Backward Search (in Japanese)" Proceedings of IEICE Shikoku Branch Meeting. 258-259 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita: "Mixture Probabilistic Context-Free Grammar : An Improvement of a Probabilistic Context-Free Grammar Using Cluster-Based Language Modeling" Journal of Natural Language Processing. Vol.3, No.4. 103-113 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita et al.: "Automatic Acquisition of Probabilistic Dialogue Models" Proceedings of ICSLP96. 196-199 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Hiroki Oda, Kenji Kita: "Automatically Extracting Collocations Based on Words Position Information in Corpora (in Jpanese)" Journal of Natural Language Processing. Vol.5, No.1. 79-99 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1997 Final Research Report Summary
  • [Publications] Kenji Kita et al.: "Collocations in Language Learuing : Corpus-Based Automatic Compilation of Collocations and Bilingual Collocation Concordancer" Computer Assisted Language Learning. 10・3. 229-238 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 北 研二: "確率的言語モデルに基づく多言語コーパスからの言語系統樹の再構築" 自然言語処理. 4・3. 71-82 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 北 研二 他: "発話タイプ付きコーパスを用いた確率的対話モデルの自動生成" 自然言語処理. 4・4. 73-85 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] Kenji Kita et al.: "A Probabilistic-model-based Language Clustering Approach : To Reconstruct Language System Tree from the Multilingual Corpus" Proceedings of JSCL-97. 109-114 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] Paliwal K.K., 北 研二 他: "自由発話音声認識における音響分析の比較" 日本音響学会講演論文集. 5-6 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 小田 裕樹, 北 研二: "単語の位置情報に基づくコーパスからのコロケーションの自動抽出" 自然言語処理. 5・1. 79-99 (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 拓殖 覚: "Forward-Backward探索に基づく連続音声認識" 電気関係学会四国支部連合大会講演論文集. 258-259 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Kenji Kita: "Mixture Probabilistic Context-Free Grammar:An Improvement of a Probabilistic Context-Free Grammar Using Cluster-Based Language Modeling" Journal of Natural Language Processing. 3・4. 103-113 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Kenji Kita: "Automatic Acquisition of Probabilistic Dialogue Models" Proceedings of ICSLP96. 196-199 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Kenji Kita: "Automatic Acquisition of Probabilistic Dialogue Models" Proceedings of IIZUKA96. 925-928 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Kenji Kita: "Improvement of a Probabilistic CFG Using a Cluster-Based Language Modeling Technique" Proceedings of IIZUKA96. 929-932 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Kenji Kita: "Dialogue Knowledge Acquisition from Annotated Corpora" Proceedings of IEEE SMC96. 556-561 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] Kenji Kita: "One-Pass Search Algorithm for Continuous Speech Recognition Using Generalized LR Parsing : A CFG-Driven,Frame-Synchronous HMM-Based Approach" Transactions of Information Processing Society of Japan. 36. 1252-1259 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] Kenji Kita: "Voicedic : A Practical Application of Speech Recognition Technology" Symbiosis of Human Artifact. 535-540 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] Tatsuya Iwasa: "Error Correction of Speech Recognition Outputs Using Generalized LR Parsing and Confusion Matrix" Proceedings of ROCLING VIII. 101-110 (1995)

    • Related Report
      1995 Annual Research Report

URL: 

Published: 1995-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi