• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1994 Fiscal Year Final Research Report Summary

Robust speech understanding against inter-speaker variation and ungrammatical utterances based on high-accuracy speech recognition and semantic driven parsing method

Research Project

Project/Area Number 05452357
Research Category

Grant-in-Aid for General Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field Intelligent informatics
Research InstitutionKYOTO UNIVERSITY

Principal Investigator

DOSHITA Shuji  Fac.Eng., Kyoto Univ., Professor, 工学部, 教授 (00025925)

Co-Investigator(Kenkyū-buntansha) ARAKI Masahiro  Fac.Eng., Kyoto Univ., Research Associate, 工学部, 助手 (50252490)
KAWAHARA Tatsuya  Fac.Eng., Kyoto Univ., Research Associate, 工学部, 助手 (00234104)
Project Period (FY) 1993 – 1994
KeywordsSpeech Recognition / Natural Language Understanding / Semantic Analysis / Inter-speaker Variation / Robust Parser
Research Abstract

The aim of this research is to construct a robust speech understanding system against inter-speaker variation and ungrammatical utterances. In order to implement such robust system, we develop a high accuracy speech recognizer with a speaker adaptation method and a semantic driven parsing method.
1.Speaker adaptation of HMM phoneme recognizer
We develop a speaker adaptation method using continuous speech input against inter-speaker variation. We use maximum a posteriori probability estimation to Continuous density Hidden Markov Model (HMM) based on Pair-Wise Bays Classifiers as the phone classifier. We performed experimental evaluation of adaptation to 8 speakers. As a result, the keywords recognition rate of the adapted model of a speaker reached 80.2 %, Which is higher by 11.0 % than that of the baseline model, while the accuracy is lowered for another speaker.
2.Word/Phrase spotting method
Even in spontaneous speech, most words and phrases are correctly uttered. Then, we need a word/phrase spotting method for robust parsing. In order to increase accuracy of these spotter, we develop a heuristic language model that models the rest of target word/phrase. Also, we implement a island-driven praser that can skip filled pauses and unknown words.
Robust speech parser by incremental analysis
In natural dialogues, fragmentary utterances are frequently used. Existing approach can hardly deal with these phenomena because it presupposes a complete sentence input. We try to use incremental parsing method with relaxation to such fragmentary utterances. For implementation, we use marker passer to integrate input fragment to recognized plan structure.

  • Research Products

    (12 results)

All Other

All Publications (12 results)

  • [Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づ話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッテグ" 電子情報通信学会論文誌. (採録決定). (1995)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] M.Araki: "Cooperative Spoken Dialogue Mod using Bayesian Network and Event Hierchy" Trans.IEICE. (to appear). (1995)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] T.Kawahara: "Heuristic search integrating syactic,semantic and dialog-level constrnts." Proc.IEEE Int'l Conf Acoust.,Spch & Signal Process.vol.2. 25-28 (1994)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] T.Kawahara: "Keywod and phrase spotting witheuristic language model." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 815-818 (1994)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] M.Araki: "A cooperative man-machine dialoe model for problem solving." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 883-886 (1994)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] T.Kawahara: "Continuous speech recognition based on A^* search with word-pair constraint as heuristics" Trans.IEICE. J77-DII. 1-8 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] T.Kawahara: "Word spotting in spontaneous speech with heuristic language model" Trans.IEICE. (to appear). (1995)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Araki: "Cooperative Spoken Dialogue Model using Baysian Network and Event Hierarchy" Trans.IEICE. (to appear). (1995)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] T.Kawahara: "Heuristic search integrating syntactic, semantic and dialog-level constraints." Proc.IEEE Int'l Conf. Acoust., Speech & Signal Process.Vol.2. 25-28 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 815-818 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Araki: "A cooperative man-machine dealogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 883-886 (1994)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 1996-04-15  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi