• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Robust speech understanding against inter-speaker variation and ungrammatical utterances based on high-accuracy speech recognition and semantic driven parsing method

Research Project

Project/Area Number 05452357
Research Category

Grant-in-Aid for General Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field Intelligent informatics
Research InstitutionKYOTO UNIVERSITY

Principal Investigator

DOSHITA Shuji  Fac.Eng., Kyoto Univ., Professor, 工学部, 教授 (00025925)

Co-Investigator(Kenkyū-buntansha) ARAKI Masahiro  Fac.Eng., Kyoto Univ., Research Associate, 工学部, 助手 (50252490)
KAWAHARA Tatsuya  Fac.Eng., Kyoto Univ., Research Associate, 工学部, 助手 (00234104)
北澤 茂良  静岡大学, 工学部, 助教授 (00109018)
Project Period (FY) 1993 – 1994
Project Status Completed (Fiscal Year 1994)
Budget Amount *help
¥5,200,000 (Direct Cost: ¥5,200,000)
Fiscal Year 1994: ¥1,500,000 (Direct Cost: ¥1,500,000)
Fiscal Year 1993: ¥3,700,000 (Direct Cost: ¥3,700,000)
KeywordsSpeech Recognition / Natural Language Understanding / Semantic Analysis / Inter-speaker Variation / Robust Parser / ロバストパーサ
Research Abstract

The aim of this research is to construct a robust speech understanding system against inter-speaker variation and ungrammatical utterances. In order to implement such robust system, we develop a high accuracy speech recognizer with a speaker adaptation method and a semantic driven parsing method.
1.Speaker adaptation of HMM phoneme recognizer
We develop a speaker adaptation method using continuous speech input against inter-speaker variation. We use maximum a posteriori probability estimation to Continuous density Hidden Markov Model (HMM) based on Pair-Wise Bays Classifiers as the phone classifier. We performed experimental evaluation of adaptation to 8 speakers. As a result, the keywords recognition rate of the adapted model of a speaker reached 80.2 %, Which is higher by 11.0 % than that of the baseline model, while the accuracy is lowered for another speaker.
2.Word/Phrase spotting method
Even in spontaneous speech, most words and phrases are correctly uttered. Then, we need a word/phrase spotting method for robust parsing. In order to increase accuracy of these spotter, we develop a heuristic language model that models the rest of target word/phrase. Also, we implement a island-driven praser that can skip filled pauses and unknown words.
Robust speech parser by incremental analysis
In natural dialogues, fragmentary utterances are frequently used. Existing approach can hardly deal with these phenomena because it presupposes a complete sentence input. We try to use incremental parsing method with relaxation to such fragmentary utterances. For implementation, we use marker passer to integrate input fragment to recognized plan structure.

Report

(3 results)
  • 1994 Annual Research Report   Final Research Report Summary
  • 1993 Annual Research Report
  • Research Products

    (24 results)

All Other

All Publications (24 results)

  • [Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づ話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッテグ" 電子情報通信学会論文誌. (採録決定). (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] M.Araki: "Cooperative Spoken Dialogue Mod using Bayesian Network and Event Hierchy" Trans.IEICE. (to appear). (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] T.Kawahara: "Heuristic search integrating syactic,semantic and dialog-level constrnts." Proc.IEEE Int'l Conf Acoust.,Spch & Signal Process.vol.2. 25-28 (1994)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] T.Kawahara: "Keywod and phrase spotting witheuristic language model." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 815-818 (1994)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] M.Araki: "A cooperative man-machine dialoe model for problem solving." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 883-886 (1994)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] T.Kawahara: "Continuous speech recognition based on A^* search with word-pair constraint as heuristics" Trans.IEICE. J77-DII. 1-8 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] T.Kawahara: "Word spotting in spontaneous speech with heuristic language model" Trans.IEICE. (to appear). (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] M.Araki: "Cooperative Spoken Dialogue Model using Baysian Network and Event Hierarchy" Trans.IEICE. (to appear). (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] T.Kawahara: "Heuristic search integrating syntactic, semantic and dialog-level constraints." Proc.IEEE Int'l Conf. Acoust., Speech & Signal Process.Vol.2. 25-28 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 815-818 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] M.Araki: "A cooperative man-machine dealogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 883-886 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1994 Final Research Report Summary
  • [Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づく会話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッティング" 電子情報通信学会論文誌. J78-DII(採録決定). (1995)

    • Related Report
      1994 Annual Research Report
  • [Publications] M.Araki: "Cooperative Spoken Dialogue Model Using Bayesian Network and Event Hierarchy" Trans,IEICE. (to appear). (1995)

    • Related Report
      1994 Annual Research Report
  • [Publications] T.Kawahara: "Heuristic search integrating syntactic,semantic and dialog-level constraints." Proc.IEEE Int'l Conf.Acoust,Speech & Signal Process.2. 25-28 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. 2. 815-818 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] M.Araki: "A cooperative man-machine dialogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. 2. 883-886 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] 河原達也,松本真治,堂下修司: "単語対制約をヒューリスティックとするA^*探索に基づく会話音声認識" 電子情報通信学会論文誌. J77-DII.No.1. 1-8 (1994)

    • Related Report
      1993 Annual Research Report
  • [Publications] Kawahara,Araki,Doshita.: "Reducing syntactic perplexity of user utterances with automaton dialogue model." Int'l Sympo.on Spoken Dialogue. 65-68 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 北岡教英,河原達也,堂下修司: "格構造を利用したright-to-left A^*探索に基づく会話音声認識." 電子情報通信学会技術報告. SP93-19. 41-48 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 河原達也,荒木雅弘: "Spontaneous speechの理解のための処理モデル." Spontaneous Speechの分析・理解・生成に関するシンポジウム. 49-53 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] 北岡教英,河原達也,堂下修司: "自由発話認識・理解のためのフレーズスポッティング." 電子情報通信学会技術報告. SP93-116. 15-22 (1993)

    • Related Report
      1993 Annual Research Report
  • [Publications] Araki,Kawahara,Doshita.: "A keyword-driven parser for spontaneous speech understanding." Int'l Sympo.on Spoken Dialogue. 113-116 (1993)

    • Related Report
      1993 Annual Research Report

URL: 

Published: 1993-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi