Robust speech understanding against inter-speaker variation and ungrammatical utterances based on high-accuracy speech recognition and semantic driven parsing method

Research Project

Project/Area Number	05452357
Research Category	Grant-in-Aid for General Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	Intelligent informatics
Research Institution	KYOTO UNIVERSITY
Principal Investigator	DOSHITA Shuji Fac.Eng., Kyoto Univ., Professor, 工学部, 教授 (00025925)
Co-Investigator(Kenkyū-buntansha)	ARAKI Masahiro Fac.Eng., Kyoto Univ., Research Associate, 工学部, 助手 (50252490) KAWAHARA Tatsuya Fac.Eng., Kyoto Univ., Research Associate, 工学部, 助手 (00234104) 北澤茂良静岡大学, 工学部, 助教授 (00109018)
Project Period (FY)	1993 – 1994
Project Status	Completed (Fiscal Year 1994)
Budget Amount *help	¥5,200,000 (Direct Cost: ¥5,200,000) Fiscal Year 1994: ¥1,500,000 (Direct Cost: ¥1,500,000) Fiscal Year 1993: ¥3,700,000 (Direct Cost: ¥3,700,000)
Keywords	Speech Recognition / Natural Language Understanding / Semantic Analysis / Inter-speaker Variation / Robust Parser / ロバストパーサ
Research Abstract	The aim of this research is to construct a robust speech understanding system against inter-speaker variation and ungrammatical utterances. In order to implement such robust system, we develop a high accuracy speech recognizer with a speaker adaptation method and a semantic driven parsing method. 1.Speaker adaptation of HMM phoneme recognizer We develop a speaker adaptation method using continuous speech input against inter-speaker variation. We use maximum a posteriori probability estimation to Continuous density Hidden Markov Model (HMM) based on Pair-Wise Bays Classifiers as the phone classifier. We performed experimental evaluation of adaptation to 8 speakers. As a result, the keywords recognition rate of the adapted model of a speaker reached 80.2 %, Which is higher by 11.0 % than that of the baseline model, while the accuracy is lowered for another speaker. 2.Word/Phrase spotting method Even in spontaneous speech, most words and phrases are correctly uttered. Then, we need a word/phrase spotting method for robust parsing. In order to increase accuracy of these spotter, we develop a heuristic language model that models the rest of target word/phrase. Also, we implement a island-driven praser that can skip filled pauses and unknown words. Robust speech parser by incremental analysis In natural dialogues, fragmentary utterances are frequently used. Existing approach can hardly deal with these phenomena because it presupposes a complete sentence input. We try to use incremental parsing method with relaxation to such fragmentary utterances. For implementation, we use marker passer to integrate input fragment to recognized plan structure.

Report

(3 results)

1994 Annual Research Report Final Research Report Summary
1993 Annual Research Report

Research Products
(24 results)

All Other

All Publications (24 results)

[Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づ話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッテグ" 電子情報通信学会論文誌. (採録決定). (1995)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] M.Araki: "Cooperative Spoken Dialogue Mod using Bayesian Network and Event Hierchy" Trans.IEICE. (to appear). (1995)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] T.Kawahara: "Heuristic search integrating syactic,semantic and dialog-level constrnts." Proc.IEEE Int'l Conf Acoust.,Spch & Signal Process.vol.2. 25-28 (1994)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] T.Kawahara: "Keywod and phrase spotting witheuristic language model." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 815-818 (1994)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] M.Araki: "A cooperative man-machine dialoe model for problem solving." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 883-886 (1994)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] T.Kawahara: "Continuous speech recognition based on A^* search with word-pair constraint as heuristics" Trans.IEICE. J77-DII. 1-8 (1994)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] T.Kawahara: "Word spotting in spontaneous speech with heuristic language model" Trans.IEICE. (to appear). (1995)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] M.Araki: "Cooperative Spoken Dialogue Model using Baysian Network and Event Hierarchy" Trans.IEICE. (to appear). (1995)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] T.Kawahara: "Heuristic search integrating syntactic, semantic and dialog-level constraints." Proc.IEEE Int'l Conf. Acoust., Speech & Signal Process.Vol.2. 25-28 (1994)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 815-818 (1994)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] M.Araki: "A cooperative man-machine dealogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 883-886 (1994)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1994 Final Research Report Summary
[Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づく会話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)
- Related Report
  1994 Annual Research Report
[Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッティング" 電子情報通信学会論文誌. J78-DII(採録決定). (1995)
- Related Report
  1994 Annual Research Report
[Publications] M.Araki: "Cooperative Spoken Dialogue Model Using Bayesian Network and Event Hierarchy" Trans,IEICE. (to appear). (1995)
- Related Report
  1994 Annual Research Report
[Publications] T.Kawahara: "Heuristic search integrating syntactic,semantic and dialog-level constraints." Proc.IEEE Int'l Conf.Acoust,Speech & Signal Process.2. 25-28 (1994)
- Related Report
  1994 Annual Research Report
[Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. 2. 815-818 (1994)
- Related Report
  1994 Annual Research Report
[Publications] M.Araki: "A cooperative man-machine dialogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. 2. 883-886 (1994)
- Related Report
  1994 Annual Research Report
[Publications] 河原達也,松本真治,堂下修司: "単語対制約をヒューリスティックとするA^*探索に基づく会話音声認識" 電子情報通信学会論文誌. J77-DII.No.1. 1-8 (1994)
- Related Report
  1993 Annual Research Report
[Publications] Kawahara,Araki,Doshita.: "Reducing syntactic perplexity of user utterances with automaton dialogue model." Int'l Sympo.on Spoken Dialogue. 65-68 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 北岡教英,河原達也,堂下修司: "格構造を利用したright-to-left A^*探索に基づく会話音声認識." 電子情報通信学会技術報告. SP93-19. 41-48 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 河原達也,荒木雅弘: "Spontaneous speechの理解のための処理モデル." Spontaneous Speechの分析・理解・生成に関するシンポジウム. 49-53 (1993)
- Related Report
  1993 Annual Research Report
[Publications] 北岡教英,河原達也,堂下修司: "自由発話認識・理解のためのフレーズスポッティング." 電子情報通信学会技術報告. SP93-116. 15-22 (1993)
- Related Report
  1993 Annual Research Report
[Publications] Araki,Kawahara,Doshita.: "A keyword-driven parser for spontaneous speech understanding." Int'l Sympo.on Spoken Dialogue. 113-116 (1993)
- Related Report
  1993 Annual Research Report

Robust speech understanding against inter-speaker variation and ungrammatical utterances based on high-accuracy speech recognition and semantic driven parsing method

Principal Investigator

DOSHITA Shuji Fac.Eng., Kyoto Univ., Professor, 工学部, 教授 (00025925)

¥5,200,000 (Direct Cost: ¥5,200,000)

Report

Research Products

[Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づ話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)

Description

Related Report

[Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッテグ" 電子情報通信学会論文誌. (採録決定). (1995)

Description

Related Report

[Publications] M.Araki: "Cooperative Spoken Dialogue Mod using Bayesian Network and Event Hierchy" Trans.IEICE. (to appear). (1995)

Description

Related Report

[Publications] T.Kawahara: "Heuristic search integrating syactic,semantic and dialog-level constrnts." Proc.IEEE Int'l Conf Acoust.,Spch & Signal Process.vol.2. 25-28 (1994)

Description

Related Report

[Publications] T.Kawahara: "Keywod and phrase spotting witheuristic language model." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 815-818 (1994)

Description

Related Report

[Publications] M.Araki: "A cooperative man-machine dialoe model for problem solving." Proc.Int'l Conf.on Spoken Langue Processing.vol.2. 883-886 (1994)

Description

Related Report

[Publications] T.Kawahara: "Continuous speech recognition based on A^* search with word-pair constraint as heuristics" Trans.IEICE. J77-DII. 1-8 (1994)

Description

Related Report

[Publications] T.Kawahara: "Word spotting in spontaneous speech with heuristic language model" Trans.IEICE. (to appear). (1995)

Description

Related Report

[Publications] M.Araki: "Cooperative Spoken Dialogue Model using Baysian Network and Event Hierarchy" Trans.IEICE. (to appear). (1995)

Description

Related Report

[Publications] T.Kawahara: "Heuristic search integrating syntactic, semantic and dialog-level constraints." Proc.IEEE Int'l Conf. Acoust., Speech & Signal Process.Vol.2. 25-28 (1994)

Description

Related Report

[Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 815-818 (1994)

Description

Related Report

[Publications] M.Araki: "A cooperative man-machine dealogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. Vol.2. 883-886 (1994)

Description

Related Report

[Publications] 河原達也: "単語対制約をヒューリスティックとするA^*探索に基づく会話音声認識" 電子情報通信学会論文誌. J77-DII,1. 1-8 (1994)

Related Report

[Publications] 河原達也: "ヒューリスティックな言語モデルを用いた会話音声中の単語スポッティング" 電子情報通信学会論文誌. J78-DII(採録決定). (1995)

Related Report

[Publications] M.Araki: "Cooperative Spoken Dialogue Model Using Bayesian Network and Event Hierarchy" Trans,IEICE. (to appear). (1995)

Related Report

[Publications] T.Kawahara: "Heuristic search integrating syntactic,semantic and dialog-level constraints." Proc.IEEE Int'l Conf.Acoust,Speech & Signal Process.2. 25-28 (1994)

Related Report

[Publications] T.Kawahara: "Keyword and phrase spotting with heuristic language model." Proc.Int'l Conf.on Spoken Language Processing. 2. 815-818 (1994)

Related Report

[Publications] M.Araki: "A cooperative man-machine dialogue model for problem solving." Proc.Int'l Conf.on Spoken Language Processing. 2. 883-886 (1994)

Related Report

[Publications] 河原達也,松本真治,堂下修司: "単語対制約をヒューリスティックとするA^*探索に基づく会話音声認識" 電子情報通信学会論文誌. J77-DII.No.1. 1-8 (1994)

Related Report

[Publications] Kawahara,Araki,Doshita.: "Reducing syntactic perplexity of user utterances with automaton dialogue model." Int'l Sympo.on Spoken Dialogue. 65-68 (1993)

Related Report

[Publications] 北岡教英,河原達也,堂下修司: "格構造を利用したright-to-left A^*探索に基づく会話音声認識." 電子情報通信学会技術報告. SP93-19. 41-48 (1993)

Related Report

[Publications] 河原達也,荒木雅弘: "Spontaneous speechの理解のための処理モデル." Spontaneous Speechの分析・理解・生成に関するシンポジウム. 49-53 (1993)

Related Report

[Publications] 北岡教英,河原達也,堂下修司: "自由発話認識・理解のためのフレーズスポッティング." 電子情報通信学会技術報告. SP93-116. 15-22 (1993)

Related Report

[Publications] Araki,Kawahara,Doshita.: "A keyword-driven parser for spontaneous speech understanding." Int'l Sympo.on Spoken Dialogue. 113-116 (1993)

Related Report