Development of a Speech Understanding system and a Spoken Dialog system

Research Project

Project/Area Number	02555067
Research Category	Grant-in-Aid for Developmental Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	情報工学
Research Institution	Toyohashi University of Technology
Principal Investigator	NAKAGAWA Seiichi Toyohashi University of Technology, Department of Information & Computer Sciences, Professor, 工学部, 教授 (20115893)
Co-Investigator(Kenkyū-buntansha)	HAMADA Masahiro Matsushita Electric Industrial Co.,LTD, Central Research Laboratories, Researche, 中央研究所, 研究員 TSUBOKA Eiichi Matsushita Electric Industrial Co., LTD, Central Research, 中央研究所, 室長 YAMAMOTO Mikio Toyohashi University of Technology, Department of Information & Computer Science, 工学部, 助手 (40210562)
Project Period (FY)	1990 – 1992
Project Status	Completed (Fiscal Year 1992)
Budget Amount *help	¥10,600,000 (Direct Cost: ¥10,600,000) Fiscal Year 1992: ¥1,000,000 (Direct Cost: ¥1,000,000) Fiscal Year 1991: ¥1,500,000 (Direct Cost: ¥1,500,000) Fiscal Year 1990: ¥8,100,000 (Direct Cost: ¥8,100,000)
Keywords	speech recognition / speech understanding / spoken dialog / hidden Markov model / syntactic analysis / dialog model / 並列処理 / 構文分析
Research Abstract	We developed the spoken Japanese dialog system. This dialog system is in the closed world of sightseeing guide. The system guides the information about singhtseeing, and user can input to the system through natural language speech. This sysem consists of speech recognition part, sentence understanding part, dialog proessing part, user utterance prediction part, and so on. The speech recognition part recognized the input speech using syllable HMMs (Hidden Markov Model) that model the syllables of speech. CFG (Context Free Grammar) is used for modeling the linguistical restriction of user utterances. In the sentence understanding part, the text obtained form the speech recognition is processed using Japanese lexicon and KAKARIUKE rules (dependency grammar), then transformed to the semantic network using case frames. In the dialog processing part, the ellipsis complement and pronoun reference are performed, then the dialog is proceeded by the interpretation of the dialog rules. This dialog rules can easily adjusted to the various situations. In the dialog, ambiguities of meanings of input sentences often occur. The part of dialog for clarification and verification is performed to disambiguate them. The system leads the user and asks the user a question positively to get the information for the disambiguation. There process can make the dialog certainly. On such a limitative task domain, however, user tends to speak various sentence types, so it is difficult to recognize the speech correctly. The user utterance prediction part predicts the word/syntax of user's utterance for the system's response to improve the reliability of spoken dialog between the system and user. On the system evaluation, we got the enough speech recognition rate for progressing the dialog, The dialog system could converse with a user naturally.

Report

(4 results)

1992 Annual Research Report Final Research Report Summary
1991 Annual Research Report
1990 Annual Research Report

Research Products
(32 results)

All Other

All Publications (32 results)

[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] 山本幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] 山本幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Mikio YAMAMOTO: "A Spoken dialog system with verification and Clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] 中川聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Seiichi Nakagawa: "Syllable recognition by hidden Markov model using fixed-length segmental statistics" IEICE Trans.Vol.75-DII, No.5. 843-851 (1992)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Mikio Yamamoto: "An efficient sub-system of doxastic model logic" IPSJ Trans.Vol.33, No.10. 1193-1202 (1992)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Mikio Yamamoto: "An analysis and parsing method of the omission of postposition and inversion of Japanese spoken sentence in dialog" JPSJ Trans.Vol.33, No.11. 1322-1330 (1992)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Mikio Yamamoto: "A spoken dialog system with verification and clarification queries" IEICE Trans.Vol.E76-DII, No.1. 84-94 (1992)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Seiichi Nakagawa: "Estimation of probability density function and a posteriori probability by neural networks, and vowel recognition" IEICE Trans.Vol.76-DII.
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Seiichi Nakagawa: "Context-free grammar driven, frame-synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-DII.
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-DII.
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1992 Final Research Report Summary
[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)
- Related Report
  1992 Annual Research Report
[Publications] 山本幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)
- Related Report
  1992 Annual Research Report
[Publications] 山本幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)
- Related Report
  1992 Annual Research Report
[Publications] Mikio Yamamoto: "A Spokin dialog systim with verification and clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)
- Related Report
  1992 Annual Research Report
[Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)
- Related Report
  1992 Annual Research Report
[Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)
- Related Report
  1992 Annual Research Report
[Publications] 中川聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)
- Related Report
  1992 Annual Research Report
[Publications] 中川聖一: "連続出力分布型HMMの話者適応化による日本語音韻・音節認識" 日本音響学会誌. 47. 459-467 (1991)
- Related Report
  1991 Annual Research Report
[Publications] Seiichi Nakagawa: "Comparison of syntax-oriented spoken Japanese understanding system with semantic oriented system." 電子情報通信学会論文誌. E74. 1854-1862 (1991)
- Related Report
  1991 Annual Research Report
[Publications] Seiichi Nakagawa: "Comparison of language models by context-free grammar and quasi/simplified-trigram" 電子情報通信学会論文誌. E74. 1897-1906 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 中川聖一: "シ-ケンシャルニュ-ラルネットワ-クを用いた音声認識" 電子情報通信学会論文誌. 74-DII. 1174-1183 (1991)
- Related Report
  1991 Annual Research Report
[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. (1992)
- Related Report
  1991 Annual Research Report
[Publications] 中川聖一・村瀬功: "連続音声認識システムの評価法ータスクの複雑性と文認識率との関係" 電子情報通信学会論文誌. 72DーII. 683-693 (1990)
- Related Report
  1990 Annual Research Report
[Publications] 中川聖一・平田好充・橋本秦秀: "連続出力分布型HMMによる日本語音韻認識の検討" 日本音響学会誌. 46. 486-496 (1990)
- Related Report
  1990 Annual Research Report
[Publications] S.Nakagagawa,Y.Ueda: "Automatic Extroction of phonotactics based on Hidden Markov Models and Langwage I olenti fiction" Studia Phonlogica. 24. (1991)
- Related Report
  1990 Annual Research Report
[Publications] S.Nakagawa,Y.Hashimoto: "Segmentation of Continuows Speech by HMM and Bayesian Probabity." System and Computers in Japan. 21. 23-32 (1990)
- Related Report
  1990 Annual Research Report
[Publications] 中川聖一,竹本信治,田口勝豊: "交通規則文に関する質問応答システムLICENCEにおける日本語文からの一階述語論理式への変換" 情報処理学会論文誌. 32. (1991)
- Related Report
  1990 Annual Research Report
[Publications] 中川聖一・鹿野清宏・東倉洋一: "音声・聴覚と神経回路綱モデル" オ-ム社, 235 (1990)
- Related Report
  1990 Annual Research Report

Development of a Speech Understanding system and a Spoken Dialog system

Principal Investigator

NAKAGAWA Seiichi Toyohashi University of Technology, Department of Information & Computer Sciences, Professor, 工学部, 教授 (20115893)

¥10,600,000 (Direct Cost: ¥10,600,000)

Report

Research Products

[Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

Description

Related Report

[Publications] 山本 幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

Description

Related Report

[Publications] 山本 幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

Description

Related Report

[Publications] Mikio YAMAMOTO: "A Spoken dialog system with verification and Clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)

Description

Related Report

[Publications] 中川 聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

Description

Related Report

[Publications] 中川 聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

Description

Related Report

[Publications] 中川 聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

Description

Related Report

[Publications] Seiichi Nakagawa: "Syllable recognition by hidden Markov model using fixed-length segmental statistics" IEICE Trans.Vol.75-DII, No.5. 843-851 (1992)

Description

Related Report

[Publications] Mikio Yamamoto: "An efficient sub-system of doxastic model logic" IPSJ Trans.Vol.33, No.10. 1193-1202 (1992)

Description

Related Report

[Publications] Mikio Yamamoto: "An analysis and parsing method of the omission of postposition and inversion of Japanese spoken sentence in dialog" JPSJ Trans.Vol.33, No.11. 1322-1330 (1992)

Description

Related Report

[Publications] Mikio Yamamoto: "A spoken dialog system with verification and clarification queries" IEICE Trans.Vol.E76-DII, No.1. 84-94 (1992)

Description

Related Report

[Publications] Seiichi Nakagawa: "Estimation of probability density function and a posteriori probability by neural networks, and vowel recognition" IEICE Trans.Vol.76-DII.

Description

Related Report

[Publications] Seiichi Nakagawa: "Context-free grammar driven, frame-synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-DII.

Description

Related Report

[Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-DII.

Description

Related Report

[Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

Related Report

[Publications] 山本 幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

Related Report

[Publications] 山本 幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

Related Report

[Publications] Mikio Yamamoto: "A Spokin dialog systim with verification and clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)

Related Report

[Publications] 中川 聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

Related Report

[Publications] 中川 聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

Related Report

[Publications] 中川 聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

Related Report

[Publications] 中川 聖一: "連続出力分布型HMMの話者適応化による日本語音韻・音節認識" 日本音響学会誌. 47. 459-467 (1991)

Related Report

[Publications] Seiichi Nakagawa: "Comparison of syntax-oriented spoken Japanese understanding system with semantic oriented system." 電子情報通信学会論文誌. E74. 1854-1862 (1991)

Related Report

[Publications] Seiichi Nakagawa: "Comparison of language models by context-free grammar and quasi/simplified-trigram" 電子情報通信学会論文誌. E74. 1897-1906 (1991)

Related Report

[Publications] 中川 聖一: "シ-ケンシャルニュ-ラルネットワ-クを用いた音声認識" 電子情報通信学会論文誌. 74-DII. 1174-1183 (1991)

Related Report

[Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. (1992)

Related Report

[Publications] 中川 聖一・村瀬 功: "連続音声認識システムの評価法ータスクの複雑性と文認識率との関係" 電子情報通信学会論文誌. 72DーII. 683-693 (1990)

Related Report

[Publications] 中川 聖一・平田 好充・橋本 秦秀: "連続出力分布型HMMによる日本語音韻認識の検討" 日本音響学会誌. 46. 486-496 (1990)

Related Report

[Publications] S.Nakagagawa,Y.Ueda: "Automatic Extroction of phonotactics based on Hidden Markov Models and Langwage I olenti fiction" Studia Phonlogica. 24. (1991)

Related Report

[Publications] S.Nakagawa,Y.Hashimoto: "Segmentation of Continuows Speech by HMM and Bayesian Probabity." System and Computers in Japan. 21. 23-32 (1990)

Related Report

[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

[Publications] 山本幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

[Publications] 山本幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

[Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

[Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

[Publications] 中川聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

[Publications] 山本幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

[Publications] 山本幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

[Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

[Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

[Publications] 中川聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

[Publications] 中川聖一: "連続出力分布型HMMの話者適応化による日本語音韻・音節認識" 日本音響学会誌. 47. 459-467 (1991)

[Publications] 中川聖一: "シ-ケンシャルニュ-ラルネットワ-クを用いた音声認識" 電子情報通信学会論文誌. 74-DII. 1174-1183 (1991)

[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. (1992)

[Publications] 中川聖一・村瀬功: "連続音声認識システムの評価法ータスクの複雑性と文認識率との関係" 電子情報通信学会論文誌. 72DーII. 683-693 (1990)

[Publications] 中川聖一・平田好充・橋本秦秀: "連続出力分布型HMMによる日本語音韻認識の検討" 日本音響学会誌. 46. 486-496 (1990)

[Publications] 中川聖一,竹本信治,田口勝豊: "交通規則文に関する質問応答システムLICENCEにおける日本語文からの一階述語論理式への変換" 情報処理学会論文誌. 32. (1991)

[Publications] 中川聖一・鹿野清宏・東倉洋一: "音声・聴覚と神経回路綱モデル" オ-ム社, 235 (1990)