1992 Fiscal Year Final Research Report Summary

Development of a Speech Understanding system and a Spoken Dialog system

Research Project

Project/Area Number	02555067
Research Category	Grant-in-Aid for Developmental Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	情報工学
Research Institution	Toyohashi University of Technology
Principal Investigator	NAKAGAWA Seiichi Toyohashi University of Technology, Department of Information & Computer Sciences, Professor, 工学部, 教授 (20115893)
Co-Investigator(Kenkyū-buntansha)	HAMADA Masahiro Matsushita Electric Industrial Co.,LTD, Central Research Laboratories, Researche, 中央研究所, 研究員 TSUBOKA Eiichi Matsushita Electric Industrial Co., LTD, Central Research, 中央研究所, 室長 YAMAMOTO Mikio Toyohashi University of Technology, Department of Information & Computer Science, 工学部, 助手 (40210562)
Project Period (FY)	1990 – 1992
Keywords	speech recognition / speech understanding / spoken dialog / hidden Markov model / syntactic analysis / dialog model
Research Abstract	We developed the spoken Japanese dialog system. This dialog system is in the closed world of sightseeing guide. The system guides the information about singhtseeing, and user can input to the system through natural language speech. This sysem consists of speech recognition part, sentence understanding part, dialog proessing part, user utterance prediction part, and so on. The speech recognition part recognized the input speech using syllable HMMs (Hidden Markov Model) that model the syllables of speech. CFG (Context Free Grammar) is used for modeling the linguistical restriction of user utterances. In the sentence understanding part, the text obtained form the speech recognition is processed using Japanese lexicon and KAKARIUKE rules (dependency grammar), then transformed to the semantic network using case frames. In the dialog processing part, the ellipsis complement and pronoun reference are performed, then the dialog is proceeded by the interpretation of the dialog rules. This dialog rules can easily adjusted to the various situations. In the dialog, ambiguities of meanings of input sentences often occur. The part of dialog for clarification and verification is performed to disambiguate them. The system leads the user and asks the user a question positively to get the information for the disambiguation. There process can make the dialog certainly. On such a limitative task domain, however, user tends to speak various sentence types, so it is difficult to recognize the speech correctly. The user utterance prediction part predicts the word/syntax of user's utterance for the system's response to improve the reliability of spoken dialog between the system and user. On the system evaluation, we got the enough speech recognition rate for progressing the dialog, The dialog system could converse with a user naturally.

Research Products
(14 results)

All Other

All Publications (14 results)

[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 山本幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 山本幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Mikio YAMAMOTO: "A Spoken dialog system with verification and Clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 中川聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Seiichi Nakagawa: "Syllable recognition by hidden Markov model using fixed-length segmental statistics" IEICE Trans.Vol.75-DII, No.5. 843-851 (1992)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Mikio Yamamoto: "An efficient sub-system of doxastic model logic" IPSJ Trans.Vol.33, No.10. 1193-1202 (1992)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Mikio Yamamoto: "An analysis and parsing method of the omission of postposition and inversion of Japanese spoken sentence in dialog" JPSJ Trans.Vol.33, No.11. 1322-1330 (1992)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Mikio Yamamoto: "A spoken dialog system with verification and clarification queries" IEICE Trans.Vol.E76-DII, No.1. 84-94 (1992)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Seiichi Nakagawa: "Estimation of probability density function and a posteriori probability by neural networks, and vowel recognition" IEICE Trans.Vol.76-DII.
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Seiichi Nakagawa: "Context-free grammar driven, frame-synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-DII.
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-DII.
- Description
  「研究成果報告書概要(欧文)」より

1992 Fiscal Year Final Research Report Summary

Development of a Speech Understanding system and a Spoken Dialog system

Principal Investigator

NAKAGAWA Seiichi Toyohashi University of Technology, Department of Information & Computer Sciences, Professor, 工学部, 教授 (20115893)

Research Products

[Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

Description

[Publications] 山本 幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

Description

[Publications] 山本 幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

Description

[Publications] Mikio YAMAMOTO: "A Spoken dialog system with verification and Clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)

Description

[Publications] 中川 聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

Description

[Publications] 中川 聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

Description

[Publications] 中川 聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

Description

[Publications] Seiichi Nakagawa: "Syllable recognition by hidden Markov model using fixed-length segmental statistics" IEICE Trans.Vol.75-DII, No.5. 843-851 (1992)

Description

[Publications] Mikio Yamamoto: "An efficient sub-system of doxastic model logic" IPSJ Trans.Vol.33, No.10. 1193-1202 (1992)

Description

[Publications] Mikio Yamamoto: "An analysis and parsing method of the omission of postposition and inversion of Japanese spoken sentence in dialog" JPSJ Trans.Vol.33, No.11. 1322-1330 (1992)

Description

[Publications] Mikio Yamamoto: "A spoken dialog system with verification and clarification queries" IEICE Trans.Vol.E76-DII, No.1. 84-94 (1992)

Description

[Publications] Seiichi Nakagawa: "Estimation of probability density function and a posteriori probability by neural networks, and vowel recognition" IEICE Trans.Vol.76-DII.

Description

[Publications] Seiichi Nakagawa: "Context-free grammar driven, frame-synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-DII.

Description

[Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-DII.

Description

[Publications] 中川聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

[Publications] 山本幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

[Publications] 山本幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

[Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

[Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

[Publications] 中川聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)