• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1992 Fiscal Year Final Research Report Summary

Development of a Speech Understanding system and a Spoken Dialog system

Research Project

Project/Area Number 02555067
Research Category

Grant-in-Aid for Developmental Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field 情報工学
Research InstitutionToyohashi University of Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University of Technology, Department of Information & Computer Sciences, Professor, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) HAMADA Masahiro  Matsushita Electric Industrial Co.,LTD, Central Research Laboratories, Researche, 中央研究所, 研究員
TSUBOKA Eiichi  Matsushita Electric Industrial Co., LTD, Central Research, 中央研究所, 室長
YAMAMOTO Mikio  Toyohashi University of Technology, Department of Information & Computer Science, 工学部, 助手 (40210562)
Project Period (FY) 1990 – 1992
Keywordsspeech recognition / speech understanding / spoken dialog / hidden Markov model / syntactic analysis / dialog model
Research Abstract

We developed the spoken Japanese dialog system. This dialog system is in the closed world of sightseeing guide. The system guides the information about singhtseeing, and user can input to the system through natural language speech. This sysem consists of speech recognition part, sentence understanding part, dialog proessing part, user utterance prediction part, and so on.
The speech recognition part recognized the input speech using syllable HMMs (Hidden Markov Model) that model the syllables of speech. CFG (Context Free Grammar) is used for modeling the linguistical restriction of user utterances.
In the sentence understanding part, the text obtained form the speech recognition is processed using Japanese lexicon and KAKARIUKE rules (dependency grammar), then transformed to the semantic network using case frames.
In the dialog processing part, the ellipsis complement and pronoun reference are performed, then the dialog is proceeded by the interpretation of the dialog rules. This dialog rules can easily adjusted to the various situations.
In the dialog, ambiguities of meanings of input sentences often occur. The part of dialog for clarification and verification is performed to disambiguate them. The system leads the user and asks the user a question positively to get the information for the disambiguation. There process can make the dialog certainly.
On such a limitative task domain, however, user tends to speak various sentence types, so it is difficult to recognize the speech correctly. The user utterance prediction part predicts the word/syntax of user's utterance for the system's response to improve the reliability of spoken dialog between the system and user.
On the system evaluation, we got the enough speech recognition rate for progressing the dialog, The dialog system could converse with a user naturally.

  • Research Products

    (14 results)

All Other

All Publications (14 results)

  • [Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 山本 幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 山本 幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Mikio YAMAMOTO: "A Spoken dialog system with verification and Clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川 聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川 聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川 聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Seiichi Nakagawa: "Syllable recognition by hidden Markov model using fixed-length segmental statistics" IEICE Trans.Vol.75-DII, No.5. 843-851 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Mikio Yamamoto: "An efficient sub-system of doxastic model logic" IPSJ Trans.Vol.33, No.10. 1193-1202 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Mikio Yamamoto: "An analysis and parsing method of the omission of postposition and inversion of Japanese spoken sentence in dialog" JPSJ Trans.Vol.33, No.11. 1322-1330 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Mikio Yamamoto: "A spoken dialog system with verification and clarification queries" IEICE Trans.Vol.E76-DII, No.1. 84-94 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Estimation of probability density function and a posteriori probability by neural networks, and vowel recognition" IEICE Trans.Vol.76-DII.

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Context-free grammar driven, frame-synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-DII.

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-DII.

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 1994-03-24  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi