• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Development of a Speech Understanding system and a Spoken Dialog system

Research Project

Project/Area Number 02555067
Research Category

Grant-in-Aid for Developmental Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field 情報工学
Research InstitutionToyohashi University of Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University of Technology, Department of Information & Computer Sciences, Professor, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) HAMADA Masahiro  Matsushita Electric Industrial Co.,LTD, Central Research Laboratories, Researche, 中央研究所, 研究員
TSUBOKA Eiichi  Matsushita Electric Industrial Co., LTD, Central Research, 中央研究所, 室長
YAMAMOTO Mikio  Toyohashi University of Technology, Department of Information & Computer Science, 工学部, 助手 (40210562)
Project Period (FY) 1990 – 1992
Project Status Completed (Fiscal Year 1992)
Budget Amount *help
¥10,600,000 (Direct Cost: ¥10,600,000)
Fiscal Year 1992: ¥1,000,000 (Direct Cost: ¥1,000,000)
Fiscal Year 1991: ¥1,500,000 (Direct Cost: ¥1,500,000)
Fiscal Year 1990: ¥8,100,000 (Direct Cost: ¥8,100,000)
Keywordsspeech recognition / speech understanding / spoken dialog / hidden Markov model / syntactic analysis / dialog model / 並列処理 / 構文分析
Research Abstract

We developed the spoken Japanese dialog system. This dialog system is in the closed world of sightseeing guide. The system guides the information about singhtseeing, and user can input to the system through natural language speech. This sysem consists of speech recognition part, sentence understanding part, dialog proessing part, user utterance prediction part, and so on.
The speech recognition part recognized the input speech using syllable HMMs (Hidden Markov Model) that model the syllables of speech. CFG (Context Free Grammar) is used for modeling the linguistical restriction of user utterances.
In the sentence understanding part, the text obtained form the speech recognition is processed using Japanese lexicon and KAKARIUKE rules (dependency grammar), then transformed to the semantic network using case frames.
In the dialog processing part, the ellipsis complement and pronoun reference are performed, then the dialog is proceeded by the interpretation of the dialog rules. This dialog rules can easily adjusted to the various situations.
In the dialog, ambiguities of meanings of input sentences often occur. The part of dialog for clarification and verification is performed to disambiguate them. The system leads the user and asks the user a question positively to get the information for the disambiguation. There process can make the dialog certainly.
On such a limitative task domain, however, user tends to speak various sentence types, so it is difficult to recognize the speech correctly. The user utterance prediction part predicts the word/syntax of user's utterance for the system's response to improve the reliability of spoken dialog between the system and user.
On the system evaluation, we got the enough speech recognition rate for progressing the dialog, The dialog system could converse with a user naturally.

Report

(4 results)
  • 1992 Annual Research Report   Final Research Report Summary
  • 1991 Annual Research Report
  • 1990 Annual Research Report
  • Research Products

    (32 results)

All Other

All Publications (32 results)

  • [Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] 山本 幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] 山本 幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Mikio YAMAMOTO: "A Spoken dialog system with verification and Clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] 中川 聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] 中川 聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] 中川 聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Seiichi Nakagawa: "Syllable recognition by hidden Markov model using fixed-length segmental statistics" IEICE Trans.Vol.75-DII, No.5. 843-851 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Mikio Yamamoto: "An efficient sub-system of doxastic model logic" IPSJ Trans.Vol.33, No.10. 1193-1202 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Mikio Yamamoto: "An analysis and parsing method of the omission of postposition and inversion of Japanese spoken sentence in dialog" JPSJ Trans.Vol.33, No.11. 1322-1330 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Mikio Yamamoto: "A spoken dialog system with verification and clarification queries" IEICE Trans.Vol.E76-DII, No.1. 84-94 (1992)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Seiichi Nakagawa: "Estimation of probability density function and a posteriori probability by neural networks, and vowel recognition" IEICE Trans.Vol.76-DII.

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Seiichi Nakagawa: "Context-free grammar driven, frame-synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-DII.

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-DII.

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1992 Final Research Report Summary
  • [Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. 75-DII. 843-851 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] 山本 幹雄: "信念様相論理の効率的な部分系" 情報処理学会論文誌. 33. 1193-1202 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] 山本 幹雄: "音声対話文における助詞落ち・倒置の分析と解析手法" 情報処理学会論文誌. 33. 1322-1330 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] Mikio Yamamoto: "A Spokin dialog systim with verification and clarification queries" IEICE Trans.Inf & Syst.E76-D. 84-94 (1993)

    • Related Report
      1992 Annual Research Report
  • [Publications] 中川 聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. (1993)

    • Related Report
      1992 Annual Research Report
  • [Publications] 中川 聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. (1993)

    • Related Report
      1992 Annual Research Report
  • [Publications] 中川 聖一: "情報理論の基礎と応用" 近代科学社, 239 (1992)

    • Related Report
      1992 Annual Research Report
  • [Publications] 中川 聖一: "連続出力分布型HMMの話者適応化による日本語音韻・音節認識" 日本音響学会誌. 47. 459-467 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] Seiichi Nakagawa: "Comparison of syntax-oriented spoken Japanese understanding system with semantic oriented system." 電子情報通信学会論文誌. E74. 1854-1862 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] Seiichi Nakagawa: "Comparison of language models by context-free grammar and quasi/simplified-trigram" 電子情報通信学会論文誌. E74. 1897-1906 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] 中川 聖一: "シ-ケンシャルニュ-ラルネットワ-クを用いた音声認識" 電子情報通信学会論文誌. 74-DII. 1174-1183 (1991)

    • Related Report
      1991 Annual Research Report
  • [Publications] 中川 聖一: "固定長セグメントの統計量を用いたHMMによる音節認識" 電子情報通信学会論文誌. (1992)

    • Related Report
      1991 Annual Research Report
  • [Publications] 中川 聖一・村瀬 功: "連続音声認識システムの評価法ータスクの複雑性と文認識率との関係" 電子情報通信学会論文誌. 72DーII. 683-693 (1990)

    • Related Report
      1990 Annual Research Report
  • [Publications] 中川 聖一・平田 好充・橋本 秦秀: "連続出力分布型HMMによる日本語音韻認識の検討" 日本音響学会誌. 46. 486-496 (1990)

    • Related Report
      1990 Annual Research Report
  • [Publications] S.Nakagagawa,Y.Ueda: "Automatic Extroction of phonotactics based on Hidden Markov Models and Langwage I olenti fiction" Studia Phonlogica. 24. (1991)

    • Related Report
      1990 Annual Research Report
  • [Publications] S.Nakagawa,Y.Hashimoto: "Segmentation of Continuows Speech by HMM and Bayesian Probabity." System and Computers in Japan. 21. 23-32 (1990)

    • Related Report
      1990 Annual Research Report
  • [Publications] 中川 聖一,竹本 信治,田口 勝豊: "交通規則文に関する質問応答システムLICENCEにおける日本語文からの一階述語論理式への変換" 情報処理学会論文誌. 32. (1991)

    • Related Report
      1990 Annual Research Report
  • [Publications] 中川 聖一・鹿野 清宏・東倉 洋一: "音声・聴覚と神経回路綱モデル" オ-ム社, 235 (1990)

    • Related Report
      1990 Annual Research Report

URL: 

Published: 1990-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi