• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1993 Fiscal Year Final Research Report Summary

A Study on Ambiguous Utterance Understanding for Speech Input

Research Project

Project/Area Number 03452167
Research Category

Grant-in-Aid for General Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field 情報工学
Research InstitutionToyohashi University of Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) YAMAMOTO Mikio  Toyohashi University of Technology, Faculty of Engineering, Assistant, 工学部, 助手 (40210562)
INOUE Katsumi  Toyohashi University of Technology, Faculty of Engineering, Lecture, 工学部, 講師 (10252321)
Project Period (FY) 1991 – 1993
KeywordsSpeech Input / Speech Recognition / Dialog System / Spoken Dialog / Ambiguous Input / Speech Understanding / Language Understaning
Research Abstract

We proposed an unsupervised speaker adaptation method on sequencial concatenation training that used the theory of MAPE(Maximum A Posteriori probabitity Estimation) for continuous parameter HMM.In this method, we should only specify the syllable label sequence for the utterrance. The label sequences were provided automatically by the recognizer which used a speaker-independent model in advance. The experimental results on continuous speech recognition showed that the better model gave a performance comparable to that of supervised adaptation.
Secondly, we proposed a method to process interjection and unknown words so that a speech recognition system could deal with spontaneous speech in dialog. We have evaluated the peerformance of our speech recognition system using test sentence sets including interjection or unknown words, and confirmed that the proposed method worked well.
Thirdly we investigated the menu-guided spoken natural language understanding system that could understand all user's inputs. This work was motivated by the following fact that a user could not understand what to say or how to say to a computer in natural language. The system displays a menu that consists of acceptable content words and the usur chooses one word from the menu and speaks out phrase that includes the word. The experimental showed that our system performed well for the novice users.

  • Research Products

    (12 results)

All Other

All Publications (12 results)

  • [Publications] 中川聖一: "ニューラルネットワークによる確率密度関数・事後確率の推定と母音認識" 電子情報通信学会論文誌. 76-DII. 1081-1089 (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "ワードスポッティング法を用いた文脈自由文法制御フレーム同期型HMM連続音声認識法" 電子情報通信学会論文誌. 76-DII. 1329-1336 (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "文脈自由文法制御によるOne Pass型連続音声認識" 電子情報通信学会論文誌. 76-DII. 1337-1345 (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "最大事後確率推定法を用いた連続出力分布型HMMの適応化" 日本音響学会誌. 49. 721-728 (1993)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "エルゴディックHMMとその状態シーケンスを用いた音声による言語の識別" 電子情報通信学会論文誌. J77-A. 182-189 (1994)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "不特定話者の数字音声認識によるHMMと確率文脈自由文法の比較" 電子情報通信学会論文誌. J77-DII. 263-270 (1994)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Seiichi Nakagawa: "Estimation of probability density function and posteriori Vowel recognition" IEICE Trans.Vol.76-D II, No.6. 1081-1089 (1993)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Context-free grammar driven, frame synchronous HMM-based continuous speech recognition methods using word spotting" IEICE Trans.Vol.76-D II, No.7. 1329-1336 (1993)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "A context-free grammar driven, one pass HMM-based continuous speech recognition method" IEICE Trans.Vol.76-D II, No.7. 1337-1345 (1993)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Spoken adaptation for continuous parameter HMM using maximum a posteriori probability estimation" Journal of Acoustic Society of Japan. Vol.49, No.10. 721-728 (1993)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Spoken language identification by ergodic HMMs and its state sequences" IEICE Trans.Vol.77-A, No.2. 182-189 (1994)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Comparison of HMM and SCFG by speaker independent spoken digit recognition" IEICE Trans.Vol.77-D II, No.2. 263-270 (1994)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 1995-03-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi