Grant-in-Aid for Overseas Scientific Survey.
|Research Institution||Osaka University|
MIZOGUCHI Riichiro The Institute of Scientific and Industrial Research, Osaka University, 産業科学研究所, 教授 (20116106)
YUNGーHWAN Oh 韓国科学技術院, 電子計算機学科, 助教授
山下 洋一 大阪大学, 産業科学研究所, 助手 (80174689)
池田 満 大阪大学, 産業科学研究所, 助手 (80212786)
OH YungーHwan 韓国科学技術院, 電子計算機学科, 教授
来村 徳信 大阪大学, 産業科学研究所, 助手 (20252710)
KITAMURA Yoshinobu The Institute of Scientific and Industrial Research, Osaka University
OH Yung hwan Department of Computer Science, Korea Advanced Institute of Scientific and Techn
YAMASHITA Yoichi The Institute of Scientific and Industrial Research, Osaka University
IKEDA Mitsuru The Institute of Scientific and Industrial Research, Osaka University
|Project Fiscal Year
1992 – 1993
Completed(Fiscal Year 1993)
|Budget Amount *help
¥8,200,000 (Direct Cost : ¥8,200,000)
Fiscal Year 1993 : ¥2,000,000 (Direct Cost : ¥2,000,000)
Fiscal Year 1992 : ¥6,200,000 (Direct Cost : ¥6,200,000)
|Keywords||Speech Recognition / Speech Understanding / Korean Language / Fuzzy / Dialog Model / ATMS / 音声認識 / 音声理解 / 韓国語 / ファジイ / 対話モデル / ファジィ|
The objective of this research is development of fundamental techniques necessary to understanding spoken dialogue, which include knowledge-based speech recognition system, non-monotonic reasoning in natural language processing, and dialogue modeling. The following are the summary of the research results.
1) We verified the efficiency of the knowledge-based approach for Korean speech recognition. Furthermore, some new ideas were proposed to improve the speech recognition. To avoid the difficulties in segmentation, a non-uniform unit is introduced. Every unit has its stationary point at each end of the unit, and transient part in the middle. The parameter trajectory is described by symbolic representation and fuzzy linguistic variables. Redundancy of speech data is used to improve the performance of the recognition system in the post-processor. The prototype system was tested with continuous Korean digit speech of unknown length, and the recognition rate of 97% was obtained.
2) Understanding of continuous speech is generally a tough problem, since acoustic information is unreliable. An efficient search mechanism is indispensable because the combination of ambiguous information is very large. Then, we developed a framework of speech understanding system based on ATMS, which is a method of non-monotonic reasoning. The introduction of ATMS reduced elapsed time of natural language processing from 64 sec to 45 sec for understanding speech of 8 Japanese sentences.
3) Two kinds of dialogue model characterizing structures in dialogue were proposed for understanding spoken dialogue. One is the SR-plan model which describes utterance pairs composed of the stimulus and the response. The other is Topic Packet Network (TPN) and corresponds to the discourse segments. A mechanism for predicting the next utterance was also developed based on these dialogue models and evaluated on some sample dialogues.