Project/Area Number |
10650358
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
情報通信工学
|
Research Institution | Fukui University |
Principal Investigator |
TANIGUCHI Shuji (1999-2000) Fukui University, Faculty of Engineering, Associate Professor, 工学部, 助教授 (70115301)
小泉 卓也 (1998) 福井大学, 工学部, 教授 (80020204)
|
Co-Investigator(Kenkyū-buntansha) |
MORI Mikio Fukui University, Faculty of Engineering, Assistant, 工学部, 助手 (70313731)
谷口 秀次 福井大学, 工学部, 助教授 (70115301)
|
Project Period (FY) |
1998 – 2000
|
Project Status |
Completed (Fiscal Year 2000)
|
Budget Amount *help |
¥3,500,000 (Direct Cost: ¥3,500,000)
Fiscal Year 2000: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 1999: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 1998: ¥1,700,000 (Direct Cost: ¥1,700,000)
|
Keywords | subword / learning vector quantization / auditory model / recurrent neural network / hidden Markov model / speaker-independency / speaker adaptation / robust isolated word recognition / 連続音声認識 / サブワード境界抽出 / 不特定話者音声認識 / 雑音耐性 / サブワード境界検出 / 音声認識 / 不特定話者 / 話者依存性 / セグメンテーション / SCHMM / 連結HMM / マルチHMM |
Research Abstract |
Our final goal is to develop a reliable continuous speech recognition system based on a model of human auditory system. So, we have studied as follows : (1) On the base of a subword-unit-based isolated word recognizer (VQ-SWR) with the discrete hidden Markov models (DHMMs) as a recognition tool, which we developed before, the research to improve the robustness for speakers and some environment noises have been done. As experimental results, findings can be summarized as follows : [1] A new recognizer with the DHMMs replaced with the semi-continuous HMMs have been developed. Experimental results showed a considerable improvement of the new recognizer in speakerindependency. [2] We have developed a new subword-unit-based isolated word recognizer incorporated a multiparty and a speaker adaptation step on the base of the VQ-SWR.This is made up of DHMMs and a learning vector quantizer (LVQ) incorporated a feedback of information on the classification of input subword which is obtained from the
… More
output of the LVQ.Experimental results showed that the new recognizer performance including the robustness for speaker and noise in stationary states is higher than those accomplished with the conventional recognizer VQ-SWR. (2) To aim at achieving higher word recognition rates and higher noise robustness than the VQ-SWR, we have proposed a new recognizer (CM-RN-SWR) made up of a model (NLF-COM) of human cochlea called "a nonlinear feedback model for cochlea", a simple multi-layer recurrent neural network (RNN) which has feedback connections of self-loop type, and DHMMs for words. The NLF-COM and the RNN which were developed before by us has been used as a model of the human auditory system, and as a kind of spectrum analyzer for speech sounds and a subword recognizer, respectively. Experimental results showed that recognition accuracies for clean speech and speech in the presence of pseud-white noise are considerably improved in speaker-dependent applications in comparison with the VQ-SWR. Less
|