• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2002 Fiscal Year Final Research Report Summary

High performance speech and gesture recognition based on the stochastic model with mutual state-observation-dependencies

Research Project

Project/Area Number 12680399
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionWaseda University

Principal Investigator

KOBAYASHI Tetsunori  School of Science and Engineering, Professor, 理工学部, 教授 (30162001)

Project Period (FY) 2000 – 2002
Keywordsstochastic model / acoustic model / PHMM / SPHMM / speech recognition / gesture recognition
Research Abstract

Aiming at treating more complicated temporal changes of stochastic phenomena, Partly-Hidden Markov Model (PHMM), is proposed and applied to speech and gesture recognition. It can treat the observation dependent behaviors in both observations and state transitions. Some simulation experiments showed the high potential of PHMM. In addition, from the gesture recognition and the isolated spoken word recognition experiments, PHMM showed the performance to exceed HMM.
In the formulation of original PHMM, we used common pair of hidden state and observable state to determine the stochastic phenomena of the observation and the state transition. In the formulation modified here, we use common hidden state but different observable state for the observation and for the state transition separately. This slight modification brought the big flexibility in the modeling of phenomena and reduced the word errors compared with HMM and traditional PHMM using continuous speech.
We also proposed Smoothed Partly-Hidden Markov Model (SPHMM), in which the observation and state transition probabilities are defined by the geometric means of PHMM-based ones and HMM-based ones. From continuous speech recognition experiments, it was found that SPHMM gave the best performance compared with HMM and PHMM when the weight of smoothing was set adequately.

  • Research Products

    (12 results)

All Other

All Publications (12 results)

  • [Publications] 益満健, 小林哲則: "部分隠れマルコフモデルとそのジェスチャ認識への応用"情報処理学会論文誌. Vol.41. 3060-3069 (2000)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 古山純子, 小林哲則: "部分隠れマルコフモデルによる単語音声認識"電子情報通信学会論文誌(D-II). No.11. 2379-2387 (2000)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] T.Ogawa, T.Kobayashi: "Generalization of State-Observation-Dependency in Partly-Hidden Markov Models"Proc.ICSLP2002. VOLUME4. 2673-2676 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] T.Ogawa, T.Kobayashi: "Hybrid Modeling of PHMM and HMM for Speech Recognition"Proc.ICASSP2003. (未定). (2003)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 小川哲司, 小林哲則: "部分隠れマルコフモデルによる連続音声認識"電子情報通信学会 技術研究報告. SP2002-40. 25-30 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 小川哲司, 小林哲則: "部分隠れマルコフモデルの拡張と連続音声認識による評価"音響学会秋季研究発表会講演論文集. 1-9-26. 51-52 (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Ken Masumitsu, Tetsunori Kobayashi: "Partly-Hidden Markov Model and Its Application to Gesture Recognition"IPSJ JOURNAL. Vol. 41, No. 11. 3060-3069 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Junko Furuyama, Tetsunori Kobayashi: "Spoken Word Recognition Using Partly-Hidden Markov Models"IEICE Trans. (D-II). Vol. J83-D-II, No. 11. 2379-2387 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Generalization of State Observation Dependency in Partly-Hidden Markov Models"IEEE Proc. ICSLP2002. VOLUME 4. 2673-2676 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Hybrid Modeling of PHMM and HMM for Speech Recognition"IEEE Proc. ICASSP2003. (2003)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Continuous Speech Recognition Using Partly-Hidden Markov Models"IEICE Technical Report. SP2002-40. 25-30 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Extension of Partly-Hidden Markov Models and evaluation using the continuous speech recognition"ASJ Proc. Autumn Meeting of ASJ. 1-9-20. 51-52 (2002)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2004-04-14  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi