• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

High performance speech and gesture recognition based on the stochastic model with mutual state-observation-dependencies

Research Project

Project/Area Number 12680399
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionWaseda University

Principal Investigator

KOBAYASHI Tetsunori  School of Science and Engineering, Professor, 理工学部, 教授 (30162001)

Project Period (FY) 2000 – 2002
Project Status Completed (Fiscal Year 2002)
Budget Amount *help
¥3,600,000 (Direct Cost: ¥3,600,000)
Fiscal Year 2002: ¥800,000 (Direct Cost: ¥800,000)
Fiscal Year 2001: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 2000: ¥1,900,000 (Direct Cost: ¥1,900,000)
Keywordsstochastic model / acoustic model / PHMM / SPHMM / speech recognition / gesture recognition / 時系列パターン認識
Research Abstract

Aiming at treating more complicated temporal changes of stochastic phenomena, Partly-Hidden Markov Model (PHMM), is proposed and applied to speech and gesture recognition. It can treat the observation dependent behaviors in both observations and state transitions. Some simulation experiments showed the high potential of PHMM. In addition, from the gesture recognition and the isolated spoken word recognition experiments, PHMM showed the performance to exceed HMM.
In the formulation of original PHMM, we used common pair of hidden state and observable state to determine the stochastic phenomena of the observation and the state transition. In the formulation modified here, we use common hidden state but different observable state for the observation and for the state transition separately. This slight modification brought the big flexibility in the modeling of phenomena and reduced the word errors compared with HMM and traditional PHMM using continuous speech.
We also proposed Smoothed Partly-Hidden Markov Model (SPHMM), in which the observation and state transition probabilities are defined by the geometric means of PHMM-based ones and HMM-based ones. From continuous speech recognition experiments, it was found that SPHMM gave the best performance compared with HMM and PHMM when the weight of smoothing was set adequately.

Report

(4 results)
  • 2002 Annual Research Report   Final Research Report Summary
  • 2001 Annual Research Report
  • 2000 Annual Research Report
  • Research Products

    (20 results)

All Other

All Publications (20 results)

  • [Publications] 益満健, 小林哲則: "部分隠れマルコフモデルとそのジェスチャ認識への応用"情報処理学会論文誌. Vol.41. 3060-3069 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] 古山純子, 小林哲則: "部分隠れマルコフモデルによる単語音声認識"電子情報通信学会論文誌(D-II). No.11. 2379-2387 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] T.Ogawa, T.Kobayashi: "Generalization of State-Observation-Dependency in Partly-Hidden Markov Models"Proc.ICSLP2002. VOLUME4. 2673-2676 (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] T.Ogawa, T.Kobayashi: "Hybrid Modeling of PHMM and HMM for Speech Recognition"Proc.ICASSP2003. (未定). (2003)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] 小川哲司, 小林哲則: "部分隠れマルコフモデルによる連続音声認識"電子情報通信学会 技術研究報告. SP2002-40. 25-30 (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] 小川哲司, 小林哲則: "部分隠れマルコフモデルの拡張と連続音声認識による評価"音響学会秋季研究発表会講演論文集. 1-9-26. 51-52 (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Ken Masumitsu, Tetsunori Kobayashi: "Partly-Hidden Markov Model and Its Application to Gesture Recognition"IPSJ JOURNAL. Vol. 41, No. 11. 3060-3069 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Junko Furuyama, Tetsunori Kobayashi: "Spoken Word Recognition Using Partly-Hidden Markov Models"IEICE Trans. (D-II). Vol. J83-D-II, No. 11. 2379-2387 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Generalization of State Observation Dependency in Partly-Hidden Markov Models"IEEE Proc. ICSLP2002. VOLUME 4. 2673-2676 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Hybrid Modeling of PHMM and HMM for Speech Recognition"IEEE Proc. ICASSP2003. (2003)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Continuous Speech Recognition Using Partly-Hidden Markov Models"IEICE Technical Report. SP2002-40. 25-30 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Extension of Partly-Hidden Markov Models and evaluation using the continuous speech recognition"ASJ Proc. Autumn Meeting of ASJ. 1-9-20. 51-52 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Generalization of State-Observation-Dependency in Partly Hidden Markov Models"ICSLP2002. 2673-2676 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tetsuji Ogawa, Tetsunori Kobayashi: "Hybrid Modeling of PHMM and HMM for Speech Recognition"ICASSP2003. (CD-ROM). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] 小川哲司, 小林哲則: "部分隠れマルコフモデルによる連続音声認識"電子情報通信学会 技術研究報告. SP2002-40. 25-30 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] 小川哲司, 小林哲則: "部分隠れマルコフモデルの連続音声認識による評価"日本音響学会秋期研究発表会講演論文集. 51-52 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] 牛久祐輔, 小川哲司, 小林哲則: "複数の話者依存モデルを用いた話者空間表現に基づく話者適応"日本音響学会秋季研究発表会講演論文集. 3-1-9. 129-130 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 古山純子,小林哲則: "部分隠れマルコフモデルによる単語音声認識"電子情報通信学会論文誌DII. Vol.J83-D-II,No.11. 2379-2387 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 益満健,小林哲則: "部分隠れマルコフモデルとそのジェスチャ認識への応用"情報処理学会論文誌. Vol.41,No.11. 3060-3069 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 小川哲司,小林哲則: "音素単位の部分隠れマルコフモデルにおける状態・出力依存関係の一般化"日本音響学会秋季研究発表会講演論文集. 1-5-10. 19-20 (2000)

    • Related Report
      2000 Annual Research Report

URL: 

Published: 2000-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi