• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1997 Fiscal Year Final Research Report Summary

Continuous speech recognition with adaptabilty to the speaking rate of an input speech

Research Project

Project/Area Number 07458064
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionTohoku University

Principal Investigator

MAKINO Shozo  Tohoku Univ., Computer Center, Prof., 大型計算機センター, 教授 (00089806)

Co-Investigator(Kenkyū-buntansha) SUZUKI Motoyuki  Tohoku Univ., Computer Center, Research Associ., 大型計算機センター, 助手 (30282015)
SONE Hideaki  Tohoku Univ.Graduate School of Information Sceiences Assosci.Prof., 情報科学研究科, 助教授 (40134019)
Project Period (FY) 1995 – 1997
Keywordscontinuous speech recognition / phoneme recognition / speaking rate / speakaer adaptation
Research Abstract

This tesearch developed a spoken word recognition system which used phoneme duration information estimated from the speaking rate of an input speech. In this research, the speaking rate is assumed to be reflected to the average vowel length. The acoustic processor transforms the input speech into a similarity matrix using the modified LVQ2. The average vowel length is computed from the preliminary recognition result. The duration of each phoneme in each word template is estimated from the average length of vowels in the input speech. By taking into account the estimated phoneme duration, the spoken word recognition experiments were carried out using the DTW.The word recognition score was 97.3% for the 212 word vocabulary uttered by 5 male speakers (test set). The phoneme duration information is collected from the 212 word vocabulary uttered by another 5 male and 10 female speakers (training set). The hybrid combination of the prceiding phoneme dependent estimation and the follwoing phoneme dependent estimation gave the best performance.
The above-mentioned method was extended to phoneme recognition. The phoneme accuracy increased from 71.8% to 86.3% for phonemes in the 212 word vocabulary uttered by 5 male speakers (test set).

  • Research Products

    (6 results)

All Other

All Publications (6 results)

  • [Publications] M.SUZUKI, S.MAKINO et al.: "A New HMnet Constrution Algorithm Requiring No Contextual Factors" IEICE Trans.on Information and Systems. E78-D, 6. 662-668 (1995)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] H.MORI, H.ASO, S.MAKINO: "Robust n-gram Model of Japanese Character and its Application to Document Recognition" IEICE Trans.on Information and Systems. E79-D, 5. 471-476 (1996)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Y.Okimoto, S.Makino: "Phoneme recogniton using reference patterns constructed with discriminative traning and DP matching" Jour.Acoust.Soc.America. 100, 4. 2791-2791 (1996)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] M.SUZUKI,S.MAKINO,A.ITO,H.ASO,H.SHIMODAIRA: "A New HMnet Construction Algorithm Requiring No Contextual Factors" IEICE Trans.on Information and Systems. E78-D,6. 662-668 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] H.MORI,H.ASO,S.MAKINO: "Robust n-gram Model of Japanese Character and its application to Document Recognition" IEICE Trans.on Information and Systems. E79-D,5. 471-476 (1996)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Y.Okimoto, S.Makino: "Phoneme recognition using reference patterns constructed with discriminative training and DP matching." Jour.Acoust.Soc.America. 100,4. 2791-2791 (1996)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 1999-03-16  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi