1997 Fiscal Year Final Research Report Summary

Continuous speech recognition with adaptability to the speaking rate of the input speech

Research Project

Project/Area Number	07458064
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Research Field	Intelligent informatics
Research Institution	Tohoku University
Principal Investigator	MAKINO Shozo, Tohoku Univ., Computer Center, Professor (00089806)
Co-Investigator(Kenkyū-buntansha)	SUZUKI Motoyuki, Tohoku Univ., Computer Center, Research Associate (30282015); SONE Hideaki, Tohoku Univ., Graduate School of Information Sciences, Associate Professor (40134019)
Project Period (FY)	1995 – 1997
Keywords	continuous speech recognition / phoneme recognition / speaking rate / speaker adaptation
Research Abstract	This research developed a spoken word recognition system that uses phoneme duration information estimated from the speaking rate of the input speech. The speaking rate is assumed to be reflected in the average vowel length. The acoustic processor transforms the input speech into a similarity matrix using the modified LVQ2. The average vowel length is computed from a preliminary recognition result, and the duration of each phoneme in each word template is then estimated from that average. Spoken word recognition experiments were carried out using dynamic time warping (DTW) with the estimated phoneme durations taken into account. The word recognition score was 97.3% for a 212-word vocabulary uttered by 5 male speakers (test set). The phoneme duration information was collected from the 212-word vocabulary uttered by another 5 male and 10 female speakers (training set). A hybrid combination of preceding-phoneme-dependent estimation and following-phoneme-dependent estimation gave the best performance. The method was also extended to phoneme recognition, where the phoneme accuracy increased from 71.8% to 86.3% for phonemes in the 212-word vocabulary uttered by 5 male speakers (test set).
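The duration-estimation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the estimation formula, so the sketch assumes a simple linear model in which each phoneme's expected duration is a fixed ratio of the average vowel length. The function names, the `ratios` table, and the example values are hypothetical, and the context-dependent (preceding/following phoneme) hybrid estimation is not modeled.

```python
from typing import Dict, List, Tuple

# Assumed label set: the five Japanese vowels.
VOWELS = {"a", "i", "u", "e", "o"}

# One segment of a preliminary recognition result:
# (phoneme label, duration in frames).
Segment = Tuple[str, int]


def average_vowel_length(segments: List[Segment]) -> float:
    """Mean duration of the vowel segments in a preliminary result."""
    vowel_durations = [dur for label, dur in segments if label in VOWELS]
    if not vowel_durations:
        raise ValueError("no vowel segments in the preliminary result")
    return sum(vowel_durations) / len(vowel_durations)


def estimate_template_durations(
    template: List[str],
    ratios: Dict[str, float],
    avg_vowel: float,
) -> List[float]:
    """Expected duration of each phoneme in a word template.

    ratios[p] is a hypothetical training-set statistic: the mean duration
    of phoneme p divided by the mean vowel duration (assumed linear model).
    """
    return [ratios[p] * avg_vowel for p in template]


if __name__ == "__main__":
    # Hypothetical preliminary segmentation of an input utterance.
    preliminary = [("k", 6), ("a", 10), ("w", 4), ("a", 14)]
    avg = average_vowel_length(preliminary)

    # Hypothetical duration ratios relative to the average vowel length.
    ratios = {"k": 0.5, "a": 1.0, "w": 0.25}
    print(avg, estimate_template_durations(["k", "a", "w", "a"], ratios, avg))
```

In a recognizer along the lines of the abstract, the estimated durations would then constrain or penalize the DTW alignment of each word template; how that constraint was applied is not stated in this summary.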

Research Products
(6 results)

All Publications (6 results)

[Publications] M.SUZUKI, S.MAKINO et al.: "A New HMnet Construction Algorithm Requiring No Contextual Factors" IEICE Trans. on Information and Systems. E78-D, 6. 662-668 (1995)
- Description
  From the "Summary of Research Results Report (Japanese)"
[Publications] H.MORI, H.ASO, S.MAKINO: "Robust n-gram Model of Japanese Character and its Application to Document Recognition" IEICE Trans. on Information and Systems. E79-D, 5. 471-476 (1996)
- Description
  From the "Summary of Research Results Report (Japanese)"
[Publications] Y.OKIMOTO, S.MAKINO: "Phoneme recognition using reference patterns constructed with discriminative training and DP matching" Jour. Acoust. Soc. America. 100, 4. 2791-2791 (1996)
- Description
  From the "Summary of Research Results Report (Japanese)"
[Publications] M.SUZUKI, S.MAKINO, A.ITO, H.ASO, H.SHIMODAIRA: "A New HMnet Construction Algorithm Requiring No Contextual Factors" IEICE Trans. on Information and Systems. E78-D, 6. 662-668 (1995)
- Description
  From the "Summary of Research Results Report (English)"
[Publications] H.MORI, H.ASO, S.MAKINO: "Robust n-gram Model of Japanese Character and its Application to Document Recognition" IEICE Trans. on Information and Systems. E79-D, 5. 471-476 (1996)
- Description
  From the "Summary of Research Results Report (English)"
[Publications] Y.OKIMOTO, S.MAKINO: "Phoneme recognition using reference patterns constructed with discriminative training and DP matching" Jour. Acoust. Soc. America. 100, 4. 2791-2791 (1996)
- Description
  From the "Summary of Research Results Report (English)"
