2001 Fiscal Year Final Research Report Summary

Computer-Assisted Pronunciation Learning System using Speech Recognition Techniaues

Research Project

Project/Area Number	11558037
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	展開研究
Research Field	Intelligent informatics
Research Institution	KYOTO UNIVERSITY
Principal Investigator	KAWAHARA Tatsuya Kyoto University, Graduate School of Informatics, Associate Professor, 情報学研究科, 助教授 (00234104)
Co-Investigator(Kenkyū-buntansha)	KATAGIRI Shigeru NTT Communication Science Laboratories, Executive Manager, コミュニケーション科学基礎研究所, 研究部長 DOSHITA Shuji Ryukoku University, Faculty of Science and Technology, Professor, 理工学部, 教授 (00025925) DANTSUJI Masatake Kyoto University, Center for Information and Multimedia Studies, Professor, 総合情報メディアセンター, 教授 (10188469) SHIMIZU Masaaki Kyoto University, Center for Information and Multimedia Studies, Assistant Professor, 総合情報メディアセンター, 助手 (10314262) OKUNO Hiroshi Kyoto University, Graduate School of Informatics, Professor, 情報学研究科, 教授 (60318201)
Project Period (FY)	1999 – 2001
Keywords	speech processing / language learning / CALL / speech recognition / phonology / prosody
Research Abstract	A Computer-Assisted Language Learning (CALL) system focusing pronunciation training is studied for English learning by Japanese students. First, we model typical English pronunciation errors of Japanese learners and design a system that detects pronunciation errors and generates -effective instruction utilizing speech recognition technologies. For a given training text, a network of error candidates is generated for speech recognition to align the utterance and detect errors. Then, a segment-input pair-wise classifier is applied forverification. This method realizes reliable errordetectionandeffective instruction based on articulatory information. Then, we develop a computer-assisted English prosody learning system. Learners' pronunciation is evaluated by automatic detection of sentence stressed syllables and foot durations. Syllable HMMs are categorized based on error patterns of stress. We also propose a method of multi-stage discrimination that reflects native speakers' perception. Furthermore, foot templates are constructed from native speech database in order to evaluate stress-timing. Finally, we study to estimate non-native speakers' intelligibility and to determine which pronunciation errors affect intelligibility the most. A preliminary study showed that error rates computed by a speech recognition-based system can be used to characterize intelligibility. We use the error rate distributions to assess the student's intelligibility and compute a priority function to find which areas of study are most likely to improve the intelligibility.

Research Products
(14 results)

All Other

All Publications (14 results)

[Publications] C.-H.Jo: "Japanese pronunciation instruction system using speech recognition methods"IEICE Trans.. E83-D,11. 1960-1968 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Y.Tsubota: "Computer-assisted english vowel learning system for Japanese speakers using cross language formant structures"Proc. Int'l Conf. Spoken Language Processing (ICSLP). 3. 56-569 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] K.Imoto: "Modelling of the perception of english sentence stress for computer-assisted language learning"Proc. Int'l Conf. Spoken Language Processing (ICSLP). 3. 175-178 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] A.Raux: "Optimizing computer-assisted pronunciation instruction by selecting relevant training topics"InSTIL 2002 Advanced Workshop. (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] Y.Tsubota: "CALL system for Japanese students of English using formant structure estimation and pronunciation error prediction"InSTIL 2002 Advanced Workshop. (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 河原達也: "日本語ディクテーション基本ソフトウェア(99年度版)"日本音響学会誌. 57・3. 210-214 (2001)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 鹿野清宏: "音声認識システム"オーム社. 200 (2001)
- Description
  「研究成果報告書概要(和文)」より
[Publications] C.-H.Jo, T.Kawahara, S.Doshita, and M.Dantsuji: "Japanese pronunciation instruction system using speech recognition methods"IEICE Trans.. Vol.E83-D, No.11. 1960-1968 (2000)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] A. Raux and T. Kawahara: "Optimizing computer-assisted pronunciation instruction by selecting relevant training topics"InSTIL 2002 Advanced Workshop. (2002)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Y.Tsubota, T.Kawahara, and M.Dantsuji: "CALL system for Japanese students of English using formant structure estimation and pronunciation error prediction"InSTIL 2002 Advanced Workshop. (2002)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] Y.Tsubota, M.Dantsuji, and T.Kawahara.: "Computer-assisted English vowel learning system for Japanese speakers using cross language formant structures"Proc. ICSLP. Vol.3. 566-569 (2000)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] K. Imoto, M.Dantsuji, and T.Kawahara.: "Modelling of the perception of English sentence stress for computer-assisted language learning"Proc. ICSLP.. Vol.3. 175-178 (2000)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] C.-H. Jo, T.Kawahara, and S.Doshita.: "The use of duration similarity templates in speech rhythm training"Proc. IEEE Region 10 Conference (TENCON). 146-149 (1999)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] C.-H. Jo, T.Kawahara, and S.Doshita.: "Mora-timed speech rhythm training system using rhythm pattern templates"Proc. Int'l Conf. On Speech Processing. 129-134 (1999)
- Description
  「研究成果報告書概要(欧文)」より

2001 Fiscal Year Final Research Report Summary

Computer-Assisted Pronunciation Learning System using Speech Recognition Techniaues

Principal Investigator

KAWAHARA Tatsuya Kyoto University, Graduate School of Informatics, Associate Professor, 情報学研究科, 助教授 (00234104)

Research Products

[Publications] C.-H.Jo: "Japanese pronunciation instruction system using speech recognition methods"IEICE Trans.. E83-D,11. 1960-1968 (2000)

Description

[Publications] Y.Tsubota: "Computer-assisted english vowel learning system for Japanese speakers using cross language formant structures"Proc. Int'l Conf. Spoken Language Processing (ICSLP). 3. 56-569 (2000)

Description

[Publications] K.Imoto: "Modelling of the perception of english sentence stress for computer-assisted language learning"Proc. Int'l Conf. Spoken Language Processing (ICSLP). 3. 175-178 (2000)

Description

[Publications] A.Raux: "Optimizing computer-assisted pronunciation instruction by selecting relevant training topics"InSTIL 2002 Advanced Workshop. (2002)

Description

[Publications] Y.Tsubota: "CALL system for Japanese students of English using formant structure estimation and pronunciation error prediction"InSTIL 2002 Advanced Workshop. (2002)

Description

[Publications] 河原達也: "日本語ディクテーション基本ソフトウェア(99年度版)"日本音響学会誌. 57・3. 210-214 (2001)

Description

[Publications] 鹿野清宏: "音声認識システム"オーム社. 200 (2001)

Description

[Publications] C.-H.Jo, T.Kawahara, S.Doshita, and M.Dantsuji: "Japanese pronunciation instruction system using speech recognition methods"IEICE Trans.. Vol.E83-D, No.11. 1960-1968 (2000)

Description

[Publications] A. Raux and T. Kawahara: "Optimizing computer-assisted pronunciation instruction by selecting relevant training topics"InSTIL 2002 Advanced Workshop. (2002)

Description

[Publications] Y.Tsubota, T.Kawahara, and M.Dantsuji: "CALL system for Japanese students of English using formant structure estimation and pronunciation error prediction"InSTIL 2002 Advanced Workshop. (2002)

Description

[Publications] Y.Tsubota, M.Dantsuji, and T.Kawahara.: "Computer-assisted English vowel learning system for Japanese speakers using cross language formant structures"Proc. ICSLP. Vol.3. 566-569 (2000)

Description

[Publications] K. Imoto, M.Dantsuji, and T.Kawahara.: "Modelling of the perception of English sentence stress for computer-assisted language learning"Proc. ICSLP.. Vol.3. 175-178 (2000)

Description

[Publications] C.-H. Jo, T.Kawahara, and S.Doshita.: "The use of duration similarity templates in speech rhythm training"Proc. IEEE Region 10 Conference (TENCON). 146-149 (1999)

Description

[Publications] C.-H. Jo, T.Kawahara, and S.Doshita.: "Mora-timed speech rhythm training system using rhythm pattern templates"Proc. Int'l Conf. On Speech Processing. 129-134 (1999)

Description