2001 Fiscal Year Final Research Report Summary

Computer-Assisted Pronunciation Learning System using Speech Recognition Techniques

Research Project

Project/Area Number 11558037
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Single-year Grants
Section Developmental Research
Research Field Intelligent informatics
Research Institution KYOTO UNIVERSITY

Principal Investigator

KAWAHARA Tatsuya  Kyoto University, Graduate School of Informatics, Associate Professor (00234104)

Co-Investigator (Kenkyū-buntansha) KATAGIRI Shigeru  NTT Communication Science Laboratories, Executive Manager
DOSHITA Shuji  Ryukoku University, Faculty of Science and Technology, Professor (00025925)
DANTSUJI Masatake  Kyoto University, Center for Information and Multimedia Studies, Professor (10188469)
SHIMIZU Masaaki  Kyoto University, Center for Information and Multimedia Studies, Assistant Professor (10314262)
OKUNO Hiroshi  Kyoto University, Graduate School of Informatics, Professor (60318201)
Project Period (FY) 1999 – 2001
Keywords speech processing / language learning / CALL / speech recognition / phonology / prosody
Research Abstract

A Computer-Assisted Language Learning (CALL) system focusing on pronunciation training is studied for Japanese students learning English.
First, we model the typical English pronunciation errors of Japanese learners and design a system that uses speech recognition technologies to detect pronunciation errors and generate effective instruction. For a given training text, a network of error candidates is generated so that speech recognition can align the utterance and detect errors. Then, a segment-input pair-wise classifier is applied for verification. This method realizes reliable error detection and effective instruction based on articulatory information.
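
The idea of a network of error candidates can be illustrated with a minimal sketch. It is not the system described in the report: the confusion table `ERROR_CANDIDATES`, the phoneme labels, and the function names are all hypothetical, and the recognizer's choice of path through the network is replaced by an already aligned phoneme sequence supplied by hand.

```python
from itertools import product

# Hypothetical confusion table (an assumption for illustration only):
# each target phoneme maps to substitutions a learner might produce.
ERROR_CANDIDATES = {
    "r": ["l"],
    "l": ["r"],
    "th": ["s"],
    "v": ["b"],
}


def build_error_network(phonemes):
    """Return one slot per target phoneme. Each slot lists the correct
    phoneme first, followed by its error candidates."""
    return [[p] + ERROR_CANDIDATES.get(p, []) for p in phonemes]


def enumerate_paths(network):
    """Expand the network into every phoneme sequence it accepts."""
    return [list(path) for path in product(*network)]


def detect_errors(target, recognized):
    """Compare an aligned recognized sequence with the target and report
    (position, expected, actual) for each mismatch."""
    return [
        (i, t, r)
        for i, (t, r) in enumerate(zip(target, recognized))
        if t != r
    ]


target = ["r", "ay", "t"]            # hypothetical transcription of "right"
network = build_error_network(target)
paths = enumerate_paths(network)     # sequences the recognizer may choose from
errors = detect_errors(target, ["l", "ay", "t"])
print(network)
print(errors)
```

In a real recognizer the acoustic models would select the best-scoring path through such a network. This sketch covers substitutions only; insertions and deletions would need extra arcs.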
Then, we develop a computer-assisted English prosody learning system. Learners' pronunciation is evaluated by automatically detecting sentence-stressed syllables and foot durations. Syllable HMMs are categorized based on error patterns of stress. We also propose a multi-stage discrimination method that reflects native speakers' perception. Furthermore, foot templates are constructed from a native speech database in order to evaluate stress timing.
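
As a rough illustration of evaluating stress timing against a template, the sketch below measures foot durations and scores their deviation from a reference. It assumes that a foot runs from the onset of one stressed syllable to the onset of the next, and that a template can be reduced to a mean and standard deviation. The report does not specify its templates or scoring, so the functions, the data, and the template values here are hypothetical.

```python
def foot_durations(syllables):
    """syllables: list of (onset_seconds, is_stressed) in time order.
    A foot is taken to span one stressed-syllable onset to the next."""
    onsets = [t for t, stressed in syllables if stressed]
    return [b - a for a, b in zip(onsets, onsets[1:])]


def timing_deviation(durations, template_mean, template_std):
    """Mean absolute z-score of foot durations against a template.
    Lower values mean timing closer to the (assumed) native template."""
    if not durations:
        return 0.0
    total = sum(abs(d - template_mean) / template_std for d in durations)
    return total / len(durations)


# Hypothetical learner utterance: syllable onsets with stress flags.
learner = [
    (0.00, True), (0.20, False),
    (0.50, True), (0.65, False), (0.80, False),
    (1.30, True),
]
feet = foot_durations(learner)
score = timing_deviation(feet, template_mean=0.5, template_std=0.1)
print(feet, score)
```

In stress-timed speech the feet tend toward similar durations regardless of how many unstressed syllables they contain, so a large deviation on the longer second foot would be the kind of pattern such a score is meant to expose.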
Finally, we study how to estimate non-native speakers' intelligibility and to determine which pronunciation errors affect it the most. A preliminary study showed that error rates computed by a speech recognition-based system can be used to characterize intelligibility. We use the error rate distributions to assess a student's intelligibility and compute a priority function to find which areas of study are most likely to improve it.
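
The abstract does not define the priority function, so the following is only one plausible shape for it, not the authors' formula. It assumes each error category has a learner error rate and a weight expressing how strongly that category is believed to hurt intelligibility. The category names, rates, and weights are invented for illustration.

```python
def prioritize(error_rates, impact_weights):
    """Rank error categories by error rate times assumed impact.
    Both inputs map a category name to a value in [0, 1]."""
    scores = {
        category: rate * impact_weights.get(category, 0.0)
        for category, rate in error_rates.items()
    }
    # Highest score first: the category most worth practising next.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical learner profile and impact weights (assumptions).
error_rates = {
    "r/l substitution": 0.40,
    "vowel insertion": 0.25,
    "th/s substitution": 0.60,
}
impact_weights = {
    "r/l substitution": 0.9,
    "vowel insertion": 0.8,
    "th/s substitution": 0.3,
}
ranking = prioritize(error_rates, impact_weights)
for category, score in ranking:
    print(category, round(score, 2))
```

With these invented numbers the most frequent error is ranked last because its assumed impact is low, which shows why a priority function can differ from a plain ranking by error rate.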

  • Research Products

    (14 results)

All Publications (14 results)

  • [Publications] C.-H.Jo: "Japanese pronunciation instruction system using speech recognition methods"IEICE Trans.. E83-D,11. 1960-1968 (2000)

    • Description
      From the "Summary of Research Results Report (Japanese)"
  • [Publications] Y.Tsubota: "Computer-assisted English vowel learning system for Japanese speakers using cross language formant structures"Proc. Int'l Conf. Spoken Language Processing (ICSLP). 3. 566-569 (2000)

    • Description
      From the "Summary of Research Results Report (Japanese)"
  • [Publications] K.Imoto: "Modelling of the perception of English sentence stress for computer-assisted language learning"Proc. Int'l Conf. Spoken Language Processing (ICSLP). 3. 175-178 (2000)

    • Description
      From the "Summary of Research Results Report (Japanese)"
  • [Publications] A.Raux: "Optimizing computer-assisted pronunciation instruction by selecting relevant training topics"InSTIL 2002 Advanced Workshop. (2002)

    • Description
      From the "Summary of Research Results Report (Japanese)"
  • [Publications] Y.Tsubota: "CALL system for Japanese students of English using formant structure estimation and pronunciation error prediction"InSTIL 2002 Advanced Workshop. (2002)

    • Description
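      From the "Summary of Research Results Report (Japanese)"
  • [Publications] T.Kawahara: "Basic software for Japanese dictation (FY1999 version)"Journal of the Acoustical Society of Japan. 57,3. 210-214 (2001)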

    • Description
      From the "Summary of Research Results Report (Japanese)"
  • [Publications] K.Shikano: "Speech Recognition Systems"Ohmsha. 200 (2001)

    • Description
      From the "Summary of Research Results Report (Japanese)"
  • [Publications] C.-H.Jo, T.Kawahara, S.Doshita, and M.Dantsuji: "Japanese pronunciation instruction system using speech recognition methods"IEICE Trans.. Vol.E83-D, No.11. 1960-1968 (2000)

    • Description
      From the "Summary of Research Results Report (Western-language version)"
  • [Publications] A. Raux and T. Kawahara: "Optimizing computer-assisted pronunciation instruction by selecting relevant training topics"InSTIL 2002 Advanced Workshop. (2002)

    • Description
      From the "Summary of Research Results Report (Western-language version)"
  • [Publications] Y.Tsubota, T.Kawahara, and M.Dantsuji: "CALL system for Japanese students of English using formant structure estimation and pronunciation error prediction"InSTIL 2002 Advanced Workshop. (2002)

    • Description
      From the "Summary of Research Results Report (Western-language version)"
  • [Publications] Y.Tsubota, M.Dantsuji, and T.Kawahara.: "Computer-assisted English vowel learning system for Japanese speakers using cross language formant structures"Proc. ICSLP. Vol.3. 566-569 (2000)

    • Description
      From the "Summary of Research Results Report (Western-language version)"
  • [Publications] K. Imoto, M.Dantsuji, and T.Kawahara.: "Modelling of the perception of English sentence stress for computer-assisted language learning"Proc. ICSLP.. Vol.3. 175-178 (2000)

    • Description
      From the "Summary of Research Results Report (Western-language version)"
  • [Publications] C.-H. Jo, T.Kawahara, and S.Doshita.: "The use of duration similarity templates in speech rhythm training"Proc. IEEE Region 10 Conference (TENCON). 146-149 (1999)

    • Description
      From the "Summary of Research Results Report (Western-language version)"
  • [Publications] C.-H. Jo, T.Kawahara, and S.Doshita.: "Mora-timed speech rhythm training system using rhythm pattern templates"Proc. Int'l Conf. On Speech Processing. 129-134 (1999)

    • Description
      From the "Summary of Research Results Report (Western-language version)"

Published: 2003-09-17  
