• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2007 Fiscal Year Final Research Report Summary

Support of Foreign Language Pronunciation Training of Primary School Students Based on Comprehensive Description of the Pronunciation

Research Project

Project/Area Number 17300261
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Educational technology
Research InstitutionThe University of Tokyo

Principal Investigator

MINEMATSU Nobuaki  The University of Tokyo, Graduate School of Frontier Sciences, Associate Professor (90273333)

Co-Investigator(Kenkyū-buntansha) HIROSE Keikichi  The University of Tokyo, Graduate School of Information Science and Technology, Professor (50111472)
HARADA Yasunari  Waseda University, School of Law, Professor (80189711)
YAMAUCHI Yutaka  Tokyo International University, School of Business and Commerce, Associate Professor (30306245)
KOCHIYAMA Akiko  Chubu University, Department of Humanity and Social Sciences, Associate Professor (80350990)
MAKINO Takehiko  Chuo University, Faculty of Economics, Associate Professor (00269482)
Project Period (FY) 2005 – 2007
KeywordsForeign Language Learning / Structural Representation of Speech / Structural Phonology / Pronunciation Portfolio / CALL system / Speech recognition
Research Abstract

The Ministry of Education, Culture, Sports, Science and Technology in Japan announced that English education will be introduced into primary schools in 2011. This means that the number of Japanese students of English will be drastically increased in 2011 but it is a fact that the number of English teachers is not sufficient at all for those students. In this study, to solve this problem a new technique was built for supporting young students' learning English and assessing their pronunciation. Children's voices are very difficult to process adequately with the current speech technology. For example, if an assessment system is built with adult speech samples, the system cannot deal with children's voices adequately due to a large acoustic difference between adults' voices and children's voices. If a large number of samples of children voices are available, a good system is possible but the recording with young children is a very heavy task Speaker adaptation technology can be used to ad … More apt a system for adults into a system for a specific young child (student). But in this case, bad pronunciation may be judged as good because of over-adaptation to the specific young student. To solve these problems completely, we proposed a new speech technique of representing an utterance through removing the acoustic features showing the age and gender of the speaker. In the proposed method, only the timbre contrasts were extracted from speech events, where a contrast was measured as Bhattacharyya distance because the distance is completely invariant with any kind of linear or non-linear transformation. Speaker differences can be described as acoustic transformation of voices and then, the proposed representation is speaker-invariant. Using this new structural representation, a system of assessing English vowels produced by students of any age was built. The system has four functions. 1) recording or logging vowel system changes of individual students, caused by training. 2) classification of learners purely based on pronunciation variation, irrespective of age and gender, 3) generation of instructions on which vowels to correct at first, and 4) very motivating user-interface for pronunciation training. During a three-year period, a proto-type system was tested and evaluated in many locations, such as high schools, junior high schools and primary schools. Then, over 500 students aging from 3 to 70 joined our pronunciation test.
The analysis results showed the very high validity of the proposed method and system. Further, using the data, we classified over-500 students based on their pronunciations, irrespective of age and gender, and we defined 5 typical Japanese pronunciations of English. Less

  • Research Products

    (30 results)

All 2008 2007 2006 2005

All Journal Article (21 results) (of which Peer Reviewed: 6 results) Presentation (8 results) Book (1 results)

  • [Journal Article] 音声の構造的表象に基づく日本語孤立母音系列を対象とした音声認識2008

    • Author(s)
      村上隆夫, 峯松信明, 広瀬啓吉
    • Journal Title

      電子情報通信学会論文誌 Vol. J91-A, No. 2

      Pages: 181-191

    • Description
      「研究成果報告書概要(和文)」より
    • Peer Reviewed
  • [Journal Article] Recognition of isolated utterances of Japanese vowel sequences based on structural representation of speech2008

    • Author(s)
      T. Murakami, K. Maruyama, N. Minematsu, K. Hirose
    • Journal Title

      IEICE Trans Vol. J91-D, No. 2

      Pages: 181-191

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Consideration of infants' vocal imitation through modeling speech as timbre-based melody2008

    • Author(s)
      N. Minematsu, T. Nishimura
    • Journal Title

      Included in "New frontiers in Artificial Intelligence" LNAI4914

      Pages: 26-39

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] 音声の構造的表象に基づく英語学習者発音の音響的分析2007

    • Author(s)
      朝川智, 峯松信明, 広瀬啓吉
    • Journal Title

      電子情報通信学会論文誌 Vol. J90-D, No. 5

      Pages: 1249-1262

    • Description
      「研究成果報告書概要(和文)」より
    • Peer Reviewed
  • [Journal Article] Are learners myna birds to the averaged distributions of native speakers? -a note of warning from a serious speech engineer-2007

    • Author(s)
      N. Minematsu
    • Journal Title

      Proc. ISCA Workshop on Speech and Language Technology in Education (CD-ROM)

    • Description
      「研究成果報告書概要(和文)」より
    • Peer Reviewed
  • [Journal Article] Structural representation of the pronunciation and its use for classifying Japanese learners of English2007

    • Author(s)
      N. Minematsu, K. Kamata, S. Asakawa, T. Makino, K. Hirose
    • Journal Title

      Proc. ISCA Workshop on Speech and Language Technology in Education (CD-ROM)

    • Description
      「研究成果報告書概要(和文)」より
    • Peer Reviewed
  • [Journal Article] Structural assessment of language learners' pronunciation2007

    • Author(s)
      N. Minematsu, K. Kamata, S. Asakawa, T. Makino, T. Nishimura, K. Hiorse
    • Journal Title

      Proc. InterSpeech

      Pages: 210-213

    • Description
      「研究成果報告書概要(和文)」より
    • Peer Reviewed
  • [Journal Article] Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics2007

    • Author(s)
      S. Asakawa, N. Minematsu, K. Hirose
    • Journal Title

      Proc. InterSpeech

      Pages: 890-893

    • Description
      「研究成果報告書概要(和文)」より
    • Peer Reviewed
  • [Journal Article] 音声の構造的表象に基づく発音矯正必要度の計算手法の検討2007

    • Author(s)
      鎌田圭, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
    • Journal Title

      信学技報SP2007-30

      Pages: 37-42

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Acoustic analysis of the pronunciation of Japanese learners of English based on structural representation of speech2007

    • Author(s)
      S. Asakawa, N. Minematsu, K. Hirose
    • Journal Title

      IEICT Trans. V01. J90-D, No. 5

      Pages: 1249-1262

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Structural assessment of language learners' pronunciation2007

    • Author(s)
      N. Minematsu, K. Kamata, S. Asakawa, T. Makino, T. Nishimura, K. Hirose
    • Journal Title

      Proc. InterSpeech

      Pages: 210-213

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Are learners myna birds to the averaged distributions of native speakers? - a note of warning from a serious speech engineer -2007

    • Author(s)
      N. Minematsu
    • Journal Title

      Proc. Int. Workshop on Speech and Lan- guage Technology in Education (CD-ROM)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Structural representation of the pronunciation and its use for classifying Japanese learners of English2007

    • Author(s)
      N. Minematsu, K. Kamata, S. Asakawa, T. Makino, K. Hirose
    • Journal Title

      Proc. Int. Workshop on Speech and Lan- guage Technology in Education (CD-ROM)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Theorem of the invariant structure and its derivation of speech Gestalt2006

    • Author(s)
      N. Minematsu, T. Nishimura, K. Nishinara, K. Sakuraba
    • Journal Title

      Proc. Int. Workshop on Speech Recognition and Intrinsic Variations

      Pages: 47-52

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Structural representation of the pronunciation and its use for CALL2006

    • Author(s)
      N. Minematsu, S. Asakawa, K. Hirose
    • Journal Title

      Proc. Int. Workshop on Spoken Language Technology

      Pages: 126-129

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Speech recognition onb with supra-segmental features - hearing speech as music -2006

    • Author(s)
      N. Minematsu, T. Nishimura, T. Murakami, K. Hirose
    • Journal Title

      Proc. Int. Conf. on Speech Prosody

      Pages: 589-594

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Mathematical evidence of the acoustic universal structure2005

    • Author(s)
      N. Minematsu
    • Journal Title

      Proc. ICASSP

      Pages: 5734-5737

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Structural representation of the pronunciation and its use in the pronunciation training2005

    • Author(s)
      N. Minematsu, S. Asakawa, K. Hirose, T. Makino
    • Journal Title

      Proc. Workshop on Phonetics Teaching and Learning (CD-ROM)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Japanese vowel recognition based on structural representation of speech2005

    • Author(s)
      T. Murakami, K. Maruyama, N. Minematsu, K. Hirose
    • Journal Title

      Proc. EuroSpeech

      Pages: 1261-1264

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Structural representatior of the non-native pronunciations2005

    • Author(s)
      S. Asakawa, N. Minematsu, T. I. Jaakkola, K. Hirose
    • Journal Title

      Proc. EuroSpeech

      Pages: 165-168

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Japanese vowel recognition using external structure of speech2005

    • Author(s)
      T. Murakami, K. Maruyama, N. Minematsu, K. Hirose
    • Journal Title

      Proc. IEEE Automatic Speech Recognition and Understanding Workshop

      Pages: 203-208

    • Description
      「研究成果報告書概要(欧文)」より
  • [Presentation] 大規模英語学習者を対象とした音声の構造的表象に基づく発音評価とその応用2008

    • Author(s)
      高澤真章, 鎌田圭, 竹内京子, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
    • Organizer
      日本音響学会春季全国大会
    • Place of Presentation
      千葉工大
    • Year and Date
      20080300
    • Description
      「研究成果報告書概要(和文)」より
  • [Presentation] 音声の構造的表象に基づく英語発音分析結果の視覚化に対する一考察2008

    • Author(s)
      鎌田圭, 高澤真章, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
    • Organizer
      日本音響学会春季全国大会
    • Place of Presentation
      千葉工大
    • Year and Date
      20080300
    • Description
      「研究成果報告書概要(和文)」より
  • [Presentation] 大規模英語学習者を対象とした音声の構造的表象に基づく発音分類とその応用2008

    • Author(s)
      鎌田圭, 高澤真章, 竹内京子, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
    • Organizer
      情報処理学会全国大会
    • Place of Presentation
      筑波大学
    • Year and Date
      20080300
    • Description
      「研究成果報告書概要(和文)」より
  • [Presentation] 発音の構造的表象に基づく母音矯正度推定の高精度化2007

    • Author(s)
      鎌田圭, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
    • Organizer
      日本音響学会秋季全国大会
    • Place of Presentation
      山梨大学
    • Year and Date
      20070900
    • Description
      「研究成果報告書概要(和文)」より
  • [Presentation] 音声の構造的表象を用いた音声認識における特徴量空間分割とその効果2007

    • Author(s)
      朝川智, 峯松信明, 広瀬啓吉
    • Organizer
      日本音響学会秋季全国大会
    • Place of Presentation
      山梨大学
    • Year and Date
      20070900
    • Description
      「研究成果報告書概要(和文)」より
  • [Presentation] Structural representation of the pronunciation and its application to computer-aided language learning2006

    • Author(s)
      N. Minematsu
    • Organizer
      ASA & ASJ Joint meeting
    • Place of Presentation
      Honolulu
    • Year and Date
      20061200
    • Description
      「研究成果報告書概要(欧文)」より
  • [Presentation] Universal and invariant representation of speech2006

    • Author(s)
      N. Minematsu, T. Nishimura
    • Organizer
      Int. Conf. on Infant Study
    • Place of Presentation
      Kyoto
    • Year and Date
      20060600
    • Description
      「研究成果報告書概要(欧文)」より
  • [Presentation] Structural representation of individual learners2005

    • Author(s)
      N. Minematsu
    • Organizer
      ASSTA Research Workshop of Assessing Spoken Language Proficiency
    • Place of Presentation
      Australia
    • Year and Date
      20050800
    • Description
      「研究成果報告書概要(欧文)」より
  • [Book] Included in“New Frontiers in Artificial Intelligence", LNAI49142008

    • Author(s)
      N. Minematsu, T. Nishimura
    • Total Pages
      14
    • Publisher
      Consideration of infants' vocal imitation through modeling speech as timbre-based melody
    • Description
      「研究成果報告書概要(和文)」より

URL: 

Published: 2010-06-09  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi