2007 Fiscal Year Final Research Report Summary

Support of Foreign Language Pronunciation Training of Primary School Students Based on Comprehensive Description of the Pronunciation

Research Project

Project/Area Number	17300261
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	Educational technology
Research Institution	The University of Tokyo
Principal Investigator	MINEMATSU Nobuaki The University of Tokyo, Graduate School of Frontier Sciences, Associate Professor (90273333)
Co-Investigator(Kenkyū-buntansha)	HIROSE Keikichi The University of Tokyo, Graduate School of Information Science and Technology, Professor (50111472) HARADA Yasunari Waseda University, School of Law, Professor (80189711) YAMAUCHI Yutaka Tokyo International University, School of Business and Commerce, Associate Professor (30306245) KOCHIYAMA Akiko Chubu University, Department of Humanity and Social Sciences, Associate Professor (80350990) MAKINO Takehiko Chuo University, Faculty of Economics, Associate Professor (00269482)
Project Period (FY)	2005 – 2007
Keywords	Foreign Language Learning / Structural Representation of Speech / Structural Phonology / Pronunciation Portfolio / CALL system / Speech recognition
Research Abstract	The Ministry of Education, Culture, Sports, Science and Technology in Japan announced that English education will be introduced into primary schools in 2011. This means that the number of Japanese students of English will be drastically increased in 2011 but it is a fact that the number of English teachers is not sufficient at all for those students. In this study, to solve this problem a new technique was built for supporting young students' learning English and assessing their pronunciation. Children's voices are very difficult to process adequately with the current speech technology. For example, if an assessment system is built with adult speech samples, the system cannot deal with children's voices adequately due to a large acoustic difference between adults' voices and children's voices. If a large number of samples of children voices are available, a good system is possible but the recording with young children is a very heavy task Speaker adaptation technology can be used to ad … More apt a system for adults into a system for a specific young child (student). But in this case, bad pronunciation may be judged as good because of over-adaptation to the specific young student. To solve these problems completely, we proposed a new speech technique of representing an utterance through removing the acoustic features showing the age and gender of the speaker. In the proposed method, only the timbre contrasts were extracted from speech events, where a contrast was measured as Bhattacharyya distance because the distance is completely invariant with any kind of linear or non-linear transformation. Speaker differences can be described as acoustic transformation of voices and then, the proposed representation is speaker-invariant. Using this new structural representation, a system of assessing English vowels produced by students of any age was built. The system has four functions. 1) recording or logging vowel system changes of individual students, caused by training. 2) classification of learners purely based on pronunciation variation, irrespective of age and gender, 3) generation of instructions on which vowels to correct at first, and 4) very motivating user-interface for pronunciation training. During a three-year period, a proto-type system was tested and evaluated in many locations, such as high schools, junior high schools and primary schools. Then, over 500 students aging from 3 to 70 joined our pronunciation test. The analysis results showed the very high validity of the proposed method and system. Further, using the data, we classified over-500 students based on their pronunciations, irrespective of age and gender, and we defined 5 typical Japanese pronunciations of English. Less

Research Products
(30 results)

All 2008 2007 2006 2005

All Journal Article (21 results) (of which Peer Reviewed: 6 results) Presentation (8 results) Book (1 results)

[Journal Article] 音声の構造的表象に基づく日本語孤立母音系列を対象とした音声認識2008
- Author(s)
  村上隆夫, 峯松信明, 広瀬啓吉
- Journal Title
  
  電子情報通信学会論文誌 Vol. J91-A, No. 2
  
  Pages: 181-191
- Description
  「研究成果報告書概要(和文)」より
- Peer Reviewed
[Journal Article] Recognition of isolated utterances of Japanese vowel sequences based on structural representation of speech2008
- Author(s)
  T. Murakami, K. Maruyama, N. Minematsu, K. Hirose
- Journal Title
  
  IEICE Trans Vol. J91-D, No. 2
  
  Pages: 181-191
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Consideration of infants' vocal imitation through modeling speech as timbre-based melody2008
- Author(s)
  N. Minematsu, T. Nishimura
- Journal Title
  
  Included in "New frontiers in Artificial Intelligence" LNAI4914
  
  Pages: 26-39
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] 音声の構造的表象に基づく英語学習者発音の音響的分析2007
- Author(s)
  朝川智, 峯松信明, 広瀬啓吉
- Journal Title
  
  電子情報通信学会論文誌 Vol. J90-D, No. 5
  
  Pages: 1249-1262
- Description
  「研究成果報告書概要(和文)」より
- Peer Reviewed
[Journal Article] Are learners myna birds to the averaged distributions of native speakers? -a note of warning from a serious speech engineer-2007
- Author(s)
  N. Minematsu
- Journal Title
  
  Proc. ISCA Workshop on Speech and Language Technology in Education (CD-ROM)
- Description
  「研究成果報告書概要(和文)」より
- Peer Reviewed
[Journal Article] Structural representation of the pronunciation and its use for classifying Japanese learners of English2007
- Author(s)
  N. Minematsu, K. Kamata, S. Asakawa, T. Makino, K. Hirose
- Journal Title
  
  Proc. ISCA Workshop on Speech and Language Technology in Education (CD-ROM)
- Description
  「研究成果報告書概要(和文)」より
- Peer Reviewed
[Journal Article] Structural assessment of language learners' pronunciation2007
- Author(s)
  N. Minematsu, K. Kamata, S. Asakawa, T. Makino, T. Nishimura, K. Hiorse
- Journal Title
  
  Proc. InterSpeech
  
  Pages: 210-213
- Description
  「研究成果報告書概要(和文)」より
- Peer Reviewed
[Journal Article] Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics2007
- Author(s)
  S. Asakawa, N. Minematsu, K. Hirose
- Journal Title
  
  Proc. InterSpeech
  
  Pages: 890-893
- Description
  「研究成果報告書概要(和文)」より
- Peer Reviewed
[Journal Article] 音声の構造的表象に基づく発音矯正必要度の計算手法の検討2007
- Author(s)
  鎌田圭, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
- Journal Title
  
  信学技報SP2007-30
  
  Pages: 37-42
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Acoustic analysis of the pronunciation of Japanese learners of English based on structural representation of speech2007
- Author(s)
  S. Asakawa, N. Minematsu, K. Hirose
- Journal Title
  
  IEICT Trans. V01. J90-D, No. 5
  
  Pages: 1249-1262
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Structural assessment of language learners' pronunciation2007
- Author(s)
  N. Minematsu, K. Kamata, S. Asakawa, T. Makino, T. Nishimura, K. Hirose
- Journal Title
  
  Proc. InterSpeech
  
  Pages: 210-213
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Are learners myna birds to the averaged distributions of native speakers? - a note of warning from a serious speech engineer -2007
- Author(s)
  N. Minematsu
- Journal Title
  
  Proc. Int. Workshop on Speech and Lan- guage Technology in Education (CD-ROM)
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Structural representation of the pronunciation and its use for classifying Japanese learners of English2007
- Author(s)
  N. Minematsu, K. Kamata, S. Asakawa, T. Makino, K. Hirose
- Journal Title
  
  Proc. Int. Workshop on Speech and Lan- guage Technology in Education (CD-ROM)
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Theorem of the invariant structure and its derivation of speech Gestalt2006
- Author(s)
  N. Minematsu, T. Nishimura, K. Nishinara, K. Sakuraba
- Journal Title
  
  Proc. Int. Workshop on Speech Recognition and Intrinsic Variations
  
  Pages: 47-52
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Structural representation of the pronunciation and its use for CALL2006
- Author(s)
  N. Minematsu, S. Asakawa, K. Hirose
- Journal Title
  
  Proc. Int. Workshop on Spoken Language Technology
  
  Pages: 126-129
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Speech recognition onb with supra-segmental features - hearing speech as music -2006
- Author(s)
  N. Minematsu, T. Nishimura, T. Murakami, K. Hirose
- Journal Title
  
  Proc. Int. Conf. on Speech Prosody
  
  Pages: 589-594
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Mathematical evidence of the acoustic universal structure2005
- Author(s)
  N. Minematsu
- Journal Title
  
  Proc. ICASSP
  
  Pages: 5734-5737
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Structural representation of the pronunciation and its use in the pronunciation training2005
- Author(s)
  N. Minematsu, S. Asakawa, K. Hirose, T. Makino
- Journal Title
  
  Proc. Workshop on Phonetics Teaching and Learning (CD-ROM)
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Japanese vowel recognition based on structural representation of speech2005
- Author(s)
  T. Murakami, K. Maruyama, N. Minematsu, K. Hirose
- Journal Title
  
  Proc. EuroSpeech
  
  Pages: 1261-1264
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Structural representatior of the non-native pronunciations2005
- Author(s)
  S. Asakawa, N. Minematsu, T. I. Jaakkola, K. Hirose
- Journal Title
  
  Proc. EuroSpeech
  
  Pages: 165-168
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Japanese vowel recognition using external structure of speech2005
- Author(s)
  T. Murakami, K. Maruyama, N. Minematsu, K. Hirose
- Journal Title
  
  Proc. IEEE Automatic Speech Recognition and Understanding Workshop
  
  Pages: 203-208
- Description
  「研究成果報告書概要(欧文)」より
[Presentation] 大規模英語学習者を対象とした音声の構造的表象に基づく発音評価とその応用2008
- Author(s)
  高澤真章, 鎌田圭, 竹内京子, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
- Organizer
  日本音響学会春季全国大会
- Place of Presentation
  千葉工大
- Year and Date
  20080300
- Description
  「研究成果報告書概要(和文)」より
[Presentation] 音声の構造的表象に基づく英語発音分析結果の視覚化に対する一考察2008
- Author(s)
  鎌田圭, 高澤真章, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
- Organizer
  日本音響学会春季全国大会
- Place of Presentation
  千葉工大
- Year and Date
  20080300
- Description
  「研究成果報告書概要(和文)」より
[Presentation] 大規模英語学習者を対象とした音声の構造的表象に基づく発音分類とその応用2008
- Author(s)
  鎌田圭, 高澤真章, 竹内京子, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
- Organizer
  情報処理学会全国大会
- Place of Presentation
  筑波大学
- Year and Date
  20080300
- Description
  「研究成果報告書概要(和文)」より
[Presentation] 発音の構造的表象に基づく母音矯正度推定の高精度化2007
- Author(s)
  鎌田圭, 朝川智, 峯松信明, 牧野武彦, 広瀬啓吉
- Organizer
  日本音響学会秋季全国大会
- Place of Presentation
  山梨大学
- Year and Date
  20070900
- Description
  「研究成果報告書概要(和文)」より
[Presentation] 音声の構造的表象を用いた音声認識における特徴量空間分割とその効果2007
- Author(s)
  朝川智, 峯松信明, 広瀬啓吉
- Organizer
  日本音響学会秋季全国大会
- Place of Presentation
  山梨大学
- Year and Date
  20070900
- Description
  「研究成果報告書概要(和文)」より
[Presentation] Structural representation of the pronunciation and its application to computer-aided language learning2006
- Author(s)
  N. Minematsu
- Organizer
  ASA & ASJ Joint meeting
- Place of Presentation
  Honolulu
- Year and Date
  20061200
- Description
  「研究成果報告書概要(欧文)」より
[Presentation] Universal and invariant representation of speech2006
- Author(s)
  N. Minematsu, T. Nishimura
- Organizer
  Int. Conf. on Infant Study
- Place of Presentation
  Kyoto
- Year and Date
  20060600
- Description
  「研究成果報告書概要(欧文)」より
[Presentation] Structural representation of individual learners2005
- Author(s)
  N. Minematsu
- Organizer
  ASSTA Research Workshop of Assessing Spoken Language Proficiency
- Place of Presentation
  Australia
- Year and Date
  20050800
- Description
  「研究成果報告書概要(欧文)」より
[Book] Included in“New Frontiers in Artificial Intelligence", LNAI49142008
- Author(s)
  N. Minematsu, T. Nishimura
- Total Pages
  14
- Publisher
  Consideration of infants' vocal imitation through modeling speech as timbre-based melody
- Description
  「研究成果報告書概要(和文)」より

2007 Fiscal Year Final Research Report Summary

Support of Foreign Language Pronunciation Training of Primary School Students Based on Comprehensive Description of the Pronunciation

Principal Investigator

MINEMATSU Nobuaki The University of Tokyo, Graduate School of Frontier Sciences, Associate Professor (90273333)

Research Products

[Journal Article] 音声の構造的表象に基づく日本語孤立母音系列を対象とした音声認識2008

Author(s)

Journal Title

Description

[Journal Article] Recognition of isolated utterances of Japanese vowel sequences based on structural representation of speech2008

Author(s)

Journal Title

Description

[Journal Article] Consideration of infants' vocal imitation through modeling speech as timbre-based melody2008

Author(s)

Journal Title

Description

[Journal Article] 音声の構造的表象に基づく英語学習者発音の音響的分析2007

Author(s)

Journal Title

Description

[Journal Article] Are learners myna birds to the averaged distributions of native speakers? -a note of warning from a serious speech engineer-2007

Author(s)

Journal Title

Description

[Journal Article] Structural representation of the pronunciation and its use for classifying Japanese learners of English2007

Author(s)

Journal Title

Description

[Journal Article] Structural assessment of language learners' pronunciation2007

Author(s)

Journal Title

Description

[Journal Article] Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics2007

Author(s)

Journal Title

Description

[Journal Article] 音声の構造的表象に基づく発音矯正必要度の計算手法の検討2007

Author(s)

Journal Title

Description

[Journal Article] Acoustic analysis of the pronunciation of Japanese learners of English based on structural representation of speech2007

Author(s)

Journal Title

Description

[Journal Article] Structural assessment of language learners' pronunciation2007

Author(s)

Journal Title

Description

[Journal Article] Are learners myna birds to the averaged distributions of native speakers? - a note of warning from a serious speech engineer -2007

Author(s)

Journal Title

Description

[Journal Article] Structural representation of the pronunciation and its use for classifying Japanese learners of English2007

Author(s)

Journal Title

Description

[Journal Article] Theorem of the invariant structure and its derivation of speech Gestalt2006

Author(s)

Journal Title

Description

[Journal Article] Structural representation of the pronunciation and its use for CALL2006

Author(s)

Journal Title

Description

[Journal Article] Speech recognition onb with supra-segmental features - hearing speech as music -2006

Author(s)

Journal Title

Description

[Journal Article] Mathematical evidence of the acoustic universal structure2005

Author(s)

Journal Title

Description

[Journal Article] Structural representation of the pronunciation and its use in the pronunciation training2005

Author(s)

Journal Title

Description

[Journal Article] Japanese vowel recognition based on structural representation of speech2005

Author(s)

Journal Title