2007 Fiscal Year Annual Research Report

構造不変の定理に基づく音声アフォーダンスの提案とそれに立脚した音声認識系の構築

Research Project

Project/Area Number	19024023
Research Institution	The University of Tokyo
Principal Investigator	峯松信明 The University of Tokyo, 大学院・新領域創成科学研究科, 准教授 (90273333)
Keywords	構造不変の定理 / 音声の構造的表象 / 分布間距離 / 次元分割 / 識別学習 / 音声認識 / 発音教育支援
Research Abstract	音声には年齢や性別などの非言語的な要因によって不可避的に歪みが混入する。その結果,大人と子供では音声の物理特性は大きく異なる。我々は物理的に異なる音ストリームに例えば「おはよう」という同一の情報を感覚する。従来の音声認識系では,これらの異なる音ストリームに内在する不変な情報を直接的に表象することは行なわず,ただただ音を集めて統計的にモデル化する試みが行なわれて来た。本研究では,構造不変の定理と呼ぶ数学的定理に基づく,話者やマイクに不変な音声表象を提案しており,これに基づいた音声認識系及びそのアプリケーションを構築することが目的である。本年は,パラメータの次元分割や,識別学習に基づく効率的な次元圧縮などについて検討し,また,アプリケーションという側面では発音学習応用において非常に実益性の高い成果を挙げることができた。パラメータ次元分割は,構造不変の定理が呈する「強すぎる不変性」問題を解くために導入された技術であり,制約付きでこの定理を利用する一手段を提供している。また,識別学習による次元削減はパラメータ数の増加に伴う「次元の呪い」問題を解くために導入された技術である。発音教育アプリケーションでは,学習者と教師の間の性別や年齢といった違いを回避することが可能となるため,教師を選ぶ発音教育支援,発音の様態のみに基づく(話者/性別に影響されない)学習者分類,更には,学習者の意欲向上に効果的なインタフェースの構築などを行なった。

Research Products
(15 results)

All 2008 2007 Other

All Journal Article (2 results) (of which Peer Reviewed: 2 results) Presentation (10 results) Book (1 results) Remarks (2 results)

[Journal Article] 音声の構造的表象に基づく日本語孤立母音系列を対象とした音声認識2008
- Author(s)
  村上隆夫, 峯松信明, 広瀬啓吉
- Journal Title
  
  電子情報通信学会論文誌 J91-A-2
  
  Pages: 181-192
- Peer Reviewed
[Journal Article] 音声の構造的表象に基づく英語学習者発音の音響分析2007
- Author(s)
  朝川智, 峯松信明, 広瀬啓吉
- Journal Title
  
  電子情報通信学会論文誌 J90-D-5
  
  Pages: 1249-1262
- Peer Reviewed
[Presentation] Random discriminant structure analysis for continuous Japanese vowel recognition2007
- Author(s)
  Y. Qiao, et. al.
- Organizer
  Proc. ASRU
- Place of Presentation
  Kyoto, Japan
- Year and Date
  20071200
[Presentation] ケプストラムの声道長依存性に関する幾何学的考察2007
- Author(s)
  齋藤大輔, 他
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  日本
- Year and Date
  20071200
[Presentation] 構造的表象からの音声生成に関する基礎的研究2007
- Author(s)
  齋藤大輔, 他
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  日本
- Year and Date
  20071000
[Presentation] 孤立音を聞いて音韻同定できる能力は音声言語運用に必要か?2007
- Author(s)
  峯松信明, 他
- Organizer
  日本音声学会全国大会
- Place of Presentation
  日本
- Year and Date
  20070900
[Presentation] Structural assessment of language learners' pronunciation2007
- Author(s)
  N. Minematsu, et. al.
- Organizer
  Proc. INTERSPEECH
- Place of Presentation
  Antwerp, Belguim
- Year and Date
  20070800
[Presentation] Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics2007
- Author(s)
  S. Asakawa, et. al.
- Organizer
  Proc. INTERSPEECH
- Place of Presentation
  Antwerp, Belguim
- Year and Date
  20070800
[Presentation] 要素論から全体論へ〜全体から入る音声情報処理への招待〜2007
- Author(s)
  峯松信明, 他
- Organizer
  情報処理学会音声言語情報処理研究会
- Place of Presentation
  日本
- Year and Date
  20070700
[Presentation] 孤立音[あ]を聞いて/あ/と同定する能力は音声言語に必要か?2007
- Author(s)
  峯松信明, 他
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  日本
- Year and Date
  20070700
[Presentation] 学習者による米語母音発音に対する絶対的評価と相対的評価の相関分析2007
- Author(s)
  鎌田圭, 他
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  日本
- Year and Date
  20070700
[Presentation] Development of a spoken word recognizer without phonemic awareness - Is this machine a Dyslexia simulator?-2007
- Author(s)
  N. Minematsu, et. al.
- Organizer
  Proc. The 2nd Riken Brain Science Institute and Oxford-Kobe Joint International Symposium
- Place of Presentation
  Kobe, Japan
- Year and Date
  20070400
[Book] Lecture notes of Artificial Intelligence (4914)2008
- Author(s)
  N. Minematsu and T. Nishimura
- Total Pages
  26-39
- Publisher
  Consideration of infants' vocal imitation through modeling speech as timbre-based melody
[Remarks]
- URL
  http://www.gavo.t.u-tokyo.ac.jp/~mine/japanese/paper/2007.html
[Remarks]
- URL
  http://www.gavo.t.u-tokyo.ac.jp/~mine/japanese/paper/2008.html

2007 Fiscal Year Annual Research Report

構造不変の定理に基づく音声アフォーダンスの提案とそれに立脚した音声認識系の構築

Principal Investigator

峯松 信明 The University of Tokyo, 大学院・新領域創成科学研究科, 准教授 (90273333)

Research Products

[Journal Article] 音声の構造的表象に基づく日本語孤立母音系列を対象とした音声認識2008

Author(s)

Journal Title

[Journal Article] 音声の構造的表象に基づく英語学習者発音の音響分析2007

Author(s)

Journal Title

[Presentation] Random discriminant structure analysis for continuous Japanese vowel recognition2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ケプストラムの声道長依存性に関する幾何学的考察2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 構造的表象からの音声生成に関する基礎的研究2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 孤立音を聞いて音韻同定できる能力は音声言語運用に必要か?2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Structural assessment of language learners' pronunciation2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 要素論から全体論へ 〜全体から入る音声情報処理への招待〜2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 孤立音[あ]を聞いて/あ/と同定する能力は音声言語に必要か?2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 学習者による米語母音発音に対する絶対的評価と相対的評価の相関分析2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Development of a spoken word recognizer without phonemic awareness - Is this machine a Dyslexia simulator?-2007

Author(s)

Organizer

Place of Presentation

Year and Date

[Book] Lecture notes of Artificial Intelligence (4914)2008

Author(s)

Total Pages

Publisher

[Remarks]

URL

[Remarks]

URL

峯松信明 The University of Tokyo, 大学院・新領域創成科学研究科, 准教授 (90273333)

[Presentation] 要素論から全体論へ〜全体から入る音声情報処理への招待〜2007