Constructive simulation of infants' word acquisition process based on the structural representation of speech

Publicly Offered Research

Project Area	Elucidation of neural computation for prediction and decision making: toward better human understanding and applications
Project/Area Number	24120507
Research Category	Grant-in-Aid for Scientific Research on Innovative Areas (Research in a proposed research area)
Allocation Type	Single-year Grants
Review Section	Complex systems
Research Institution	The University of Tokyo
Principal Investigator	峯松信明 (Nobuaki Minematsu), The University of Tokyo, Graduate School of Engineering, Professor (90273333)
Project Period (FY)	2012-04-01 – 2015-03-31
Project Status	Completed (Fiscal Year 2013)
Budget Amount	¥10,920,000 (Direct Cost: ¥8,400,000, Indirect Cost: ¥2,520,000); Fiscal Year 2013: ¥5,460,000 (Direct Cost: ¥4,200,000, Indirect Cost: ¥1,260,000); Fiscal Year 2012: ¥5,460,000 (Direct Cost: ¥4,200,000, Indirect Cost: ¥1,260,000)
Keywords	structural representation of speech / f-divergence / Gestalt perception / language acquisition model / simulation / language rhythm / word recognition experiment
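Outline of Annual Research Achievements	This research aims to simulate, in a constructive manner, infants' language acquisition, and in particular the word acquisition process, using the structural representation of speech. Infants' language acquisition is thought to rest on imitating the utterances of others, but infants do not mimic sounds the way a voice impersonator does. Speech carries many kinds of information, such as speaker identity and age, yet infants disregard these and imitate only the linguistic information, that is, the linguistic message. One could hypothesize that an infant represents an utterance as something like a phoneme sequence and then re-utters each phoneme in turn, but this hypothesis is inappropriate because infants' phonemic awareness is still immature. The structural representation of speech is an interesting feature extraction method that can represent an utterance as a whole in a speaker-independent way. Speech always carries a timbre bias caused by body size and age; the structural representation abstracts that bias away, and what remains can be regarded as the skeleton of the language. Word recognition systems based on the structural representation had been built before, but this research positions such a system as one technical implementation of the infant's word acquisition process. Reflecting the finding that infants are sensitive to language rhythm, we introduced the following into the matching of input speech against word models based on the structural representation: detection of maximum-sonority intervals, i.e. syllable centers, followed by intra-syllable matching and inter-syllable matching based on those centers. This improved the accuracy of the structure-based word recognition system.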
Research Progress Status	Not entered, because FY2014 (Heisei 26) is the final year.
Strategy for Future Research Activity	Not entered, because FY2014 (Heisei 26) is the final year.
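
The speaker-independence claimed in the outline above can be illustrated with a small numerical sketch. It is not the project's implementation. It assumes, for illustration only, that each speech event is modeled as a Gaussian distribution and that the distance between events is the Bhattacharyya distance, a monotone function of an f-divergence (the squared Hellinger distance). It also assumes that the speaker-dependent timbre bias can be approximated by an invertible affine transform of the feature space. Under these assumptions, the matrix of pairwise distances (the "structure") is unchanged by the transform. All function names below are hypothetical.

```python
import numpy as np


def bhattacharyya_distance(mu1, cov1, mu2, cov2):
    """Bhattacharyya distance between Gaussians N(mu1, cov1) and N(mu2, cov2)."""
    cov = 0.5 * (cov1 + cov2)
    diff = mu1 - mu2
    # Mean term: (1/8) * diff^T cov^{-1} diff
    mean_term = 0.125 * diff @ np.linalg.solve(cov, diff)
    # Covariance term: (1/2) * ln( det(cov) / sqrt(det(cov1) * det(cov2)) )
    logdet = np.linalg.slogdet(cov)[1]
    logdet1 = np.linalg.slogdet(cov1)[1]
    logdet2 = np.linalg.slogdet(cov2)[1]
    cov_term = 0.5 * (logdet - 0.5 * (logdet1 + logdet2))
    return mean_term + cov_term


def structure_matrix(means, covs):
    """Pairwise distance matrix over all events (the assumed 'structure')."""
    n = len(means)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            d = bhattacharyya_distance(means[i], covs[i], means[j], covs[j])
            dist[i, j] = dist[j, i] = d
    return dist


def apply_affine(means, covs, a, b):
    """Map every Gaussian through x -> a @ x + b (assumed speaker bias)."""
    new_means = [a @ m + b for m in means]
    new_covs = [a @ c @ a.T for c in covs]
    return new_means, new_covs


def demo(seed=0, n_events=5, dim=3):
    """Build random events, distort them, and return both structure matrices."""
    rng = np.random.default_rng(seed)
    means = [rng.normal(size=dim) for _ in range(n_events)]
    covs = []
    for _ in range(n_events):
        m = rng.normal(size=(dim, dim))
        covs.append(m @ m.T + np.eye(dim))  # symmetric positive definite
    # Upper-triangular matrix with a positive diagonal: always invertible.
    a = np.triu(rng.normal(size=(dim, dim)), k=1)
    a = a + np.diag(rng.uniform(0.5, 2.0, size=dim))
    b = rng.normal(size=dim)
    original = structure_matrix(means, covs)
    distorted = structure_matrix(*apply_affine(means, covs, a, b))
    return original, distorted


if __name__ == "__main__":
    original, distorted = demo()
    print("max abs difference:", np.max(np.abs(original - distorted)))
```

The individual means and covariances change under the transform, but every pairwise distance is preserved up to floating-point error, which is why a word model built from such a matrix does not need to be adapted to each speaker in this simplified setting.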

Report

(2 results)

2013 Annual Research Report
2012 Annual Research Report

Research Products
(12 results)

Journal Article (5 results, of which Peer Reviewed: 5) / Presentation (6 results, of which Invited: 2) / Book (1 result)

[Journal Article] Unsupervised optimal phoneme segmentation: theory and experimental evaluation (2013)
- Author(s)
  Y. Qiao, D. Luo, N. Minematsu
- Journal Title
  IEEE Trans. Systems, Man & Cybernetics
  Volume: 7 Pages: 577-586
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Automatic pronunciation clustering using a world English archive and pronunciation structure analysis (2013)
- Author(s)
  H.-P. Shen, N. Minematsu, T. Makino, S. H. Weinberger, T. Pongkittiphan, C.-H. Wu
- Journal Title
  Proc. ASRU
  Volume: 1 Pages: 222-227
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Speaker-invariant and rhythm-sensitive representation of spoken words (2013)
- Author(s)
  N. Minematsu, Y. Ozaki, K. Hirose, D. Erickson
- Journal Title
  Proc. APSIPA
  Volume: 1
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] An experimental study on dynamic features of speech structure (2012)
- Author(s)
  S. Shimizu, M. Suzuki, N. Minematsu, K. Hirose
- Journal Title
  Journal of Research Institute of Signal Processing
  Volume: 16 Pages: 319-322
- NAID
  130004457023
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Discriminative reranking for LVCSR leveraging invariant structure (2012)
- Author(s)
  M. Suzuki, G. Kurata, M. Nishimura, N. Minematsu
- Journal Title
  Proc. INTERSPEECH
  Volume: 1
- Related Report
  2012 Annual Research Report
- Peer Reviewed
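[Presentation] A robust unsupervised vocabulary acquisition system based on the structural representation of speech (音声の構造的表象による頑健な教師無し語彙獲得システム) (2014)
- Author(s)
  尾崎洋輔, 齋藤大輔, 峯松信明, 広瀬啓吉
- Organizer
  Acoustical Society of Japan, Spring Meeting (proceedings)
- Place of Presentation
  Nihon University (Tokyo)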
- Year and Date
  2014-03-10 – 2014-03-12
- Related Report
  2013 Annual Research Report
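[Presentation] Feature separation and information separation in speech signals (音声信号における特徴量分離と情報分離) (2013)
- Author(s)
  峯松信明
- Organizer
  Information Processing Society of Japan (IPSJ), Special Interest Group on Music and Computer
- Place of Presentation
  Ochanomizu University (Tokyo)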
- Year and Date
  2013-05-11 – 2013-05-12
- Related Report
  2013 Annual Research Report
- Invited
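[Presentation] A preliminary study on automatic extraction of syllable nuclei from the waveform envelope and its use in modeling the word acquisition process with the structural representation (波形包絡を用いた音節核の自動抽出とそれを用いた構造的表象による単語獲得プロセスのモデル化の初期検討) (2012)
- Author(s)
  尾崎洋輔, 峯松信明, 広瀬啓吉, Donna Erickson
- Organizer
  IEICE Technical Committee on Speech
- Place of Presentation
  Tokyo Institute of Technology (Meguro, Tokyo)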
- Year and Date
  2012-12-20
- Related Report
  2012 Annual Research Report
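[Presentation] Automatic extraction of syllable nuclei from the waveform envelope and its use in modeling the word acquisition process with the structural representation (波形包絡を用いた音節核の自動抽出とそれを用いた構造的表象による単語獲得プロセスのモデル化) (2012)
- Author(s)
  尾崎洋輔, 峯松信明, 広瀬啓吉, Donna Erickson
- Organizer
  Acoustical Society of Japan, Autumn Meeting
- Place of Presentation
  Shinshu University (Matsumoto, Nagano)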
- Year and Date
  2012-09-19
- Related Report
  2012 Annual Research Report
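[Presentation] Discriminative reranking for large-vocabulary speech recognition using the structural representation of speech (音声の構造的表象を用いた大語彙音声認識の識別的リランキング) (2012)
- Author(s)
  鈴木雅之, 倉田岳人, 西村雅史, 峯松信明, 広瀬啓吉
- Organizer
  Acoustical Society of Japan, Autumn Meeting
- Place of Presentation
  Shinshu University (Matsumoto, Nagano)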
- Year and Date
  2012-09-19
- Related Report
  2012 Annual Research Report
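[Presentation] What exactly is language? (ことばって一体何だろう？) (2012)
- Author(s)
  峯松信明
- Organizer
  Japan Federation of Private Elementary Schools (日私小連), National Summer Education Workshop, Foreign Language Section
- Place of Presentation
  Arcadia Ichigaya (Chiyoda, Tokyo)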
- Year and Date
  2012-08-21
- Related Report
  2012 Annual Research Report
- Invited
[Book] Spoken Language Processing and Natural Language Processing (音声言語処理と自然言語処理) (2013)
- Author(s)
  中川聖一, 小林聡, 峯松信明, 宇津呂武仁, 秋葉友良, 北岡教英, 山本幹雄, 甲斐充彦, 山本一公, 土屋雅稔
- Total Pages
  252
- Publisher
  Corona Publishing (コロナ社)
- Related Report
  2012 Annual Research Report
