2011 Fiscal Year Final Research Report

A perceptual model of speech based on real-time speaker adaptation

Research Project

Project/Area Number	21700282
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Single-year Grants
Research Field	Cognitive science
Research Institution	Tohoku Institute of Technology (2010-2011) Tohoku University (2009)
Principal Investigator	ITO Masashi 東北工業大学, 知能エレクトロニクス学科, 講師 (00436164)
Project Period (FY)	2009 – 2011
Keywords	音声 / 話者適応 / 認知モデル / 知覚実験
Research Abstract	Perceptual experiments indicated that speakers of different vowels could be correctly identified with accuracy of more than 80%. Analyzing speech signals uttered by 632 speakers, a new analysis method was proposed on the basis of the sinusoidal representation of speech signal. Further, cosine expansion of speech spectra and the quadratic combination of their coefficients were shown to be effective features for vowel perception. The result supports the hypothesis that perceptual features for vowel might be extracted by two-step synaptic combination in auditory periphery.

Research Products
(10 results)

All 2012 2011 2010 2009

All Journal Article (3 results) Presentation (7 results)

[Journal Article] 局所変化率変換と時間軸変換に基づく有声音声の正弦波モデル2010
- Author(s)
  伊藤仁, 伊藤彰則
- Journal Title
  
  電子情報通信学会論文誌
  
  Volume: Vol.J93-D(9) Pages: 1745-1754
[Journal Article] A source-filter separation for non-stationary voiced speech based on sinusoidal representation2009
- Author(s)
  Ito, M., Ohara, K., Ito, A., and Yano, M.
- Journal Title
  
  Acoustical Science and Technology
  
  Volume: Vol.31(2) Pages: 181-184
[Journal Article] 局所変化率変換に基づく有声音声の正弦波モデル2009
- Author(s)
  伊藤仁, 伊藤彰則
- Journal Title
  
  第8回情報科学技術フォーラム講演論文集
  
  Volume: Vol2 Pages: 43-48
[Presentation] ケプストラム係数を用いた母音のフォルマント分析2012
- Author(s)
  伊藤仁, 蒔苗久則
- Organizer
  日本音響学会2012年春季研究発表会講演論文集
- Year and Date
  20120000
[Presentation] 話者認識における母音の音韻性の影響2011
- Author(s)
  岩佐尚輝, 亀井大陸, 伊藤仁
- Organizer
  平成23年東北地区若手研究者研究発表会
- Year and Date
  20110000
[Presentation] フォルマントとスペクトル全体形状を統合した母音知覚モデルの検討2010
- Author(s)
  伊藤仁, 小原桂二, 伊藤彰則, 矢野雅文
- Organizer
  日本音響学会2010年春季研究発表会講演論文集
- Year and Date
  20100000
[Presentation] フォルマントピークとスペクトル傾きが母音知覚に及ぼす影響2010
- Author(s)
  小原桂二, 伊藤仁, 矢野雅文
- Organizer
  日本音響学2010年春季研究発表会講演論文集
- Year and Date
  20100000
[Presentation] An effect of formant amplitude in vowel perception2010
- Author(s)
  Ito, M., Ohara, K., Ito, A., and Yano, M.
- Organizer
  Interspeech 2010
- Place of Presentation
  Makuhari
- Year and Date
  20100000
[Presentation] Relative importance of formant and whole-spectral cues for vowel perception2009
- Author(s)
  Ito, M., Ohara, K., Ito, A. and Yano, M.
- Organizer
  Interspeech 2009
- Place of Presentation
  Brighton
- Year and Date
  20090000
[Presentation] スペクトル全体形状モデルに基づく連続母音の音響特性2009
- Author(s)
  伊藤仁, 伊藤彰則, 矢野雅文
- Organizer
  日本音響学会2009年春季研究発表会講演論文集
- Year and Date
  20090000

2011 Fiscal Year Final Research Report

A perceptual model of speech based on real-time speaker adaptation

Principal Investigator

ITO Masashi 東北工業大学, 知能エレクトロニクス学科, 講師 (00436164)

Research Products

[Journal Article] 局所変化率変換と時間軸変換に基づく有声音声の正弦波モデル2010

Author(s)

Journal Title

[Journal Article] A source-filter separation for non-stationary voiced speech based on sinusoidal representation2009

Author(s)

Journal Title

[Journal Article] 局所変化率変換に基づく有声音声の正弦波モデル2009

Author(s)

Journal Title

[Presentation] ケプストラム係数を用いた母音のフォルマント分析2012

Author(s)

Organizer

Year and Date

[Presentation] 話者認識における母音の音韻性の影響2011

Author(s)

Organizer

Year and Date

[Presentation] フォルマントとスペクトル全体形状を統合した母音知覚モデルの検討2010

Author(s)

Organizer

Year and Date

[Presentation] フォルマントピークとスペクトル傾きが母音知覚に及ぼす影響2010

Author(s)

Organizer

Year and Date

[Presentation] An effect of formant amplitude in vowel perception2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Relative importance of formant and whole-spectral cues for vowel perception2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] スペクトル全体形状モデルに基づく連続母音の音響特性2009

Author(s)

Organizer

Year and Date