A perceptual model of speech based on real-time speaker adaptation
Project/Area Number |
21700282
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Single-year Grants |
Research Field |
Cognitive science
|
Research Institution | Tohoku Institute of Technology (2010-2011), Tohoku University (2009) |
Principal Investigator |
ITO Masashi Tohoku Institute of Technology, Department of Intelligent Electronics, Lecturer (00436164)
|
Project Period (FY) |
2009 – 2011
|
Project Status |
Completed (Fiscal Year 2011)
|
Budget Amount |
¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000)
Fiscal Year 2011: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2010: ¥1,950,000 (Direct Cost: ¥1,500,000, Indirect Cost: ¥450,000)
Fiscal Year 2009: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000)
|
Keywords | speech / speaker adaptation / cognitive model / perceptual experiment / speech perception / formant / vowel / sinusoidal model / phonetics / cognitive science |
Research Abstract |
Perceptual experiments indicated that speakers uttering different vowels could be correctly identified with an accuracy of more than 80%. A new analysis method based on the sinusoidal representation of the speech signal was proposed from an analysis of speech uttered by 632 speakers. Further, a cosine expansion of the speech spectra and quadratic combinations of the resulting coefficients were shown to be effective features for vowel perception. This result supports the hypothesis that perceptual features for vowels might be extracted by a two-step synaptic combination in the auditory periphery.
|
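The feature construction named in the abstract (a cosine expansion of the spectrum followed by quadratic combinations of the coefficients) can be illustrated with a minimal sketch. This is not the project's actual method: the function names `cosine_expansion` and `quadratic_features` are hypothetical, the DCT-II basis is an assumed choice of cosine expansion, and the input is assumed to be a sampled (for example log-magnitude) spectrum.

```python
import numpy as np

def cosine_expansion(spectrum, n_coef):
    """Project a sampled spectrum onto the first n_coef cosine basis
    functions (DCT-II form; an assumed choice, not taken from the report)."""
    spectrum = np.asarray(spectrum, dtype=float)
    n = spectrum.size
    k = np.arange(n_coef)[:, None]      # coefficient index
    m = np.arange(n)[None, :]           # frequency-bin index
    basis = np.cos(np.pi * k * (m + 0.5) / n)
    return basis @ spectrum / n         # mean of basis * spectrum per coefficient

def quadratic_features(coefs):
    """Return all pairwise products c_i * c_j with i <= j."""
    coefs = np.asarray(coefs, dtype=float)
    i, j = np.triu_indices(coefs.size)
    return coefs[i] * coefs[j]

# Synthetic spectrum made of a single cosine component (illustration only).
n_bins = 64
bins = np.arange(n_bins)
toy_spectrum = np.cos(np.pi * 2 * (bins + 0.5) / n_bins)

coefs = cosine_expansion(toy_spectrum, n_coef=4)
features = quadratic_features(coefs)
```

With four coefficients the quadratic step yields ten features, one for each unordered pair of coefficients including squares. In this toy case only the coefficient at index 2 is nonzero, so only its square contributes.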
Report
(4 results)
Research Products
(19 results)