Project/Area Number |
25330210
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Multi-year Fund |
Section | 一般 |
Research Field |
Perceptual information processing
|
Research Institution | Shizuoka University (2015) Nagoya University (2014) Doshisha University (2013) |
Principal Investigator |
|
Co-Investigator(Kenkyū-buntansha) |
YAMAMOTO SEIICHI 同志社大学, 理工学部, 教授 (20374100)
|
Project Period (FY) |
2013-04-01 – 2016-03-31
|
Project Status |
Completed (Fiscal Year 2015)
|
Budget Amount *help |
¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)
Fiscal Year 2015: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
Fiscal Year 2014: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000)
Fiscal Year 2013: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000)
|
Keywords | 多人数会話 / 話者ダイアライゼーション / 発話形式 / 音韻性 / 話者性 / 話者内分散 / 話者間分散 / 話者クラスタリング / 音韻性と話者性 / 主成分分析 / GMM / 講演音声 / 話者認識 / 話題内容 / 発話動作 |
Outline of Final Research Achievements |
We proposed a speaker clustering method using Gaussian mixture model in flexibly selected speaker subspace based on variance of intra-utterance in order to realize a robust speaker clustering to various speaking style. We carried out speaker clustering experiments compared with conventional methods based on Bayesian information criterion and Gaussian mixture model in an observation space. The experimental results showed that the proposed method can achieve higher clustering accuracy than conventional methods.
|