2021 年度実績報告書

Construction of a computational model to deal with the cocktail-party problem for intelligent speech interface

研究課題

研究課題/領域番号	19K12035
研究機関	国立研究開発法人情報通信研究機構
研究代表者	LU Xugang 国立研究開発法人情報通信研究機構, ユニバーサルコミュニケーション研究所先進的音声翻訳研究開発推進センター, 主任研究員 (20362022)
研究期間 (年度)	2019-04-01 – 2022-03-31
キーワード	Generative model / Discriminative model / Model coupling
研究実績の概要	In cocktail party scenarios, many information need to be explored in order to identify different speech (or sound) sources, in particular, who is speaking (speaker information) is one of the most important information for identifying speech sources. In order to combine advantages of both discriminative and generative classifier models for speakers, we proposed to couple a generative model in a discriminative learning for speaker recognition. Our framework showed a large improvement compared with state of the art models.