2009 Fiscal Year Final Research Report
Audio-visual speech corpus for evaluating speech recognition performance in noisy environments
Project/Area Number |
19700163
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Single-year Grants |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Nagoya University |
Principal Investigator |
MIYAJIMA Chiyomi Nagoya University, 大学院・情報科学研究科, 助教 (90335092)
|
Project Period (FY) |
2007 – 2009
|
Keywords | バイモーダル音声認識 / データベース / 雑音環境 / 車内雑音 |
Research Abstract |
Audio-visual speech data are collected in a silent room and a vehicle for developing an audio-visual speech corpus which is used for evaluating speech recognition performance in noisy environments, especially in in-car environments. Acoustic noise and gamma values of images are used for simulating in-car environments over the recorded data in the silent room. Baseline audio and visual features and an integration method are calibrated in some experimental evaluations. The corpus will be open to the public along with database manuals for research purposes.
|
-
[Journal Article] CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments2009
Author(s)
N. Kitaoka, T. Yamada, S. Tsuge, C. Miyajima, K. Yamamoto, T. Nishiura, M. Nakayama, Y. Denda, M. Fujimoto, T. Takiguchi, S. Tamura, S. Matsuda, T. Ogawa, S. Kuroiwa, K. Takeda, S. Nakamura
-
Journal Title
Acoustical Science and Technology
Pages: 363-371
Peer Reviewed
-
-
-
[Presentation] CENSREC- 1-AV:マルチモーダル音声認識コーパスの構築2010
Author(s)
田村哲嗣, 宮島千代美, 北岡教英, 武田一哉, 山田武志, 滝口哲也, 柘植覚, 山本一公, 西浦敬信, 中山雅人, 傳田遊亀, 藤本雅清, 松田繁樹, 小川哲司, 黒岩眞吾, 中村哲
Organizer
2010年日本音響学会春季研究発表会
Place of Presentation
調布市
Year and Date
20100300
-
-
-
-
[Presentation] CENSREC-4: Development of evaluation framework for distant- talking speech recognition under reverberant environments2008
Author(s)
M. Nakayama, T. Nishiura, Y. Denda, N. Kitaoka, K. Yamamoto, T. Yamada, S. Tsuge, C. Miyajima, M. Fujimoto, T. Takiguchi, S. Tamura, T. Ogawa, S. Matsuda, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
2008 International Conference on Spoken Language Processing
Place of Presentation
オーストラリア
Year and Date
20080900
-
-
-
[Presentation] CENSREC- 4: Development of evaluation framework for distant-talking speech recognition under reverberant environments2008
Author(s)
T. Nishiura, M. Nakayama, Y. Denda, N. Kitaoka, K. Yamamoto, T. Yamada, S. Tsuge, C. Miyajima, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
2008 Language Resources and Evaluation Conference
Place of Presentation
モロッコ
Year and Date
20080500
-
[Presentation] Development of VAD evaluation framework CENSREC- 1-C and investigation of relationship between VAD and speech recognition performance2007
Author(s)
N. Kitaoka, K. Yamamoto, T. Kusamizu, S. Nakagawa, T. Yamada, S. Tsuge, C. Miyajima, T. Nishiura, M. Nakayama, Y. Denda, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
2007 IEEE workshop on Automatic Speech Recognition and Understanding
Place of Presentation
京都市
Year and Date
20071200
-
-