Audio-visual speech corpus for evaluating speech recognition performance in noisy environments
Project/Area Number |
19700163
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Single-year Grants |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Nagoya University |
Principal Investigator |
MIYAJIMA Chiyomi Nagoya University, 大学院・情報科学研究科, 助教 (90335092)
|
Project Period (FY) |
2007 – 2009
|
Project Status |
Completed (Fiscal Year 2009)
|
Budget Amount *help |
¥3,843,644 (Direct Cost: ¥3,325,880、Indirect Cost: ¥517,764)
Fiscal Year 2009: ¥813,644 (Direct Cost: ¥625,880、Indirect Cost: ¥187,764)
Fiscal Year 2008: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2007: ¥1,600,000 (Direct Cost: ¥1,600,000)
|
Keywords | バイモーダル音声認識 / データベース / 雑音環境 / 車内雑音 / 雑音下音声認識 / 音声認識性能評価 / 近赤外映像 / 自動車内雑音 / 主成分分析 / オプティカルフロー |
Research Abstract |
Audio-visual speech data are collected in a silent room and a vehicle for developing an audio-visual speech corpus which is used for evaluating speech recognition performance in noisy environments, especially in in-car environments. Acoustic noise and gamma values of images are used for simulating in-car environments over the recorded data in the silent room. Baseline audio and visual features and an integration method are calibrated in some experimental evaluations. The corpus will be open to the public along with database manuals for research purposes.
|
Report
(4 results)
Research Products
(28 results)
-
[Journal Article] CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments2009
Author(s)
N. Kitaoka, T. Yamada, S. Tsuge, C. Miyajima, K. Yamamoto, T. Nishiura, M. Nakayama, Y. Denda, M. Fujimoto, T. Takiguchi, S. Tamura, S. Matsuda, T. Ogawa, S. Kuroiwa, K. Takeda, S. Nakamura
-
Journal Title
Acoustical Science and Technology
Pages: 363-371
NAID
Related Report
Peer Reviewed
-
[Journal Article] CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments2009
Author(s)
N.Kitaoka, T.Yamada, S.Tsuge, C.Miyajima, K.Yamamoto, T.Nishiura, M.Nakayama, Y.Denda, M.Fujimoto, T.Takiguchi, S.Tamura, S.Matsuda, T.Ogawa, S.Kuroiwa, K.Takeda, S.Nakamura
-
Journal Title
Acoustical Science and Technology 30
Pages: 363-371
NAID
Related Report
Peer Reviewed
-
-
-
-
-
[Presentation] CENSREC-1-AV:マルチモーダル音声認識コーパスの構築2010
Author(s)
田村哲嗣, 宮島千代美, 北岡教英, 武田一哉, 山田武志, 滝口哲也, 柘植覚, 山本一公, 西浦敬信, 中山雅人, 傳田遊亀, 藤本雅清, 松田繁樹小川哲司, 黒岩眞吾, 中村哲
Organizer
2010年日本音響学会春季研究発表会
Place of Presentation
電気通信大学(東京都)
Year and Date
2010-03-08
Related Report
-
[Presentation] CENSREC- 1-AV:マルチモーダル音声認識コーパスの構築2010
Author(s)
田村哲嗣, 宮島千代美, 北岡教英, 武田一哉, 山田武志, 滝口哲也, 柘植覚, 山本一公, 西浦敬信, 中山雅人, 傳田遊亀, 藤本雅清, 松田繁樹, 小川哲司, 黒岩眞吾, 中村哲
Organizer
2010年日本音響学会春季研究発表会
Place of Presentation
調布市
Related Report
-
-
-
-
-
-
[Presentation] CENSREC-4 : Development of evaluation framework for distant-talking speech recognition under reverberant environments2008
Author(s)
M. Nakayama, T. Nishiura, Y. Denda, N. Kitaoka, K. Yamamoto, T. Yamada, S. Tsuge, C. Miyajima, M. Fujimoto, T. Takiguchi, S. Tamura, T. Ogawa, S. Matsuda, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
International Conference on Spoken Language Processing
Place of Presentation
Brisbane, Australia
Year and Date
2008-09-24
Related Report
-
-
[Presentation] CENSREC-4 : Development of evaluation framework for distant-talking speech recognition under reverberant environments2008
Author(s)
T. Nishiura, M. Nakayama, Y. Denda, N. Kitaoka, K. Yamamoto, T. Yamada, S. Tsuge, C. Miyajima, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda, and S. Nakamura
Organizer
Language Resources and Evaluation Conference
Place of Presentation
Marrakech, Morocco
Year and Date
2008-05-29
Related Report
-
-
-
[Presentation] CENSREC-4: Development of evaluation framework for distant- talking speech recognition under reverberant environments2008
Author(s)
M. Nakayama, T. Nishiura, Y. Denda, N. Kitaoka, K. Yamamoto, T. Yamada, S. Tsuge, C. Miyajima, M. Fujimoto, T. Takiguchi, S. Tamura, T. Ogawa, S. Matsuda, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
2008 International Conference on Spoken Language Processing
Place of Presentation
オーストラリア
Related Report
-
-
-
[Presentation] CENSREC- 4: Development of evaluation framework for distant-talking speech recognition under reverberant environments2008
Author(s)
T. Nishiura, M. Nakayama, Y. Denda, N. Kitaoka, K. Yamamoto, T. Yamada, S. Tsuge, C. Miyajima, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
2008 Language Resources and Evaluation Conference
Place of Presentation
モロッコ
Related Report
-
[Presentation] Development of VAD evaluation framework CENSREC- 1-C and investigation of relationship between VAD and speech recognition performance2007
Author(s)
N. Kitaoka, K. Yamamoto, T. Kusamizu, S. Nakagawa, T. Yamada, S. Tsuge, C. Miyajima, T. Nishiura, M. Nakayama, Y. Denda, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda, S. Nakamura
Organizer
2007 IEEE workshop on Automatic Speech Recognition and Understanding
Place of Presentation
京都市
Related Report
-
-
[Presentation] Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition perfor mance2007
Author(s)
N. Kitaoka, K. Yamamoto, T. Kusamizu, S. Nakagawa, T. Yamada, S. Tsuge, C. Miyajima, T. Nishiura, M. Nakayama, Y. Denda, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda, and S. Nakamura
Organizer
Proc. IEEE workshop on Automatic Speech Recognition and Understanding
Place of Presentation
Kyoto, Japan
Related Report
-
-
-