Computational Auditory Scene Analysis Using Active Audio-Visual Integration in a Dynamically Changing Environment

Research Project

Project/Area Number	22700165
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	Tokyo Institute of Technology
Principal Investigator	NAKADAI Kazuhiro 東京工業大学, 大学院・情報理工学研究科, 講師 (70436715)
Project Period (FY)	2010 – 2012
Project Status	Completed (Fiscal Year 2012)
Budget Amount *help	¥4,030,000 (Direct Cost: ¥3,100,000、Indirect Cost: ¥930,000) Fiscal Year 2012: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000) Fiscal Year 2011: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2010: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Keywords	センサ融合 / 統合(ロボット聴覚,アクティブ視聴覚統合,アクティブ聴覚,視聴覚音声認識,視聴覚発話区間検出) / ロボット聴覚 / アクティブ視聴覚統合 / アクティブ聴覚 / 視聴覚音声認識 / 視聴覚発話区間検出 / 視聴覚統合 / 音声認識 / 発話区間検出 / 音源同定 / 雑音抑圧 / ソフトウェアアーキテクチャ / 信頼度付特徴量
Research Abstract	A framework for Audio-Visual Integration (AVI), which can provide optimal integration according to quality of audio and visual information obtained from a robot’s camera and microphone, was proposed and implemented. In addition, the proposed framework was extended by proposing “Active Audio Visual Integration (AAVI)”, which improves the quality of audio and visual information using active robot ’ s motion. Preliminary experiments on automatic speech recognition and voice activity detection showed that the AAVI framework worked effectively even in visually and/or auditorily noisy conditions.

Report

(4 results)

2012 Annual Research Report Final Research Report ( PDF )
2011 Annual Research Report
2010 Annual Research Report

Research Products
(62 results)

All 2013 2012 2011 2010 Other

All Journal Article (11 results) (of which Peer Reviewed: 10 results) Presentation (47 results) (of which Invited: 1 results) Remarks (4 results)

[Journal Article] クワドロコプター搭載のマイクロホンアレイを用いた屋外音環境理解の逐次雑音推定による向上2013
- Author(s)
  奥谷啓太, 吉田尚水, 中村圭佑, 中臺一博
- Volume
  31(掲載決定)
- Pages
  7-8
- NAID
  10031194247
- Related Report
  2012 Final Research Report
- Peer Reviewed
[Journal Article] クワドロコプター搭載のマイクロホンアレイを用いた屋外音環境理解の逐次雑音推定による向上2013
- Author(s)
  奥谷啓太, 吉田尚水, 中村圭佑, 中臺一博
- Journal Title
  
  日本ロボット学会誌
  
  Volume: 31(7-8)
- NAID
  10031194247
- Related Report
  2012 Annual Research Report
[Journal Article] Audio-Visual Voice Activity Detection Based on an Utterance State Transition Model2012
- Author(s)
  K. Nakadai, T. Yoshida
- Journal Title
  
  Advanced Robotics
  
  Volume: 26(10) Issue: 10 Pages: 1183-1201
- DOI
  10.1080/01691864.2012.687152
- Related Report
  2012 Annual Research Report 2012 Final Research Report
- Peer Reviewed
[Journal Article] SLAM-based Online Calibration for Asynchronous Microphone Array2012
- Author(s)
  H. Miura, T. Yoshida, K. Nakamura, K.Nakadai
- Journal Title
  
  Advanced Robotics
  
  Volume: 26(17) Issue: 17 Pages: 1941-1965
- DOI
  10.1080/01691864.2012.728690
- Related Report
  2012 Annual Research Report 2012 Final Research Report
- Peer Reviewed
[Journal Article] Whole Body Motion Noise Cancellation of a Robot for Improved Automatic SpeechRecognition2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, H.Tsujino, J. Imura
- Journal Title
  
  Advanced Robotics
  
  Volume: 25 Issue: 11-12 Pages: 1405-1426
- DOI
  10.1163/016918611x579448
- NAID
  10031135639
- Related Report
  2012 Final Research Report 2011 Annual Research Report
- Peer Reviewed
[Journal Article] Ego NoiseCancellation of a Robot using MissingFeature Masks2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, H.Tsujino, J. Imura
- Journal Title
  
  Applied Intelligence
  
  Volume: 34 Issue: 3 Pages: 360-371
- DOI
  10.1007/s10489-011-0285-0
- Related Report
  2012 Final Research Report 2011 Annual Research Report
- Peer Reviewed
[Journal Article] ロボット聴覚のための2階層視聴覚情報統合を用いた音声認識システムの検討2010
- Author(s)
  吉田尚水, 中臺一博, 奥乃博
- Journal Title
  
  日本ロボット学会誌
  
  Volume: 28 Pages: 56-63
- URL
  https://www.jstage.jst.go.jp/article/jrsj/28/8/28_8_970/_pdf
- Related Report
  2012 Final Research Report 2010 Annual Research Report
- Peer Reviewed
[Journal Article] Robust Ego Noise Suppression of a Robot2010
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, H.Tsujino, J. Imura
- Journal Title
  
  Trends in Applied Intelligent Systems,Lecture Notes in Computer Science
  
  Volume: 6096/2010 Pages: 62-71
- DOI
  10.1007/978-3-642-13022-9_7
- ISBN
  9783642130212, 9783642130229
- Related Report
  2012 Final Research Report
- Peer Reviewed
[Journal Article] An Improvement in Audio-Visual Voice Activity Detection for AutomaticSpeech Recognition2010
- Author(s)
  T. Yoshida, K. Nakadai, H. G. Okuno
- Journal Title
  
  Trends in Applied Intelligent Systems, Lecture Notes in Computer Science
  
  Volume: 6096/2010 Pages: 51-61
- DOI
  10.1007/978-3-642-13022-9_6
- ISBN
  9783642130212, 9783642130229
- Related Report
  2012 Final Research Report
- Peer Reviewed
[Journal Article] Robust Ego Noise Suppression of a Robot2010
- Author(s)
  Gokhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura
- Journal Title
  
  Trends in Applied Intelligent Systems Lecture Notes in Computer Science
  
  Volume: 6096/2010 Pages: 62-71
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] An Improvement in Audio-Visual Voice Activity Detection for Automatic Speech Recognition2010
- Author(s)
  Takami Yoshida, Kazuhiro Nakadai, Hiroshi G.Okuno
- Journal Title
  
  Trends in Applied Intelligent Systems, Lecture Notes in Computer Science
  
  Volume: 6096/2010 Pages: 51-61
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Presentation] Active Audio-Visual Integration for Robots2013
- Author(s)
  K. Nakadai, T. Yoshida
- Organizer
  The 2nd Symposium on Binaural Active Audition for Humanoid Robots (BINAAHR)
- Place of Presentation
  京都
- Year and Date
  2013-03-18
- Related Report
  2012 Final Research Report
[Presentation] Active Audio-Visual Integration for Robots2013
- Author(s)
  K. Nakadai, T. Yoshida
- Organizer
  The 2nd Symposium on Binaural Active Audition for Humanoid Robots (BINAAHR)
- Place of Presentation
  京都大学（京都）
- Related Report
  2012 Annual Research Report
- Invited
[Presentation] アクティブ視聴覚統合による発話区間検出の検討:因果モデルベースアプローチ2012
- Author(s)
  吉田尚水,中臺一博
- Organizer
  人工知能学会第36回AI-Challenge研究会
- Place of Presentation
  東京
- Year and Date
  2012-11-15
- Related Report
  2012 Final Research Report
[Presentation] Audio-VisualIntegration for voice activity detection2012
- Author(s)
  T. Yoshida, K. Nakadai
- Organizer
  First Symposium on Binaural Active Audition for Humanoid Robots
- Place of Presentation
  パリ(フランス)
- Year and Date
  2012-02-27
- Related Report
  2012 Final Research Report
[Presentation] Audio-Visual Integration for voice activity detection2012
- Author(s)
  T. Yoshida, K. Nakadai
- Organizer
  First Symposium on Binaural Active Audition for Humanoid Robots
- Place of Presentation
  パリ(フランス)(招待講演)
- Year and Date
  2012-02-27
- Related Report
  2011 Annual Research Report
[Presentation] Improvement of Audio-Visual Score Following in Robot Ensemble with Human Guitarist2012
- Author(s)
  T. Itohara, K. Nakadai, T. Ogata, H.G.Okuno
- Organizer
  IEEE-RASInternational Conference on HumanoidRobots(Humanoids 2012)
- Place of Presentation
  大阪
- Related Report
  2012 Final Research Report
[Presentation] Active Audio-Visual Integration for Voice Activity Detection based on a CausalBayesian Network2012
- Author(s)
  T. Yoshida, K. Nakadai
- Organizer
  IEEE-RASInternational Conference on HumanoidRobots (Humanoids 2012)
- Place of Presentation
  大阪
- Related Report
  2012 Final Research Report
[Presentation] Live Assessment of Beat Tracking for Robot Audition2012
- Author(s)
  J. L. Oliveira, G. Ince, K. Na kamura, K. Nakadai, H.G. Okuno, L. P. Reis, F. Gouyon
- Organizer
  IEEE/RSJInternational Conference on Intelligent Robots and Systems (IROS-2012)
- Place of Presentation
  ビラモウラ(ポルトガル)
- Related Report
  2012 Final Research Report
[Presentation] ロボット聴覚のための因果モデルを用いたアクティブ視聴覚統合発話区間検出の検討2012
- Author(s)
  吉田尚水,中臺一博
- Organizer
  第30回日本ロボット学会学術講演会
- Place of Presentation
  札幌
- Related Report
  2012 Final Research Report
[Presentation] Improvement of Audio-Visual Score Following in Robot Ensemble with Human Guitarist2012
- Author(s)
  Tatsuhiko Itohara, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2012)
- Place of Presentation
  大阪産業創造館（大阪）
- Related Report
  2012 Annual Research Report
[Presentation] Active Audio-Visual Integration for Voice Activity Detection based on a Causal Bayesian Network2012
- Author(s)
  T. Yoshida, K. Nakadai
- Organizer
  IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2012)
- Place of Presentation
  大阪産業創造館（大阪）
- Related Report
  2012 Annual Research Report
[Presentation] Outdoor Auditory Scene Analysis Using a Moving Microphone Array Embedded in a Quadrocopter2012
- Author(s)
  K. Okutani, T. Yoshida, K. Nakamura, K. Nakadai
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2012)
- Place of Presentation
  ビラモウラ（ポルトガル）
- Related Report
  2012 Annual Research Report
[Presentation] Live Assessment of Beat Tracking for Robot Audition2012
- Author(s)
  J. L. Oliveira, G. Ince, K. Nakamura, K. Nakadai, H.G. Okuno, L. P. Reis, F. Gouyon
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2012)
- Place of Presentation
  ビラモウラ（ポルトガル）
- Related Report
  2012 Annual Research Report
[Presentation] An Active Audition Framework for Auditory-driven HRI: Application to Interactive Robot Dancing2012
- Author(s)
  J. L. Oliveira, G. Ince, K. Nakamura, K. Nakadai, H.G. Okuno, L. P. Reis, F. Gouyon
- Organizer
  IEEE International Symposium on Robot and Human Interactive Communication (Ro-Man 2012)
- Place of Presentation
  パリ（フランス）
- Related Report
  2012 Annual Research Report
[Presentation] ロボットを対象とした動作指令値ベース動作雑音抑圧手法の検討2012
- Author(s)
  手塚大貴，吉田尚水，中臺一博
- Organizer
  第13回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  福岡国際会議場（福岡）
- Related Report
  2012 Annual Research Report
[Presentation] アクティブ視聴覚統合による発話区間検出の検討: 因果モデルベースアプローチ2012
- Author(s)
  吉田尚水，中臺一博
- Organizer
  人工知能学会第36回 AI-Challenge 研究会
- Place of Presentation
  慶応大学（東京）
- Related Report
  2012 Annual Research Report
[Presentation] クワドロコプタを用いた屋外音環境理解の逐次雑音推定による向上2012
- Author(s)
  奥谷啓太，吉田尚水，中村圭佑，中臺一博
- Organizer
  第30回日本ロボット学会学術講演会
- Place of Presentation
  札幌コンベンションセンター（北海道）
- Related Report
  2012 Annual Research Report
[Presentation] ロボット聴覚のための因果モデルを用いたアクティブ視聴覚統合発話区間検出の検討2012
- Author(s)
  吉田尚水，中臺一博
- Organizer
  第30回日本ロボット学会学術講演会
- Place of Presentation
  札幌コンベンションセンター（北海道）
- Related Report
  2012 Annual Research Report
[Presentation] クワドロコプターを使った屋外音環境の収録と解析2011
- Author(s)
  奥谷啓太, 吉田尚水, 中村圭佑, 中臺一博
- Organizer
  第12回計測自動制御学会システムインテグレーション部門講演会,計測自動制御学会
- Place of Presentation
  京都大学(京都)
- Year and Date
  2011-12-24
- Related Report
  2011 Annual Research Report
[Presentation] SLAMに基づく非同期分散マイクロホンアレイのキャリブレーションの評価2011
- Author(s)
  三浦弘樹, 吉田尚水, 中村圭佑, 中臺一博
- Organizer
  人工知能学会第34回AI-Challenge研究会
- Place of Presentation
  慶応大学(東京都)
- Year and Date
  2011-12-15
- Related Report
  2011 Annual Research Report
[Presentation] Multi-talker Speech Recognition under Ego-motion Noise using Missing Feature Theory2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, H.Tsujino, J. Imura
- Organizer
  IEEE/RSJInternational Conference onIntelligent Robots and Systems (IROS2010)
- Place of Presentation
  台北(台湾)
- Year and Date
  2011-10-19
- Related Report
  2012 Final Research Report
[Presentation] ロボットのための情報量レベルに基づくアクティブ視聴覚統合の検討2011
- Author(s)
  吉田尚水, 中村圭佑, 中臺一博
- Organizer
  第29回日本ロボット学会学術講演会,日本ロボット学会
- Place of Presentation
  東京
- Year and Date
  2011-09-09
- Related Report
  2012 Final Research Report
[Presentation] SLAMとビームフォーミングによる非同期分散マイクロホンアレイのキャリブレーション2011
- Author(s)
  三浦弘樹, 吉田尚水, 中村圭佑, 中臺一博
- Organizer
  第29回日本ロボット学会学術講演会,日本ロボット学会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-09
- Related Report
  2011 Annual Research Report
[Presentation] ロボットのための情報量レベルに基づくアクティブ視聴覚統合の検討2011
- Author(s)
  吉田尚水, 中村圭佑, 中臺一博
- Organizer
  第29回日本ロボット学会学術講演会,日本ロボット学会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-09
- Related Report
  2011 Annual Research Report
[Presentation] Assessment of General Applicability of Ego Noise Estimation-Applications toAutomatic Speech Recognition and Sound Source Localization2011
- Author(s)
  G. Ince, K. Nakamura, F. Asano, H.Nakajima, K. Nakadai
- Organizer
  IEEE-RAS International Conference on Roboticsand Automation (ICRA 2011)
- Place of Presentation
  (上海)中国
- Year and Date
  2011-05-11
- Related Report
  2012 Final Research Report
[Presentation] Assessment of General Applicability of Ego Noise Estimation-Applications to Automatic Speech Recognition and Sound Source Localization-2011
- Author(s)
  G. Ince, K. Nakamura, F. Asano, H. Nakajima, K. Nakadai
- Organizer
  IEEE-RAS International Conference on Robotics and Automation (ICRA 2011)
- Place of Presentation
  上海(中国)
- Year and Date
  2011-05-11
- Related Report
  2011 Annual Research Report
[Presentation] SLAMに基づく非同期分散型マイクロホンアレイによる音源定位2011
- Author(s)
  三浦弘樹, 吉田尚水, 中臺一博
- Organizer
  情報処理学会第73回全国大会
- Place of Presentation
  東京工業大学,東京
- Year and Date
  2011-03-02
- Related Report
  2010 Annual Research Report
[Presentation] ロボット聴覚～高雑音下でのハンズフリー音声認識～2011
- Author(s)
  中臺一博, 奥乃博
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  ATR,京都(招待講演)
- Year and Date
  2011-01-27
- Related Report
  2010 Annual Research Report
[Presentation] Incremental Learning for Ego Noise Estimation of a Robot2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, J.Imura, K. Nakamura, H. Nakajima
- Organizer
  IEEE/RSJInternational Conference onIntelligent Robots and Systems (IROS2011)
- Place of Presentation
  サンフランシスコ(アメリカ)
- Related Report
  2012 Final Research Report
[Presentation] Assessment of Single-channel Ego Noise Estimation Methods2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, J.Imura, K. Nakamura, H. Nakajima
- Organizer
  IEEE/RSJInternational Conference onIntelligent Robots and Systems (IROS2011)
- Place of Presentation
  サンフランシスコ(アメリカ)
- Related Report
  2012 Final Research Report
[Presentation] Incremental Learning for Ego Noise Estimation of a Robot2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, J. Imura, K. Nakamura, H. Nakajima
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011)
- Place of Presentation
  サンフランシスコ(アメリカ)
- Related Report
  2011 Annual Research Report
[Presentation] Assessment of Single-channel Ego Noise Estimation Methods2011
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, J. Imura, K. Nakamura, H. Nakajima
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011)
- Place of Presentation
  サンフランシスコ(アメリカ)
- Related Report
  2011 Annual Research Report
[Presentation] SLAM-based Online Calibration of Asynchronous Microphone Array for Robot Audition2011
- Author(s)
  H. Miura, T. Yoshida, K. Nakamura, K. Nakadai
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011)
- Place of Presentation
  サンフランシスコ(アメリカ)
- Related Report
  2011 Annual Research Report
[Presentation] ロボット聴覚用オープンソースソフトウェアHARK 1.0.0の概要2010
- Author(s)
  中臺一博, 奥乃博
- Organizer
  第11回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  東北大学,仙台
- Year and Date
  2010-12-25
- Related Report
  2010 Annual Research Report
[Presentation] ロボットによる音声発話区間検出のためのハイブリッドダイナミカルシステムに基づくモダリティ選択の検討2010
- Author(s)
  吉田尚水, 中臺一博
- Organizer
  第11回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  仙台
- Year and Date
  2010-12-23
- Related Report
  2012 Final Research Report
[Presentation] ロボットによる音声発話区間検出のためのハイブリッドダイナミカルシステムに基づくモダリティ選択の検討2010
- Author(s)
  吉田尚水, 中臺一博
- Organizer
  第11回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  東北大学,仙台
- Year and Date
  2010-12-23
- Related Report
  2010 Annual Research Report
[Presentation] ロボット聴覚における音声認識技術-ロボット知能化に向けて-2010
- Author(s)
  中臺一博
- Organizer
  日本ロボット学会ロボット工学セミナー「記号・言語を基盤としたロボットの知能化技術」
- Place of Presentation
  東京大学,東京(セミナー講師)
- Year and Date
  2010-11-29
- Related Report
  2010 Annual Research Report
[Presentation] Two-Layered Audio-Visual Speech Recognition for Robots in NoisyEnvironments2010
- Author(s)
  T. Yoshida, K. Nakadai, H.G. Okuno
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)
- Place of Presentation
  台北(台湾)
- Year and Date
  2010-10-19
- Related Report
  2012 Final Research Report
[Presentation] Multi-talker Speech Recognition under Ego-motion Noise using Missing Feature Theory2010
- Author(s)
  Gokhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)
- Place of Presentation
  Taipei, Taiwan
- Year and Date
  2010-10-19
- Related Report
  2010 Annual Research Report
[Presentation] Two-Layered Audio-Visual Speech Recognition for Robots in Noisy Environments2010
- Author(s)
  Takami Yoshida, Kazuhiro Nakadai, Hiroshi G.Okuno
- Organizer
  IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)
- Place of Presentation
  Taipei, Taiwan
- Year and Date
  2010-10-19
- Related Report
  2010 Annual Research Report
[Presentation] Audio-visual speech recognition system for a robot2010
- Author(s)
  T. Yoshida, K. Nakadai
- Organizer
  International Conference on Auditory-Visual Speech Processing (AVSP 2010)
- Place of Presentation
  箱根
- Year and Date
  2010-10-01
- Related Report
  2012 Final Research Report
[Presentation] Audio-visual speech recognition system for a robot2010
- Author(s)
  Takami Yoshida, Kazuhiro Nakadai
- Organizer
  International Conference on Auditory-Visual Speech Processing (AVSP 2010)
- Place of Presentation
  Hakone, Kanagawa
- Year and Date
  2010-10-01
- Related Report
  2010 Annual Research Report
[Presentation] Two-layered audio-visual integration in voice activity detection and automatic speech recognition for robots2010
- Author(s)
  Takami Yoshida, Kazuhiro Nakadai
- Organizer
  International Conference on Spoken Language Processing (Interspeech 2010)
- Place of Presentation
  Makuhari, Chiba
- Year and Date
  2010-09-30
- Related Report
  2010 Annual Research Report
[Presentation] A Robust Speech Recognition System against the Ego Noise of a Robot2010
- Author(s)
  G. Ince, K. Nakadai, T. Rodemann, H.Tsujino, J. Imura
- Organizer
  InternationalConference on Spoken LanguageProcessing (Interspeech 2010)
- Place of Presentation
  千葉
- Year and Date
  2010-09-29
- Related Report
  2012 Final Research Report
[Presentation] Two-layered audio-visual integration in voice activity detection and automatic speech recognition for robots2010
- Author(s)
  T. Yoshida, K. Nakadai
- Organizer
  International Conference on Spoken Language Processing (Interspeech2010)
- Place of Presentation
  千葉
- Year and Date
  2010-09-29
- Related Report
  2012 Final Research Report
[Presentation] A Robust Speech Recognition System against the Ego Noise of a Robot2010
- Author(s)
  Gokhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura
- Organizer
  International Conference on Spoken Language Processing (Interspeech 2010)
- Place of Presentation
  Makuhari, Chiba
- Year and Date
  2010-09-29
- Related Report
  2010 Annual Research Report
[Presentation] A Hybrid Framework for Ego Noise Cancellation of a Robot2010
- Author(s)
  Gokhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Yuji Hasegawa, Hiroshi Tsujino, Jun-ichi Imura
- Organizer
  IEEE-RAS International Conference on Robotics and Automation (ICRA 2010)
- Place of Presentation
  Anchorage, USA
- Year and Date
  2010-05-06
- Related Report
  2010 Annual Research Report
[Remarks] HARKのページロボット聴覚オープンソースソフトウェア
- URL
  http://winnie.kuis.kyoto-u.ac.jp/
- Related Report
  2012 Final Research Report
[Remarks] 東京工業大学中臺研究室HP
- URL
  http://www.cyb.mei.titech.ac.jp/nakadai
- Related Report
  2012 Final Research Report
[Remarks] ロボット聴覚オープンソースソフトウェアHARKのページ
- URL
  http://winnie.kuis.kyoto-u.ac.jp/
- Related Report
  2012 Annual Research Report
[Remarks] 東京工業大学大学院情報理工学研究科中臺研究室HP
- URL
  http://www.cyb.mei.titech.ac.jp/nakadai/
- Related Report
  2012 Annual Research Report

Computational Auditory Scene Analysis Using Active Audio-Visual Integration in a Dynamically Changing Environment

Principal Investigator

NAKADAI Kazuhiro 東京工業大学, 大学院・情報理工学研究科, 講師 (70436715)

¥4,030,000 (Direct Cost: ¥3,100,000、Indirect Cost: ¥930,000)

Report

Research Products

[Journal Article] クワドロコプター搭載のマイクロ ホンアレイを用いた屋外音環境理解の逐次雑音推定による向上2013

Author(s)

Volume

Pages

NAID

Related Report

[Journal Article] クワドロコプター搭載のマイクロホンアレイを用いた屋外音環境理解の逐次雑音推定による向上2013

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Audio-Visual Voice Activity Detection Based on an Utterance State Transition Model2012

Author(s)

Journal Title

DOI

Related Report

[Journal Article] SLAM-based Online Calibration for Asynchronous Microphone Array2012

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Whole Body Motion Noise Cancellation of a Robot for Improved Automatic SpeechRecognition2011

Author(s)

Journal Title

DOI

NAID

Related Report

[Journal Article] Ego NoiseCancellation of a Robot using MissingFeature Masks2011

Author(s)

Journal Title

DOI

Related Report

[Journal Article] ロボット聴覚のための2階層視聴覚情報統合を用いた音声認識システムの検討2010

Author(s)

Journal Title

URL

Related Report

[Journal Article] Robust Ego Noise Suppression of a Robot2010

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] An Improvement in Audio-Visual Voice Activity Detection for AutomaticSpeech Recognition2010

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Robust Ego Noise Suppression of a Robot2010

Author(s)

Journal Title

Related Report

[Journal Article] An Improvement in Audio-Visual Voice Activity Detection for Automatic Speech Recognition2010

Author(s)

Journal Title

Related Report

[Presentation] Active Audio-Visual Integration for Robots2013

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Active Audio-Visual Integration for Robots2013

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] アクティブ視聴覚統合による発話区間検出の検討:因果モデルベースアプローチ2012

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Journal Article] クワドロコプター搭載のマイクロホンアレイを用いた屋外音環境理解の逐次雑音推定による向上2013