Improvement of function in Robot Audition based on Active Audition

Research Project

Project/Area Number	21700195
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Single-year Grants
Research Field	Perception information processing/Intelligent robotics
Research Institution	Kyoto University
Principal Investigator	TAKAHASHI Toru Kyoto University, Graduate School of Informatics, GCOE Assistant Professor (30419494)
Project Period (FY)	2009 – 2011
Project Status	Completed (Fiscal Year 2011)
Budget Amount	¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000); Fiscal Year 2011: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000); Fiscal Year 2010: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000); Fiscal Year 2009: ¥1,690,000 (Direct Cost: ¥1,300,000, Indirect Cost: ¥390,000)
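Keywords	robot audition / sound source localization / sound source separation / separated speech recognition / sound source tracking / speech recognition / active audition / missing feature theory / simultaneous speech recognition / speech recognition in real environments / humanoid robot / HRP-2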
Research Abstract	The accuracy of sound source localization, tracking, separation, and separated-speech recognition was improved for robot audition. These improvements enable a robot to recognize speech while it is moving or gesturing, as active audition requires, and they reduce the restrictions that active audition places on the robot's motion. A robot can approach a target sound source to improve the signal-to-noise ratio, and it can move to the position that best widens the angle between multiple sound sources. The improved system preserves speech recognition accuracy in the presence of robot motion noise, provided the robot's body is not located between the sound source and the microphone.

Report

(4 results)

2011 Annual Research Report Final Research Report ( PDF )
2010 Annual Research Report
2009 Annual Research Report

Research Products
(108 results)


All Journal Article (29 results) (of which Peer Reviewed: 22 results) Presentation (71 results) Book (3 results) Remarks (4 results) Patent(Industrial Property Rights) (1 results)

[Journal Article] Efficient Blind Dereverberation and Echo Cancellation based on Independent Component Analysis for Actual Acoustic Signals (2012)
- Author(s)
  R. Takeda, K. Nakadai, T. Takahashi, T. Ogata, H. G. Okuno
- Journal Title
  
  Neural Computation
  
  Volume: 24 Issue: 1 Pages: 234-272
- DOI
  10.1162/neco_a_00219
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Tool-Body Assimilation of Humanoid Robot using Neuro-Dynamical System (2012)
- Author(s)
  Shun Nishide, J. Tani, T. Takahashi, H. G. Okuno, T. Ogata
- Journal Title
  
  IEEE Transactions on Autonomous Mental Development
  
  Volume: 4 Issue: 2 Pages: 139-149
- DOI
  10.1109/tamd.2011.2177660
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Complex Extension of Infinite Sparse Factor Analysis for Blind Speech Separation (2012)
- Author(s)
  Kohei Nagira
- Journal Title
  
  Proceedings of 10th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA-2012)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] A GMM Sound Source Model for Blind Speech Separation in Under-determined Conditions (2012)
- Author(s)
  Yasuharu Hirasawa
- Journal Title
  
  Proceedings of 10th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA-2012)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Efficient Blind Dereverberation and Echo Cancellation based on Independent Component Analysis for Actual Acoustic Signals (2011)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  Neural Computation, MIT Press
- Related Report
  2011 Final Research Report
[Journal Article] Classification of Known and Unknown Environmental Sounds based on Self-organized Space using Recurrent Neural Network (2011)
- Author(s)
  Zhang Yang, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno
- Journal Title
  
  Advanced Robotics
  
  Volume: 13
- Related Report
  2011 Final Research Report
[Journal Article] 発語行為レベルの情報をユーザ発話の解釈に用いる音声対話システム (2011)
- Author(s)
  駒谷和範
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 52 Pages: 3374-3385
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] フレーズ置換のための調波非調波GMM・NMF・残響推定に基づく音源分離・演奏合成 (2011)
- Author(s)
  安良岡直希
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 52 Pages: 3839-3852
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Complex and Transitive Synchronization in a Frustrated System of Calling Frogs (2011)
- Author(s)
  Ikkyu Aihara
- Journal Title
  
  Journal of American Physical Society, Physical Review E
  
  Volume: 83 Issue: 3 Pages: 1-5
- DOI
  10.1103/physreve.83.031913
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Classification of Known and Unknown Environmental Sounds based on Self-organized Space using Recurrent Neural Network (2011)
- Author(s)
  Zhang Yang, Tetsuya Ogata, S.Nishide, T.Takahashi, H.G.Okuno
- Journal Title
  
  Advanced Robotics
  
  Volume: Vol.25,No.7 Issue: 17 Pages: 2127-2141
- DOI
  10.1163/016918611x595017
- NAID
  10031135674
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Use of a Sparse Structure to Improve Learning Performance of Recurrent Neural Networks (2011)
- Author(s)
  Hiromitsu Awano
- Journal Title
  
  Proceedings of 18th International Conference on Neural Information Processing (ICONIP 2011)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Fast and simple iterative algorithm of Lp-norm minimization for under-determined speech separation (2011)
- Author(s)
  Yasuharu Hirasawa
- Journal Title
  
  Proceedings of International Conference on Spoken Language Processing (Interspeech 2011)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Environmental Sound Recognition for Robot Audition using Matching-pursuit (2011)
- Author(s)
  Nobuhide Yamakawa
- Journal Title
  
  Proceedings of International Conference on Spoken Language Processing (Interspeech 2011)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Robot with Two Ears Listens to More Than Two Simultaneous Utterances by Exploiting Harmonic Structures (2011)
- Author(s)
  Yasuharu Hirasawa
- Journal Title
  
  Proceeding of the 24th International Conference on Industrial, Engineering and Other Applications of Applied Intelligence Systems (IEA/AIE-2011)
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Efficient Blind Dereverberation and Echo Cancellation based on Independent Component Analysis for Actual Acoustic Signals (2011)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G.Okuno
- Journal Title
  
  Neural Computation
  
  Volume: (accepted for publication)
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Real-Time Audio-to-Score Alignment using Particle Filter for Co-player Music Robots (2010)
- Author(s)
  Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  Hindawi Pub.
- Related Report
  2011 Final Research Report
[Journal Article] Voice awareness control for a humanoid robot consistent with its body posture and movements (2010)
- Author(s)
  Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  PALADYN Journal of Behavioral Robotics
  
  Volume: 1 Issue: 1 Pages: 80-88
- DOI
  10.2478/s13230-010-0009-x
- Related Report
  2011 Final Research Report
[Journal Article] Design and Implementation of Robot Audition System "HARK" (2010)
- Author(s)
  K. Nakadai, H. G. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino
- Journal Title
  
  Advanced Robotics
  
  Volume: 24 Issue: 5-6 Pages: 739-761
- DOI
  10.1163/016918610x493561
- Related Report
  2011 Final Research Report
[Journal Article] Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition System in Robots (2010)
- Author(s)
  Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  PALADYN Journal of Behavioral Robotics
  
  Volume: 1 Issue: 1 Pages: 37-47
- DOI
  10.2478/s13230-010-0005-1
- Related Report
  2011 Final Research Report
[Journal Article] Design and Implementation of Robot Audition System 'HARK'---Open Source Software for Listening to Three Simultaneous Speakers (2010)
- Author(s)
  Kazuhiro Nakadai, Toru Takahashi, Hiroshi G.Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino
- Journal Title
  
  Advanced Robotics
  
  Volume: Vol.24, No.5-6 Pages: 739-761
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] An Improvement in Automatic Speech Recognition Using Soft Missing Feature Masks for Robot Audition (Invited paper) (2010)
- Author(s)
  Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G.Okuno
- Journal Title
  
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, (IROS 2010)
  
  Pages: 964-969
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Speedup and Performance Improvement of ICA-based Robot Audition by Parallel and Resampling-based Block-wise Processing (Invited paper) (2010)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G.Okuno
- Journal Title
  
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, (IROS 2010)
  
  Pages: 1949-1956
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Exploiting Harmonic Structures to Improve Separating Simultaneous Speech in Under-Determined Conditions (2010)
- Author(s)
  Yasuharu Hirasawa, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G.Okuno
- Journal Title
  
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, (IROS 2010)
  
  Pages: 450-457 (Invited paper)
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Effects of modelling within- and between-frame temporal variations in power spectra on non-verbal sound recognition (2010)
- Author(s)
  Nobuhide Yamakawa, Tetsuro Kitahara, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G.Okuno
- Journal Title
  
  Proceedings of International Conference on Spoken Language Processing (Interspeech 2010)
  
  Pages: 2342-2345
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Improvement in Listening Capability for Humanoid Robot HRP-2 (2010)
- Author(s)
  Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of IEEE-RAS International Conference on Robotics and Automation 2010, (ICRA-2010)
  
  Pages: 470-475
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Upper-limit Evaluation of a Robot Audition based on ICA-BSS in Multi-source, Barge-in and Highly Reverberant Conditions (2010)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G.Okuno
- Journal Title
  
  Proceedings of IEEE-RAS International Conference on Robotics and Automation 2010, (ICRA-2010)
  
  Pages: 4366-4371
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition System in Robots (2010)
- Author(s)
  Toru Takahashi
- Journal Title
  
  PALADYN Journal of Behavioral Robotics
  
  Volume: 1 Pages: 37-47
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] 残響下でのバージイン発話認識のための多入力独立成分分析を応用したロボット聴覚 (2009)
- Author(s)
  武田龍, 中臺一博, 高橋徹, 駒谷和範, 尾形哲也, 奥乃博
- Journal Title
  
  日本ロボット学会誌
  
  Volume: 7/8 Pages: 80-90
- NAID
  10025114321
- Related Report
  2011 Final Research Report
[Journal Article] Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model (2009)
- Author(s)
  Toru Takahashi
- Journal Title
  
  Proc.of IEEE/RSJ International Conference on Intelligent Robots and Systems
  
  Pages: 2730-2735
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Presentation] Complex Infinite Sparse Factor Analysisによる周波数領域での音声信号のブラインド音源分離 (2012)
- Author(s)
  柳楽浩平
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学(愛知県)
- Year and Date
  2012-03-07
- Related Report
  2011 Annual Research Report
[Presentation] パーティクルフィルタを用いた動的環境下の複数音源追跡 (2012)
- Author(s)
  黄楊暘
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学(愛知県)
- Year and Date
  2012-03-07
- Related Report
  2011 Annual Research Report
[Presentation] 複数音源下での擬音語による音源選択システム (2012)
- Author(s)
  山村祐介
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学(愛知県)
- Year and Date
  2012-03-07
- Related Report
  2011 Annual Research Report
[Presentation] 発話中の方言変化に頑健な方言変換システム (2012)
- Author(s)
  平山直樹
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学(愛知県)
- Year and Date
  2012-03-07
- Related Report
  2011 Annual Research Report
[Presentation] アクセント特徴量を用いた歌声と朗読音声の識別システム (2012)
- Author(s)
  阿曽慎平
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学(愛知県)
- Year and Date
  2012-03-07
- Related Report
  2011 Annual Research Report
[Presentation] ロボットのためのマイクアレイによる複数話者追跡 (2012)
- Author(s)
  高橋徹
- Organizer
  京都大学ICTイノベーション2012
- Place of Presentation
  京都大学百周年時計台記念館2階国際交流ホールI, II, III
- Year and Date
  2012-02-17
- Related Report
  2011 Final Research Report
[Presentation] ロボットのためのマイクアレイによる複数話者追跡 (2012)
- Author(s)
  高橋徹
- Organizer
  京都大学ICTイノベーション2012
- Place of Presentation
  京都大学(京都府)
- Year and Date
  2012-02-17
- Related Report
  2011 Annual Research Report
[Presentation] スペクトル変化量のピーク間隔・F0・MFCCを用いた歌声と朗読音声の自動識別システム (2012)
- Author(s)
  阿曽慎平
- Organizer
  情報処理学会第94回音楽情報科学研究会
- Place of Presentation
  舘山寺温泉(静岡県)
- Year and Date
  2012-02-04
- Related Report
  2011 Annual Research Report
[Presentation] Complex Extension of Infinite Sparse Factor Analysis for Blind Speech Separation (2012)
- Author(s)
  Kohei Nagira, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of 10th International Conference on Latent Variable Analysis and Signal Separation
- Place of Presentation
  Tel-Aviv, Israel
- Related Report
  2011 Final Research Report
[Presentation] A GMM Sound Source Model for Blind Speech Separation in Under-determined Conditions (2012)
- Author(s)
  Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of 10th International Conference on Latent Variable Analysis and Signal Separation
- Place of Presentation
  Tel-Aviv, Israel
- Related Report
  2011 Final Research Report
[Presentation] Complex Infinite Sparse Factor Analysisによる周波数領域での音声信号のブラインド音源分離 (2012)
- Author(s)
  柳楽浩平, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学
- Related Report
  2011 Final Research Report
[Presentation] パーティクルフィルタを用いた動的環境下の複数音源追跡 (2012)
- Author(s)
  黄楊暘, 大塚琢馬, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学
- Related Report
  2011 Final Research Report
[Presentation] 複数音源下での擬音語による音源選択システム (2012)
- Author(s)
  山村祐介, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学
- Related Report
  2011 Final Research Report
[Presentation] 発話中の方言変化に頑健な方言変換システム (2012)
- Author(s)
  平山直樹, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第74回全国大会
- Place of Presentation
  名古屋工業大学
- Related Report
  2011 Final Research Report
[Presentation] Infinite Sparse Factor Analysisの複素拡張による音声信号のブラインド音源分離 (2011)
- Author(s)
  柳楽浩平, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  日本音響学会関西支部第14回若手研究者交流研究発表会
- Place of Presentation
  若手優秀賞・関西支部長賞
- Year and Date
  2011-12-18
- Related Report
  2011 Final Research Report
[Presentation] Infinite Sparse Factor Analysisの複素拡張による音声信号のブラインド音源分離 (2011)
- Author(s)
  柳楽浩平
- Organizer
  日本音響学会関西支部第14回若手研究者交流研究発表会
- Place of Presentation
  産業技術総合研究所(大阪府)
- Year and Date
  2011-12-18
- Related Report
  2011 Annual Research Report
[Presentation] ブラインド音源分離のためのInfinite Sparse Factor Analysisの複素拡張 (2011)
- Author(s)
  柳楽浩平, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  第34回AIチャレンジ研究会
- Place of Presentation
  慶応義塾大学
- Year and Date
  2011-12-15
- Related Report
  2011 Final Research Report
[Presentation] ブラインド音源分離のためのInfinite Sparse Factor Analysisの複素拡張 (2011)
- Author(s)
  柳楽浩平
- Organizer
  人工知能学会第34回AIチャレンジ研究会
- Place of Presentation
  慶応義塾大学(神奈川県)
- Year and Date
  2011-12-15
- Related Report
  2011 Annual Research Report
[Presentation] ノンパラメトリックベイズによる時間周波数領域における音声信号のブラインド音源分離 (2011)
- Author(s)
  柳楽浩平, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学
- Year and Date
  2011-09-09
- Related Report
  2011 Final Research Report
[Presentation] ノンパラメトリックベイズによる時間周波数領域における音声信号のブラインド音源分離 (2011)
- Author(s)
  柳楽浩平
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-09
- Related Report
  2011 Annual Research Report
[Presentation] 調波・非調波音源モデルを用いたマイク数以上の音源分離 (2011)
- Author(s)
  平澤恭治
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-09
- Related Report
  2011 Annual Research Report
[Presentation] Introduction to Open Source Robot Audition Software HARK (2011)
- Author(s)
  Kazuhiro Nakadai
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-08
- Related Report
  2011 Annual Research Report
[Presentation] 実環境下での音源定位・音源検出の検討 (2011)
- Author(s)
  高橋徹
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学(東京都)
- Year and Date
  2011-09-07
- Related Report
  2011 Annual Research Report
[Presentation] 同時発話認識ロボットの共通開発プラットフォーム (2011)
- Author(s)
  高橋徹
- Organizer
  第13回日本感性工学会大会
- Place of Presentation
  工学院大学(東京都)(招待講演)
- Year and Date
  2011-09-04
- Related Report
  2011 Annual Research Report
[Presentation] 調波・非調波音源モデルを用いたマイク数以上の音源分離 (2011)
- Author(s)
  平澤恭治, 安良岡直希, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学
- Year and Date
  2011-07-09
- Related Report
  2011 Final Research Report
[Presentation] Introduction to Open Source Robot Audition Software HARK (2011)
- Author(s)
  Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno, Nakamura Keisuke, Yoshida Takami, Mizumoto Takeshi, Otsuka Takuma, Ince Gohkan
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学
- Year and Date
  2011-07-09
- Related Report
  2011 Final Research Report
[Presentation] 実環境下での音源定位・音源検出の検討 (2011)
- Author(s)
  高橋徹, 中臺一博, 石井Carlos寿憲, Jani Even, 奥乃博
- Organizer
  日本ロボット学会第29回学術講演会
- Place of Presentation
  芝浦工業大学
- Year and Date
  2011-07-09
- Related Report
  2011 Final Research Report
[Presentation] 擬音語と環境音の音響的関係性を考慮した環境音to擬音語変換システム (2011)
- Author(s)
  山川暢英, 北原鉄朗, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  2011年度人工知能学会全国大会
- Place of Presentation
  岩手
- Year and Date
  2011-06-01
- Related Report
  2011 Final Research Report
[Presentation] 擬音語と環境音の音響的関係性を考慮した環境音to擬音語変換システム (2011)
- Author(s)
  山川暢英
- Organizer
  2011年度人工知能学会全国大会
- Place of Presentation
  岩手県立大学(岩手県)
- Year and Date
  2011-06-01
- Related Report
  2011 Annual Research Report
[Presentation] 累積頻度重みを適用したパーティクルフィルタによる実時間楽譜追従 (2011)
- Author(s)
  大塚琢馬, 中臺一博, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第73回全国大会
- Year and Date
  2011-03-04
- Related Report
  2011 Final Research Report
[Presentation] 音源数同定とブラインド音源分離を同時に行うinfinite ICA (2011)
- Author(s)
  柳楽浩平, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第73回全国大会
- Year and Date
  2011-03-04
- Related Report
  2011 Final Research Report 2010 Annual Research Report
[Presentation] L1ノルム最小化による劣決定音源分離のための線形計画と二次錐計画の比較評価 (2011)
- Author(s)
  平澤恭治, 武田龍, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第73回全国大会
- Year and Date
  2011-03-04
- Related Report
  2011 Final Research Report 2010 Annual Research Report
[Presentation] ロボット聴覚のためのMatching Pursuitによる複数環境音の同定 (2011)
- Author(s)
  山川暢英, 高橋徹, 北原鉄朗, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第73回全国大会
- Year and Date
  2011-03-04
- Related Report
  2011 Final Research Report
[Presentation] Speaker Localization Using Two-Channel Microphone on the SIG-2 Humanoid Robot (2011)
- Author(s)
  Uihyun Kim, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  情報処理学会第73回全国大会
- Year and Date
  2011-03-03
- Related Report
  2011 Final Research Report
[Presentation] Speaker Localization Using Two-Channel Microphone on the SIG-2 Humanoid Robot (2011)
- Author(s)
  Uihyun Kim, Toru Takahashi, Tetsuya Ogata, Hiroshi G.Okuno
- Organizer
  情報処理学会第73回全国大会
- Place of Presentation
  東京工業大学
- Year and Date
  2011-03-03
- Related Report
  2010 Annual Research Report
[Presentation] Time-of-flight camera based Probabilistic Polygonal Mesh mapping (2011)
- Author(s)
  Louis-Kenzo Cahier, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第73回全国大会
- Year and Date
  2011-03-02
- Related Report
  2011 Final Research Report
[Presentation] Fast and simple iterative algorithm of Lp-norm minimization for under-determined speech separation (2011)
- Author(s)
  Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of International Conference on Spoken Language Processing
- Place of Presentation
  Florence, Italy
- Related Report
  2011 Final Research Report
[Presentation] Environmental Sound Recognition for Robot Audition using Matching-pursuit (2011)
- Author(s)
  Nobuhide Yamakawa, Toru Takahashi, Tetsuro Kitahara, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceeding of the 24th International Conference on Industrial
- Place of Presentation
  Syracuse, NY
- Related Report
  2011 Final Research Report
[Presentation] Robot with Two Ears Listens to More Than Two Simultaneous Utterances by Exploiting Harmonic Structures (2011)
- Author(s)
  Yasuharu Hirasawa, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceeding of the 24th International Conference on Industrial
- Place of Presentation
  Syracuse, NY
- Related Report
  2011 Final Research Report
[Presentation] Cluster Self-organization of Known and Unknown Environmental Sounds using Recurrent Neural Network (2011)
- Author(s)
  Zhang Yang, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno, Tetsuya Ogata
- Organizer
  Proceeding of the International Conference on Artificial Neural Networks
- Place of Presentation
  Espoo, Finland
- Related Report
  2011 Final Research Report
[Presentation] Exploiting Harmonic Structures to Improve Separating Simultaneous Speech in Under-Determined Conditions (Invited paper) (2010)
- Author(s)
  Yasuharu Hirasawa, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems
- Place of Presentation
  Taiwan, Taipei
- Year and Date
  2010-10-19
- Related Report
  2011 Final Research Report
[Presentation] Two-level Synchronization using Particle Filter for Co-player Music Robots (2010)
- Author(s)
  Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ-2010 Workshop on Robots and Musical Expression
- Place of Presentation
  Taipei(CD-ROM)
- Year and Date
  2010-10-18
- Related Report
  2011 Final Research Report
[Presentation] Probabilistic polygonal mesh for 3D SLAM (2010)
- Author(s)
  Louis-Kenzo Cahier, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  日本ロボット学会第28回学術講演会
- Place of Presentation
  名古屋工業大学
- Year and Date
  2010-09-23
- Related Report
  2011 Final Research Report
[Presentation] ロボット聴覚のためのMatching-Pursuitによる環境音の分離音認識 (2010)
- Author(s)
  山川暢英, 高橋徹, 北原鉄朗, 尾形哲也, 奥乃博
- Organizer
  日本ロボット学会第28回学術講演会
- Place of Presentation
  名古屋工業大学
- Year and Date
  2010-09-22
- Related Report
  2011 Final Research Report 2010 Annual Research Report
[Presentation] Dynamic Recognition of Environmental Sounds with Recurrent Neural Network (2010)
- Author(s)
  Zhang Yang, Tetsuya Ogata, Toru Takahashi, Hiroshi G. Okuno
- Organizer
  日本ロボット学会第28回学術講演会
- Place of Presentation
  名古屋工業大学
- Year and Date
  2010-09-22
- Related Report
  2011 Final Research Report
[Presentation] リサンプル-ブロック処理と並列化に基づくICAの実時間実装 (2010)
- Author(s)
  武田龍, 中臺一博, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  日本ロボット学会第28回学術講演会
- Place of Presentation
  名古屋工業大学
- Year and Date
  2010-09-22
- Related Report
  2011 Final Research Report 2010 Annual Research Report
[Presentation] Predictive Score Following using Particle Filter for Music Robots (2010)
- Author(s)
  Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  日本ロボット学会第28回学術講演会
- Place of Presentation
  名古屋工業大学
- Year and Date
  2010-09-22
- Related Report
  2011 Final Research Report
[Presentation] ロボット聴覚オープンソースソフトウエアHARK (2010)
- Author(s)
  奥乃博, 中臺一博, 高橋徹
- Organizer
  電子情報通信学会ソサイエティ大会
- Place of Presentation
  大阪府立大学
- Year and Date
  2010-09-14
- Related Report
  2011 Final Research Report
[Presentation] ロボット聴覚オープンソースソフトウエアHARK (2010)
- Author(s)
  奥乃博, 中臺一博, 高橋徹
- Organizer
  電子情報通信学会ソサイエティ大会,依頼シンポジウムセッション,AI-1:マルチモーダル信号処理とその応用
- Place of Presentation
  大阪府立大学
- Year and Date
  2010-09-14
- Related Report
  2010 Annual Research Report
[Presentation] ロボット聴覚ソフトウエアHARKとそのロボットへの応用 (2010)
- Author(s)
  高橋徹, 中臺一博, 奥乃博
- Organizer
  平成22年度電気関係学会東海支部連合大会
- Place of Presentation
  中部大学(招待講演)
- Year and Date
  2010-08-30
- Related Report
  2010 Annual Research Report
[Presentation] Improvement in Listening Capability for Humanoid Robot HRP-2 (2010)
- Author(s)
  Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE-RAS International Conference on Robotics and Automation 2010
- Place of Presentation
  Anchorage, Alaska, USA.
- Year and Date
  2010-05-03
- Related Report
  2011 Final Research Report
[Presentation] 実環境音声認識のためのロボット聴覚システム開発とパラメータチューニング (2010)
- Author(s)
  高橋徹
- Organizer
  情報処理学会第72回全国大会
- Place of Presentation
  東京大学(東京都)
- Year and Date
  2010-03-11
- Related Report
  2009 Annual Research Report
[Presentation] フィールド情報学的アプローチによる複数発話音声認識ロボットの開発 (2010)
- Author(s)
  高橋徹
- Organizer
  京都大学ICTイノベーション
- Place of Presentation
  京都大学(京都府)
- Year and Date
  2010-02-19
- Related Report
  2009 Annual Research Report
[Presentation] Method of Discriminating Known and Unknown Environmental Sounds using Recurrent Neural Network (2010)
- Author(s)
  Yang Zhang, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno
- Organizer
  11th International Symposium on Advanced Intelligent Systems
- Place of Presentation
  Okayama, JAPAN
- Related Report
  2011 Final Research Report
[Presentation] Speedup and Performance Improvement of ICA-based Robot Audition by Parallel and Resampling-based Block-wise Processing (Invited paper) (2010)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems
- Place of Presentation
  Taiwan, Taipei
- Related Report
  2011 Final Research Report
[Presentation] An Improvement in Automatic Speech Recognition Using Soft Missing Feature Masks for Robot Audition (Invited paper) (2010)
- Author(s)
  Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems
- Place of Presentation
  Taiwan, Taipei
- Related Report
  2011 Final Research Report
[Presentation] Effects of modelling within- and between-frame temporal variations in power spectra on non-verbal sound recognition (2010)
- Author(s)
  Nobuhide Yamakawa, Tetsuro Kitahara, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of International Conference on Spoken Language Processing
- Place of Presentation
  Makuhari, Japan
- Related Report
  2011 Final Research Report
[Presentation] Analyzing User Utterances in Barge-in-able Spoken Dialogue System for Improving Identification Accuracy (2010)
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, Ryu Takeda, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of International Conference on Spoken Language Processing
- Place of Presentation
  Makuhari, Japan
- Related Report
  2011 Final Research Report
[Presentation] ロボット聴覚ソフトウエアHARKとそのロボットへの適用 (2010)
- Author(s)
  高橋徹, 中臺一博, 奥乃博
- Organizer
  電気関係東海支部連合会大会
- Related Report
  2011 Final Research Report
[Presentation] Design and Implementation of Two-level Synchronization for Interactive Music Robot (2010)
- Author(s)
  Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10)
- Place of Presentation
  USA
- Related Report
  2011 Final Research Report
[Presentation] Music-ensemble robot that is capable of playing the theremin while listening to the accompanied music (2010)
- Author(s)
  Takuma Otsuka, Takeshi Mizumoto, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of the 23rd International Conference on Industrial
- Place of Presentation
  Cordoba, Spain
- Related Report
  2011 Final Research Report
[Presentation] Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing (2010)
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of the 23rd International Conference on Industrial
- Place of Presentation
  Cordoba, Spain
- Related Report
  2011 Final Research Report
[Presentation] Upper-limit Evaluation of a Robot Audition based on ICA-BSS in Multi-source, Barge-in and Highly Reverberant Conditions (2010)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE-RAS International Conference on Robotics and Automation
- Place of Presentation
  Anchorage, Alaska, USA.
- Related Report
  2011 Final Research Report
[Presentation] Simultaneous Speech Recognition System implemented on Humanoid Robot HRP-2 (2009)
- Author(s)
  Toru Takahashi
- Organizer
  21世紀コンピューティングコンファレンス,サイエンスカフェ
- Place of Presentation
  京都大学(京都府)
- Year and Date
  2009-11-06
- Related Report
  2009 Annual Research Report
[Presentation] 頭部音響伝達関数を用いたGSSによる3話者同時発話認識～HARK 1.0.0の新機能～ (2009)
- Author(s)
  高橋徹
- Organizer
  日本ロボット学会第27回学術講演会
- Place of Presentation
  横浜国立大学(神奈川県)
- Year and Date
  2009-09-15
- Related Report
  2009 Annual Research Report
[Presentation] Automatic Estimation of Reverberation Time with Robot Speech to Improve ICA-based Robot Audition (2009)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE-RAS International Conference on Humanoid Robots
- Place of Presentation
  IEEE, Paris
- Related Report
  2011 Final Research Report
[Presentation] Voice quality manipulation for humanoid robots consistent with their head movements (2009)
- Author(s)
  Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE-RAS International Conference on Humanoid Robots
- Place of Presentation
  IEEE, Paris
- Related Report
  2011 Final Research Report
[Presentation] Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model (2009)
- Author(s)
  Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems 2009
- Place of Presentation
  USA, St. Louis
- Related Report
  2011 Final Research Report
[Presentation] Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots (2009)
- Author(s)
  Takuma Otsuka, Kazumasa Murata, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems 2009
- Place of Presentation
  USA, St. Louis
- Related Report
  2011 Final Research Report
[Presentation] Step-size Parameter Adaptation of Multi-channel Semi-blind ICA with Piecewise Linear Model for Barge-in-able Robot Audition (2009)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems 2009
- Place of Presentation
  USA, St. Louis
- Related Report
  2011 Final Research Report
[Presentation] ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition (2009)
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of 2009 International Conference on Acoustics, Speech and Signal Processing 2009
- Place of Presentation
  Taipei, Taiwan
- Related Report
  2011 Final Research Report
[Book] Lecture Notes in Computer Science, 2011, Volume 7064/2011 (2012)
- Author(s)
  Lieven De Lathauwer
- Publisher
  Springer
- Related Report
  2011 Annual Research Report
[Book] Lecture Notes in Computer Science, 2011, Volume 7064/2011 (2011)
- Author(s)
  Emdad Hossain, Girija Chetty
- Publisher
  Springer
- Related Report
  2011 Annual Research Report
[Book] Lecture Notes in Computer Science, 2011, Volume 6703/2011 (2011)
- Author(s)
  Kishan G. Mehrotra, Chilukuri K. Mohan, Jae C. Oh, Pramod K. Varshney, Moonis Ali
- Publisher
  Springer
- Related Report
  2011 Annual Research Report
[Remarks]
- URL
  http://winnie.kuis.kyoto-u.ac.jp/~tall
- Related Report
  2011 Final Research Report
[Remarks]
- URL
  http://www.ise.osaka-sandai.ac.jp/~takahashi
- Related Report
  2011 Final Research Report
[Remarks]
- URL
  http://www.ise.osaka-sandai.ac.jp/~takahashi/
- Related Report
  2011 Annual Research Report
[Remarks]
- URL
  http://winnie.kuis.kyoto-u.ac.jp/~tall/
- Related Report
  2009 Annual Research Report
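[Patent(Industrial Property Rights)] Speech Recognition Apparatus and Mask Generation Method for Speech Recognition Apparatus (2009)
- Inventor(s)
  Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno
- Industrial Property Rights Holder
  Honda Motor Co., Ltd.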
- Filing Date
  2009-08-07
- Related Report
  2011 Final Research Report
