• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2011 Fiscal Year Final Research Report

Development of Robot Audition based on Computational Auditory Scene Analysis

Research Project

  • PDF
Project/Area Number 19100003
Research Category

Grant-in-Aid for Scientific Research (S)

Allocation TypeSingle-year Grants
Research Field Perception information processing/Intelligent robotics
Research InstitutionKyoto University

Principal Investigator

OKUNO Hiroshi  京都大学, 大学院・情報学研究科, 教授 (60318201)

Co-Investigator(Kenkyū-buntansha) OGATA Tetsuya  京都大学, 大学院・情報学研究科, 准教授 (00318768)
KOMASTANI Kazunori  名古屋大学, 大学院・工学研究科, 准教授 (40362579)
TAKAHASHI Toru  京都大学, 大学院・情報学研究科, 教授 (30419494)
SHIRAMATSU Shun  名古屋工業大学, 工学研究科, 助教 (80548595)
NAKADAI Kazuhiro  東京工業大学, 情報理工研究科, 連携教授 (70436715)
KITAHARA Tetsuro  日本大学, 文理学部, 講師 (00454710)
ITOYAMA Katsutoshi  京都大学, 大学院・情報学研究科, 助教 (60614451)
Co-Investigator(Renkei-kenkyūsha) ASANO Futoshi  産業技術総合研究所, グループリーダー (00231895)
Project Period (FY) 2007 – 2011
Keywordsロボット聴覚 / 音環境理解 / マルチドメイン音声対話 / 音楽共演ロボット / バージイン発話 / 聴覚アウエアネス / アクティブオーディション / ロボットインタラクション
Research Abstract

Three main features of Computational Auditory Scene Analysis, sound source localization, sound source separation, and recognition of separated sounds, have been developed and their collections are made available as an open-sourced robot audition software called "HARK". As a proof of concepts in this robot audition, we developed "Prince Shotoku" robots that can listen to simultaneous talkers, and a spoken dialogue system that accepts a barge-in utterance of the user. We also developed various technologies to separate musical instrument parts for polyphonic performance, and real-time score following systems. These musical-related technologies are applied to make musical robots to play ensemble with human players

  • Research Products

    (36 results)

All 2012 2011 2010 2009 2008 2007 Other

All Journal Article (15 results) Presentation (7 results) Book (2 results) Remarks (10 results) Patent(Industrial Property Rights) (2 results) (of which Overseas: 2 results)

  • [Journal Article] Efficient Blind Dereverberation and Echo Cancellation based on Independent Component Analysis for Actual Acoustic Signals2012

    • Author(s)
      R. Takeda, K. Nakadai, T. Takahashi, T. Ogata, H. G. Okuno
    • Journal Title

      Neural Computation

      Volume: 24 Pages: 234-272

    • DOI

      doi:10.1162/NECO_a_00219

  • [Journal Article] musical robot that synchronizes with a co-player using non-verbal cues2012

    • Author(s)
      A. Lim, T. Mizumoto, T. Ogata, H. G. Okuno
    • Journal Title

      Advanced Robotics

      Volume: 26 Pages: 363-381

    • DOI

      doi:10.1163/156855311X614626

  • [Journal Article] Complex Extension of Infinite Sparse Factor Analysis for Blind Source Separation of Speech Signals2012

    • Author(s)
      K. Nagira, T. Takahashi, T. Ogata, H. G. Okuno
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 7191 Pages: 388-396

  • [Journal Article] A GMM Sound Source Model for Blind Speech Separation in Under-determined Conditions2012

    • Author(s)
      Y. Hirasawa, N. Yasuraoka, T. Takahashi, T. Ogata, H. G. Okuno
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 7191 Pages: 446-453

    • DOI

      doi:10.1007/978-3-642-28551-6_55

  • [Journal Article] 発語行為レベルの情報をユーザ発話の解釈に用いる音声対話システム2011

    • Author(s)
      駒谷和範, 松山匡子, 武田龍, 高橋徹, 尾形哲也, 奥乃博
    • Journal Title

      情報処理学会論文誌

      Volume: 52 Pages: 3374-3385

  • [Journal Article] Emergence of Hierarchical Structure mirroring Linguistic Composition in a Recurrent Neural Network2011

    • Author(s)
      W. Hinoshita, H. Arie, J. Tani, H. G. Okuno, T. Ogata
    • Journal Title

      Neural Networks

      Volume: 24 Pages: 311-320

    • DOI

      doi:10.1016/j.neunet.2010.12.006

  • [Journal Article] Environmental Sound Recognition for Robot Audition using Matching-pursuit2011

    • Author(s)
      Z. Yamakawa, T. Takahashi, T. Kitahara, T. Ogata, H. G. Okuno
    • Journal Title

      Lecture Notes in Artificial Intelligence

      Volume: 6704 Pages: 1-10

    • DOI

      doi:1007/978-3-642-21827-9_1

  • [Journal Article] Sound Imaging of Nocturnal Animal Calls in Their Natural Habita2011

    • Author(s)
      T. Mizumoto, I. Aihara, T. Otsuka, R. Takeda, K. Aihara, H. G. Okuno
    • Journal Title

      Journal of Comparative Physiology A

      Volume: 197 Pages: 915-921

  • [Journal Article] Design and Implementation of Robot Audition System "HARK"2010

    • Author(s)
      K. Nakadai, H. G. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino
    • Journal Title

      Advanced Robotics

      Volume: 24 Pages: 739-761

    • DOI

      doi:10.1163/016918610X493561

  • [Journal Article] Selecting Help Messages by using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems2010

    • Author(s)
      K. Komatani, Y. F., S. Ikeda, T. Ogata, H. G. Okuno
    • Journal Title

      IEICE Transactions D.

      Volume: E93-D Pages: 3359-3367

  • [Journal Article] A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval2010

    • Author(s)
      H. Fujihara, M. Goto, T. Kitahara, H. G. Okuno
    • Journal Title

      IEEE Trans. on Audio, Speech and Language Processing

      Volume: 18 Pages: 638-648

    • DOI

      doi:10.1109/TASL.2010.2041386

  • [Journal Article] 人工神経回路モデルと声道物理モデルを用いた母音模倣モデルに基づく音素獲得シミュレーション2009

    • Author(s)
      神田尚, 尾形哲也, 駒谷和範, 奥乃博
    • Journal Title

      日本ロボット学会誌

      Volume: 27 Pages: 802-813

  • [Journal Article] Human Tracking System Integrating Sound and Face Localization using EM Algorithm in Real Environments2009

    • Author(s)
      H-D. Kim, K. Komatani, T. Ogata, H. G. Okuno
    • Journal Title

      Advanced Robotics

      Volume: 23 Pages: 629-653

    • DOI

      doi:10.1163/156855309X431659

  • [Journal Article] マルチドメイン音声対話システムにおけるトピック推定と対話履歴の統合によるドメイン選択手法2009

    • Author(s)
      池田智志, 駒谷和範, 尾形哲也, 奥乃博
    • Journal Title

      情報処理学会論文誌

      Volume: 50 Pages: 488-500

  • [Journal Article] Game-Theoretic Model of Referential Coherence and Its Empirical Verification Using Large Japanese and English Corpora2008

    • Author(s)
      S. Shiramatsu, K. Komatani, K. Hasida, T. Ogata, H. G. Okuno
    • Journal Title

      ACM Trans. on Speech and Language Processing

      Volume: 5 Pages: 6

    • DOI

      doi:10.1145/1410358.1410360

  • [Presentation] Improvement of Speaker Localization by Considering Multipath Interference of Sound Wave for Binaural Robot Audition2011

    • Author(s)
      E-H. Kim, T. Muzumoto, T. Ogata, H. G. Okuno
    • Organizer
      Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems(IROS)
    • Place of Presentation
      San Francisco
    • Year and Date
      20110900
  • [Presentation] Bayesian Extension of MUSIC for Sound Source Localization and Tracking2011

    • Author(s)
      T. Otsuka, K. Nakadai, T. Ogata, H. G. Okuno
    • Organizer
      Proc. of International Conf
    • Place of Presentation
      Spoken Language Processing
    • Year and Date
      20110000
  • [Presentation] Design and Implementation of Selectable Sound Separation on a Texai Telepresence System using HARK2011

    • Author(s)
      T. Mizumoto, T. Yoshida, K. Nakadai, R. Takeda, T. Otsuka, T. Takahashi, H. G. Okuno
    • Organizer
      Proc. of IEEE-RAS International Conference on Robotics and Automation
    • Place of Presentation
      Shanghai
    • Year and Date
      20110000
  • [Presentation] Robot Musical Accompaniment : Integrating Audio and Visual Cues for Real-time Synchronization with a Human Flutist(Invited paper)2010

    • Author(s)
      A. Lim, T. Mizumoto, L-K Cahier, T. Otsuka, T. Takahashi, K. Komatani, T. Ogata, H. G. Okuno
    • Organizer
      Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems(IROS)
    • Place of Presentation
      Taipei
    • Year and Date
      20100000
  • [Presentation] Design and Implementation of Two-level Synchronization for Interactive Music Robot2010

    • Author(s)
      T. Otsuka, K. Nakadai, T. Takahashi, K. Komatani, T. Ogata, H. G. Okuno
    • Organizer
      Proc. of the 24^<th> AAAI Conf. on Artificial Intelligence
    • Place of Presentation
      Atlanta
    • Year and Date
      20100000
  • [Presentation] Changing Timbre and Phrase in Existing Musical Performances as You Like2009

    • Author(s)
      N. Yasuraoka, T. Abe, K. Itoyama, K. Yoshii, K. Komatani, T. Ogata, H. G. Okuno
    • Organizer
      Proc. of ACM Multimedia
    • Place of Presentation
      Beijing
    • Year and Date
      20090000
  • [Presentation] A Robot Listens to Music and Counts Its Beats Aloud by Separating Music from Counting Voice2008

    • Author(s)
      T. Mizumoto, R. Takeda, K. Yoshii, K. Komatani, T. Ogata, H. G. Okuno
    • Organizer
      Proc. of IEEE/RSJ IROS-2008
    • Year and Date
      20080000
  • [Book] ロボット聴覚,日本ロボット学会編『ロボットテクノロジー』2011

    • Author(s)
      奥乃博
    • Total Pages
      304
    • Publisher
      オーム社
  • [Book] Modelling Machine Emotions for Realizing Intelligence : Foundations and Applications, Smart Innovation, Systems and Technologies Series2007

    • Author(s)
      H. G. Okuno, M. Ali
    • Total Pages
      1194
    • Publisher
      Springer
  • [Remarks] オープンソースソフトウエア1)ロボット聴覚オープンソースソフトウエアHARK, V. 0. 0. 7 : 2008年4月, V. 1. 0. 0 : 2009年11月, V. 1. 1. 0 2012年2月公開. 2010年5月から1年間に15, 000超のダウンロード

    • URL

      http://winnie.kuis.kyoto-u.ac.jp/HARK

  • [Remarks] 受賞1)奥乃博: IEEE Fellow,ロボット聴覚技術への貢献, IEEE, Jan. 2012

  • [Remarks] 2) A. Lim, T. Mizumoto, L-K Cahier, T. Otsuka, T. Takahashi, K. Komatani, T. Ogata, H. G. Okuno : NTF Award for Entertainment Robots and Systems, IEEE/RSJ, Oct. 2010

  • [Remarks] 3) T. Otsuka, T. Mizumoto, K. Nakadai, T. Takahashi, K. Komatani, T. Ogata, H. G. Okuno : Best Paper Award, IEA/AIE, 2010.報道発表

  • [Remarks] 報道発表1)「複数の音を聞き分ける聖徳太子のようなロボットが登場!」,世の中進歩堂, BS Japan, 2011年2月4日

  • [Remarks] 2)日経エレクトロニクス,五感センサ,聴覚「聖徳太子の耳をすべての機器に」, pp. 75-77, 2008年2月25日号.日経BP社

  • [Remarks] 3)「聖徳太子ロボットの未来」,日経サイエンス, 2007年10月号.

  • [Remarks] 4)"Playing It by Ear-A machine-listening system that understands three speakers atonce", Scientific American, Aug. 2007, p. 28

  • [Remarks] ホームページ等

    • URL

      http://winnie.kuis.kyoto-u.ac.jp/HARK/

  • [Remarks]

    • URL

      http://winnie.kuis.kyoto-u.ac.jp/SIG/

  • [Patent(Industrial Property Rights)] 音声認識装置及び音声認識方法2010

    • Inventor(s)
      中臺一博,高橋徹,奥乃博
    • Industrial Property Rights Holder
      本田技研工業株式会社
    • Industrial Property Number
      特許出願、特願2011-53124号
    • Filing Date
      2010-03-10
    • Overseas
  • [Patent(Industrial Property Rights)] 音声認識装置2010

    • Inventor(s)
      中臺一博,辻野広司,奥乃博,山本
    • Industrial Property Rights Holder
      本田技研工業株式会社
    • Industrial Property Number
      特許、特許第4516527号
    • Acquisition Date
      2010-05-21
    • Overseas

URL: 

Published: 2013-07-31   Modified: 2017-10-12  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi