2008 年度実績報告書

音環境理解研究からのロボット聴覚の構築

研究課題

研究課題/領域番号	19100003
研究機関	京都大学
研究代表者	奥乃博京都大学, 情報学研究科, 教授 (60318201)
研究分担者	尾形哲也京都大学, 情報学研究科, 准教授 (00318768) 駒谷和範京都大学, 情報学研究科, 助教 (40362579) 高橋徹京都大学, 情報学研究科, 助教 (30419494)
キーワード	ロボット聴覚 / 音環境理解 / 身体性 / ロボットインタラクション / アクティブオーディション / 聴覚アウエアネス / マルチドメイン音声対話 / バージン発話
研究概要	平成20年度は要素技術の洗練化と公開に取り組んだ. (1) 実時間ロボット聴覚ソフトウエアHARKの機能拡張:特徴量信頼度を連続値で表現するソフトマスク自動生成に取り組み,音声認識率が約10%向上.また,システムの発話中にユーザが割り込み発話を行うバージイン発話認識のために独立成分解析によるセミブラインド分離を開発.2種類の音楽ロボットに応用し,ロボットが歌っても音楽だけを聞き分ける機能を実現.2件の論文がIEEE/RSJ IROS-2008 Award for Entertainment Robots and Systems Nomination Finalistの4件に選出.さらに,HARKを応用した音環境可視化システムにより聴覚アウエアネス(音の気付き)の改善手法を考案し,実装. (2) HARKのオープンソース化と講習会の実施:京都大学と韓国KISTで無料講習会を開催.ロボット聴覚特別セッションをIROS-2008で主宰.信号処理国際会議ICASSP-2009にも提案採択. (3) アクティブオーディションをSIG2上で開発:2本のマイクロフォンによる音源定位で不可避な前後問題の曖昧性解消のために,ロボットの首の動作を設計.首の動きが最初に斜め下に動かし,その後横に動かす方が,いきなり横に動かすよりも性能が改善.人も同様の動作をすることが知られており,ロボットでの有効性を確認. (4) ロボットの経験に基づいた物体ダイナミクスの予測:RNNPBにより学習した物体のダイナミクスのモデルを通じて,未知物体であっても,ロボットの動作によりその物体がどのように動くかを予測する技術基盤を確立. (5) マルチドメイン音声対話システムの高性能化:どのドメインからも受理されない想定外発話からのユーザ意図推定法とそれに基づいたヘルプ生成法を開発し,その有効性を確認.

研究成果
(42件)

すべて 2009 2008 その他

すべて雑誌論文 (15件) (うち査読あり 14件) 学会発表 (19件) 備考 (1件) 産業財産権 (7件) (うち外国 3件)

[雑誌論文] 音色特徴の歪みを回避した楽器音の音高・音長操作手法2009
- 著者名/発表者名
  安部武宏, 糸山克寿, 吉井和佳, 駒谷和範, 尾形哲也, 奥乃博
- 雑誌名
  
  情報処理学会論文誌 Vol. 50, No. 3
  
  ページ: 1054-1066
- 査読あり
[雑誌論文] マルチドメイン音声対話システムにおけるトピック推定と対話履歴の統合によるドメイン選択手法2009
- 著者名/発表者名
  池田智志, 駒谷和範, 尾形哲也, 奥乃博
- 雑誌名
  
  情報処理学会論文誌 Vol. 50, No. 2
  
  ページ: 488-500
- 査読あり
[雑誌論文] Game-Theoretic Model of Referential Coherence and Its Empirical Verification Usine Large Jacanese and English Cornora2009
- 著者名/発表者名
  Shun Shiramatsu, Kazunori Komatani, Koiti Hasida, Tetsuva Ogata Hiroshi G. Okuno
- 雑誌名
  
  ACM Transactions on Speech and Language Processing Vol. 5, No. 3
  
  ページ: Article 6
- 査読あり
[雑誌論文] 分析時刻に依存しない周期信号のパワースペクトル推定法を用いた音声分析2009
- 著者名/発表者名
  森勢将雅, 高橋徹, 河原英紀, 入野俊夫
- 雑誌名
  
  電子情報通信学会論文誌A Vol. J92-A, No. 3
  
  ページ: 163-171
- 査読あり
[雑誌論文] 歌声の統計的モデル化とビタビ探索を用いた多重奏中のボーカルパートに対する音高推定手法2008
- 著者名/発表者名
  藤原弘将, 後藤真孝, 奥乃博
- 雑誌名
  
  情報処理学会論文誌 Vol. 49, No. 10
  
  ページ: 3682-3693
- 査読あり
[雑誌論文] Managing Out-of-Grammar Utterances by Topic Estimation with Domain Extensibility in Multi Domain Spoken2008
- 著者名/発表者名
  Kazunori Komatani, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno
- 雑誌名
  
  Speech Communcation Vol. 50, No. 10
  
  ページ: 836-870
- 査読あり
[雑誌論文] 独立成分分析に基づく適応フィルタのロボット聴覚への応用2008
- 著者名/発表者名
  武田龍, 中毫一博, 駒谷和範, 尾形哲也, 奥乃博
- 雑誌名
  
  日本ロボット学会誌 Vol. 26, No. 6
  
  ページ: 529-536
- 査読あり
[雑誌論文] 音声対話システムにおけるラピッドプロトタイピングを指向したWFSTに基づく言語理解2008
- 著者名/発表者名
  福林雄一朗, 駒谷和範, 中野幹生, 船越孝太郎, 辻野広司, 尾形哲也, 奥乃博
- 雑誌名
  
  情報処理学会論文誌 Vol. 49, No. 8
  
  ページ: 2762-2772
- 査読あり
[雑誌論文] Predicting Object Dynamics from Visual Images through Active Sensing Experiences2008
- 著者名/発表者名
  Shun Nishide, Tetsuya Ogata, J. Tani, Kazunori Komatani, Hiroshi G Okuno
- 雑誌名
  
  Advanced Robotics Vol. 22, No. 5
  
  ページ: 527-546
- 査読あり
[雑誌論文] A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals2008
- 著者名/発表者名
  Hiroshi G Okuno, Shun'ichi Yamamoto, Kazuhiro Nakadai, J-M Valin, K. i Komatani, T. Ogata
- 雑誌名
  
  ournal of Acoustic Society of America Vol. 123, No. 5
  
  ページ: 3066-3067
- 査読あり
[雑誌論文] SalienceGraph : Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA2008
- 著者名/発表者名
  Shun Shiramatsu, Kazunori Kofflatani, Tetsuya Ogata, Hiroshi G. Okuno
- 雑誌名
  
  PRICAI-2008 : Trends in Artificial Intelligence LNCS Vol. 5351
  
  ページ: 890-902
- 査読あり
[雑誌論文] 多数の人の声を一度に聞き分ける聴覚センサ2008
- 著者名/発表者名
  奥乃博
- 雑誌名
  
  日経エレクトロニクス 2008年9月22日号
  
  ページ: 115-123
[雑誌論文] Integrating Topic Estimation and Dialogue HDomain Selection in Multi-Domain Spoken Dialogue Systems2008
- 著者名/発表者名
  Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 雑誌名
  
  New Frontiers in Applied Artificial Intelligence LNAI Vol. 5027
  
  ページ: 294-304
- 査読あり
[雑誌論文] Vowel Imitation using Vocal Tract Model and Recurrent Neural Network2008
- 著者名/発表者名
  Hisashi Kanda, Tetsuya Ogata, Kazunori Komalani, Hiroshi G. Okuno
- 雑誌名
  
  Neural Information Processing LKCS Vol. 4985
  
  ページ: 222-232
- 査読あり
[雑誌論文] Motion Emergence from Sound using Cross-Modal Mapping on Recurrent Neural Network2008
- 著者名/発表者名
  Tetsuya Ogata, Hiroshi G. Okuno
- 雑誌名
  
  IEEE Intelligent System Vol. 23, No. 2
  
  ページ: 74-84
- 査読あり
[学会発表] Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking2008
- 著者名/発表者名
  Yuji Kubota, Masatoshi Yoshida, Kazunori Komatani, Tetsuva Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of IEEE International Symposium on Multimedia (ISM08)
- 発表場所
  Berkeley, U.S.A
- 年月日
  2008-12-16
[学会発表] 3D Auditory Scene Visualizer With Face Tracking : Designand Implementation For Auditory Awareness Compensation2008
- 著者名/発表者名
  Yuji Kubota, Shun Shiramatsu, Kazunori Komatani, Tetsuva Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of 2nd International Symposium on Universal Communication (ISUC2008
- 発表場所
  Osaka, Japan
- 年月日
  2008-12-15
[学会発表] A Beat-Tracking Robot for Human-Robot Interaction and Its Evaluation2008
- 著者名/発表者名
  Kazumasa Murata, Kazuhiro Nakadai, Ryu Takeda, Hiroshi G. Okuno, T. Torii, Y. Hasegawa, H, Tsujino
- 学会等名
  Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008)
- 発表場所
  Daejeon, Korea
- 年月日
  2008-12-03
[学会発表] Computational Auditory Scene Analysis and Its Application to Robot Audition (Invited Talk)2008
- 著者名/発表者名
  Hiroshi G. Okuno
- 学会等名
  Proceedings of the Second International Symposiumon Robotics and Artificial Intelligence
- 発表場所
  Chofu, Japan
- 年月日
  2008-10-09
[学会発表] A Robot Listens to Music and Counts Its Beats Aloud by Separating Music from Counting Voice2008
- 著者名/発表者名
  Takeshi Mizumoto, Ryu Takeda, K. Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008)
- 発表場所
  Nice, France
- 年月日
  2008-09-24
[学会発表] Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition System in Robots2008
- 著者名/発表者名
  Toru Takahashi. Shun'ichi Yamamoto Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of International Conference on Spoken Language Processing (Interspeech-2008)
- 発表場所
  Brisbane, Australia
- 年月日
  2008-09-24
[学会発表] Predicting ASR Errors by Exploiting Barge-In Rate of Individual Users for Spoken Dialogue Systems2008
- 著者名/発表者名
  Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno
- 学会等名
  Proceedings of International Conference on Spoken Language Processing (Interspeech-2008)
- 発表場所
  Brisbane, Australia
- 年月日
  2008-09-24
[学会発表] Expanding Vocabulary for Recognizing User's Abbreviations of Proper Nouns without Increasing ASR Error Rates in Spoken Dialogue Systems2008
- 著者名/発表者名
  Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of International Conference on Spoken Language Processing (Interspeech-2008)
- 発表場所
  Brisbane, Australia
- 年月日
  2008-09-24
[学会発表] Extensibility Verification of Robust Domain Selection against Out-of-Grammar Utterances in Multi-Domain Spoken Dialogue System2008
- 著者名/発表者名
  Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of International Conference on Spoken Language Processing (Interspeech-2008)
- 発表場所
  Brisbane, Australia
- 年月日
  2008-09-24
[学会発表] Target Speech Detection and Separation for Humanoid Robot in Sparse Dialogue with Noisy Home Environinents2008
- 著者名/発表者名
  Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008)
- 発表場所
  Nice, France
- 年月日
  2008-09-24
[学会発表] Segmenting Acoustic Signal with Articulatory Movement using Recurrent Neural Network for Phoneme Acquisition2008
- 著者名/発表者名
  Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno
- 学会等名
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008)
- 発表場所
  Nice, France
- 年月日
  2008-09-24
[学会発表] Barge-in-able Robot Audition Based on ICA and Missing Feature Theory under Semi-Blind Situation2008
- 著者名/発表者名
  Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008)
- 発表場所
  Nice, France
- 年月日
  2008-09-24
[学会発表] A Robot Uses Its Own Microphone to Synchronize Its Stepsto Musical Beats While Scatting and Singing2008
- 著者名/発表者名
  K. Murata, K. Nakadai, K. Yoshii, R. Takeda, T. Torii, Hiroshi G. Okuno, Y. Hasegawa, H. Tsujino
- 学会等名
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2008)
- 発表場所
  Nice, France
- 年月日
  2008-09-24
[学会発表] Active Ssensing based Dynamical Object Feature Extraction2008
- 著者名/発表者名
  Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno
- 学会等名
  Proceedings of IEEE/RSJ Internalional Conference on Inteligent Robots and Systems (IROS-2008)
- 発表場所
  Nice, France
- 年月日
  2008-09-23
[学会発表] Analysis of Reliable Predictability based Mo tion Generation using RNNP2008
- 著者名/発表者名
  Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno
- 学会等名
  Proc. of Joint 4th International Conf. on Soft Computing and Intelligent Systems and 9th International Symposium on advanced Intelligent Systems (SCIS & ISIS 2008)
- 発表場所
  Nagoya, Japan
- 年月日
  2008-09-18
[学会発表] Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch2008
- 著者名/発表者名
  Kohei Sumi, Katsutoshi Itoyama, K. Yoshii, Kazunori Komatani, T. Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of 9th International Conference on Musical Information Retreival (ISMIR-2008)
- 発表場所
  Philadelphia, U.S.A
- 年月日
  2008-09-15
[学会発表] Instrument Equalizer for Query-by-Example Retrieval : Improving Sound Source Separation based on Integrated Harmonic and Inharmonic Models2008
- 著者名/発表者名
  Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- 学会等名
  Proceedings of 9th International Conference on Musical Information Retreival (ISMIR-2008)
- 発表場所
  Philadelphia, U.S.A
- 年月日
  2008-09-15
[学会発表] A Robot Referee for Rock-Paper-Scissors Sound Games2008
- 著者名/発表者名
  Kazuliiro Nakadai, Shun'ichi Yamamoto, Hiroshi G. Okuno, H. Nakajima, Y. Hasegawa, H. Tsujino
- 学会等名
  Proceedings of IEEE-RAS International Conference on Robotics and Automation (ICRA-2008)
- 発表場所
  Pasadena, U.S.A
- 年月日
  2008-05-20
[学会発表] COMPUTATIONAL AUDITORY SCENE ANALYSIS AND ITS APPLICATION TO ROBOT AUDITION2008
- 著者名/発表者名
  Hiroshi G. Okuno, Kazuhiro Nakadai
- 学会等名
  Proceedings of Hands-free Speech Communication and Microphone Arrays (HSCMA-2008)
- 発表場所
  Trino, Italy
- 年月日
  2008-05-07
[備考]
- URL
  http://winnie.kuis.kyoto-u.ac.jp/
[産業財産権] 音楽音響信号の音色変更システム2009
- 発明者名
  安部武宏, 糸山克寿, 奥乃博
- 権利者名
  国立大学法人京都大学
- 産業財産権番号
  特願2009-34664号
- 出願年月日
  2009-02-17
[産業財産権] 文単位検索方法, 文単位検索装置, コンピュータプログラム, 記憶媒体, 及び文書記憶装置2008
- 発明者名
  白松俊, 駒谷和範, 奥乃博
- 権利者名
  国立大学法人京都大学
- 産業財産権番号
  PCT/JP2007/055448
- 出願年月日
  2008-12-12
- 外国
[産業財産権] 音源分離システム2008
- 発明者名
  武田龍, 中田一博, 辻野広司, 奥乃博
- 権利者名
  本田技研工業株式会社
- 産業財産権番号
  特願2008-191382号
- 出願年月日
  2008-07-24
[産業財産権] 音源分離システム, 音源分離方法及び音源分離用コンピュータプログラム2008
- 発明者名
  糸山克寿, 奥乃博, 後藤真孝
- 権利者名
  京都大学, 産業技術総合研究所
- 産業財産権番号
  PCT/JP2008/05731
- 出願年月日
  2008-04-14
- 外国
[産業財産権] SalienceGraph (議事録閲覧システム)2008
- 発明者名
  白松俊, 奥乃博
- 権利者名
  国立大学法人京都大学
- 産業財産権番号
  京都大学デジタルコンテンツC32
- 取得年月日
  2008-08-01
[産業財産権] Robot Audition Software HARK2008
- 発明者名
  Shun'chi Yamamoto, hiroshi G., Okuno Kazuhiro, Nakadai Hirofumi, Nakashima Hiroshi Tsujino
- 権利者名
  京都大学, 本田技研工業
- 産業財産権番号
  オープンソースソフトウエア
- 取得年月日
  2008-05-01
- 外国
[産業財産権] 音声認識装置2008
- 発明者名
  中毫一鳳辻野広司, 奥乃1専, 山本俊一
- 権利者名
  本田技研工業株式会社
- 産業財産権番号
  特許第4157581号
- 取得年月日
  2008-07-18

2008 年度 実績報告書

音環境理解研究からのロボット聴覚の構築

研究代表者

奥乃 博 京都大学, 情報学研究科, 教授 (60318201)

研究成果

[雑誌論文] 音色特徴の歪みを回避した楽器音の音高・音長操作手法2009

著者名/発表者名

雑誌名

[雑誌論文] マルチドメイン音声対話システムにおけるトピック推定と対話履歴の統合によるドメイン選択手法2009

著者名/発表者名

雑誌名

[雑誌論文] Game-Theoretic Model of Referential Coherence and Its Empirical Verification Usine Large Jacanese and English Cornora2009

著者名/発表者名

雑誌名

[雑誌論文] 分析時刻に依存しない周期信号のパワースペクトル推定法を用いた音声分析2009

著者名/発表者名

雑誌名

[雑誌論文] 歌声の統計的モデル化とビタビ探索を用いた多重奏中のボーカルパートに対する音高推定手法2008

著者名/発表者名

雑誌名

[雑誌論文] Managing Out-of-Grammar Utterances by Topic Estimation with Domain Extensibility in Multi Domain Spoken2008

著者名/発表者名

雑誌名

[雑誌論文] 独立成分分析に基づく適応フィルタのロボット聴覚への応用2008

著者名/発表者名

雑誌名

[雑誌論文] 音声対話システムにおけるラピッドプロトタイピングを指向したWFSTに基づく言語理解2008

著者名/発表者名

雑誌名

[雑誌論文] Predicting Object Dynamics from Visual Images through Active Sensing Experiences2008

著者名/発表者名

雑誌名

[雑誌論文] A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals2008

著者名/発表者名

雑誌名

[雑誌論文] SalienceGraph : Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA2008

著者名/発表者名

雑誌名

[雑誌論文] 多数の人の声を一度に聞き分ける聴覚センサ2008

著者名/発表者名

雑誌名

[雑誌論文] Integrating Topic Estimation and Dialogue HDomain Selection in Multi-Domain Spoken Dialogue Systems2008

著者名/発表者名

雑誌名

[雑誌論文] Vowel Imitation using Vocal Tract Model and Recurrent Neural Network2008

著者名/発表者名

雑誌名

[雑誌論文] Motion Emergence from Sound using Cross-Modal Mapping on Recurrent Neural Network2008

著者名/発表者名

雑誌名

[学会発表] Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking2008

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] 3D Auditory Scene Visualizer With Face Tracking : Designand Implementation For Auditory Awareness Compensation2008

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] A Beat-Tracking Robot for Human-Robot Interaction and Its Evaluation2008

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] Computational Auditory Scene Analysis and Its Application to Robot Audition (Invited Talk)2008

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] A Robot Listens to Music and Counts Its Beats Aloud by Separating Music from Counting Voice2008

著者名/発表者名

学会等名

発表場所

年月日

[学会発表] Soft Missing-Feature Mask Generation for Simultaneous Speech Recognition System in Robots2008

著者名/発表者名

学会等名

発表場所

年月日

2008 年度実績報告書

奥乃博京都大学, 情報学研究科, 教授 (60318201)