2007 Fiscal Year Annual Research Report

音響信号記号変換に基づいたセマンティックインタラクション

Research Project

Project/Area Number	19024042
Research Institution	Kyoto University
Principal Investigator	奥乃博 Kyoto University, 情報学研究科, 教授 (60318201)
Keywords	ロボット聴覚 / 音響信号記号変換 / ブラインド音源分離 / 発達論的コミュニケーション / ミッシングフィーチャ理論 / 聖徳太子コンピュータ / HARK / 音環境可視化技術
Research Abstract	(1)「音を聞き分ける」音の量的爆発促進技術: 使用環境,設置条件に制約がなく使えるように,環境に関する事前知識量を極力減らした実時間ロボット聴覚ソフトウエア「HARK」を開発した.HARKは,FlowDesignerというミドルウエアの上に,音源定位,音源分離,ミッシングフィーチャマスク自動生成,ミッシングフィーチヤ理論に基づく音声認識を組み込んだシステムである.体型が全く異なる3体のロボットで異なる配置のマイクロフォンアレイやサラウンドマイクロフォンを入力機器とし,ノートPC上で稼動する.3人同時の料理注文を受けるロボット,口によるジャンケンの審判をするロボットのデモを開発し,HARKの移植性の高さを実証した.IEEE/RSJ IROS-2006 Best Paper Finalistを受賞した. (2)「音を見せる」音の質的複雑さ軽減技術: HARKとJava3Dによるviewerとをネットワークで接続して音を音環境可視化システムを開発した.GUIは,"Overview first, zoom and filter, then details on demand"という方針で設計した.Overview firstでは音の到来方向を実時間で表示し,Zoom and filterでは指定された範囲にある方向の音だけを提示し,Details on demandでは特定の音だけを分離し提示,あるいは音声認識結果を提示することができる.さらに,聴覚障害者の情報保障のために,HMDに音源到来方向と音声認識結果を表示するシステムに応用した. (3)音楽を聞き分ける技術として,市販CD音楽を聞き分け,ビートを認識し,次のビートを予想してステップを踏むロボットの開発も行い,実時間で動くことが確認できた.自分の耳で音楽を聞き分けるロボットは世界初である.

Research Products
(41 results)

All 2008 2007 Other

All Journal Article (16 results) (of which Peer Reviewed: 10 results) Presentation (19 results) Book (1 results) Remarks (1 results) Patent(Industrial Property Rights) (4 results) (of which Overseas: 1 results)

[Journal Article] 楽譜情報を援用した多重奏音楽音響信号の音源分離と調波・非調波統合モデルの制約付パラメータ推定の同時実現2008
- Author(s)
  糸山克寿, 後藤真孝, 駒谷和範, 尾形哲也、奥乃博
- Journal Title
  
  情報処理学会論文誌 Vol.49, No.3
  
  Pages: 1465-1479
- Peer Reviewed
[Journal Article] Hybrid Collaborative and Content-based Music Recommendation Using Incrementally-trainable Probabilistic Generative Model2008
- Author(s)
  Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  IEEE Transactions on Audio, Speech and Language Processing Vol.16, No.2
  
  Pages: 435-447
- Peer Reviewed
[Journal Article] ロボットによる人の聴覚処理への構成的アプローチ2008
- Author(s)
  奥乃博
- Journal Title
  
  情報処理 Vol.49,No.1
  
  Pages: 15-23
[Journal Article] 移動型および静止型マイクロホンアレイ統合による複数移動音源追跡2007
- Author(s)
  中臺一博, 中島弘史, 村瀬昌満, 奥乃博, 長谷川雄二, 辻野広司
- Journal Title
  
  日本ロボット学会誌 Vol.25, No.6
  
  Pages: 181-191
- Peer Reviewed
[Journal Article] Robust Recognition of Simultaneous Speech By a Mobile Robot2007
- Author(s)
  Jean-Marc Valin, Shun' ichi Yamamoto, J. Rouat, F. Michaud, K. Nakadai, H. G. Okuno
- Journal Title
  
  IEEE Transactions on Robotics, Vol.23, No.4
  
  Pages: 742-752
- Peer Reviewed
[Journal Article] Statistical machine translation using hierarchical phrase alignmen2007
- Author(s)
  Taro Watanabe, Kenji Imamura, Eiichiro Sumita, Hiroshi G. Okuno
- Journal Title
  
  Systems and Computers in Japan, Vol.21, No.10
  
  Pages: 70-79
- Peer Reviewed
[Journal Article] マルチドメイン音声対話システムにおける対話履歴を利用したドメイン選択2007
- Author(s)
  神田直之, 駒谷和範, 中野幹生, 中臺一博, 辻野広司, 尾形哲也, 奥乃博
- Journal Title
  
  情報処理学会論文誌 Vol.48, No.5
  
  Pages: 1980-1989
- Peer Reviewed
[Journal Article] Bus Information System Based on User Models and Dynamic Generation of VoiceXML Scripts2007
- Author(s)
  Shinichi Ueno, Fumihiro Adachi, Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno
- Journal Title
  
  Lecture Notes in Artificial Intelligence 3609
  
  Pages: 45-60
- Peer Reviewed
[Journal Article] Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR2007
- Author(s)
  Ryu Takeda, Shun' ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  Lecture Notes in Artificial Intelligence 4570
  
  Pages: 384-394
- Peer Reviewed
[Journal Article] Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm and Particle Filter2007
- Author(s)
  Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  Lecture Notes in Artificial Intelligence 4570
  
  Pages: 45-60
- Peer Reviewed
[Journal Article] Meaning Games2007
- Author(s)
  Koiti Hasida, Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Journal Title
  
  Lecture Notes in Computer Science 4914
  
  Pages: 228-241
- Peer Reviewed
[Journal Article] ロボット聴覚のための情報統合の現状と課題2007
- Author(s)
  奥乃博, 溝口博
- Journal Title
  
  計測と制御 Vol.46, No.6
  
  Pages: 415-419
[Journal Article] ミッシングフィーチャ理論に基づく音声認識を利用した複数話者同時発話認識2007
- Author(s)
  山本俊一, 武田龍, 奥乃博
- Journal Title
  
  計測と制御 Vol.46,No.6
  
  Pages: 447-452
[Journal Article] ロボット聴覚のための情報統合の現状と課題(韓国語訳)2007
- Author(s)
  奥乃博, 溝口博
- Journal Title
  
  自動化技術(韓国語) Vol.24,No.3
  
  Pages: 60-63
[Journal Article] 楽曲の特徴量抽出と検索技術2007
- Author(s)
  奥乃博, 北原鉄朗, 吉井和佳
- Journal Title
  
  電気学会誌 Vol.127,No.7
  
  Pages: 417-420
[Journal Article] 音環境理解コンピューティング2007
- Author(s)
  奥乃博, 山本俊一
- Journal Title
  
  人工知能学会誌 Vol.22,No.6
  
  Pages: 846-854
[Presentation] 新近性効果の減数曲線を加味した顕現性計算手法に基づく話題遷移の可視化2008
- Author(s)
  白松俊, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  言語処理学会第14回年次大会
- Place of Presentation
  東京大学
- Year and Date
  2008-03-24
[Presentation] 音楽と自分の声を聞き分けながらビートに合わせて発声するロボットの開発2008
- Author(s)
  水本武志, 武田龍, 吉井和佳, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第70回全国大会, 2X-8
- Place of Presentation
  筑波大学
- Year and Date
  2008-03-13
[Presentation] ベース音高を考慮したポピュラー音楽に対する和音進行認識2008
- Author(s)
  須見康平, 糸山克寿, 吉井和佳, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第70回全国大会, 2X-5
- Place of Presentation
  筑波大学
- Year and Date
  2008-03-13
[Presentation] 複数楽器個体による事前分布を用いた調波・非調波統合モデルのパラメータ推定2008
- Author(s)
  糸山克寿, 後藤真孝, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第70回全国大会, 2X-6
- Place of Presentation
  筑波大学
- Year and Date
  2008-03-13
[Presentation] 楽器固有の音響的特徴を考慮した楽器音の音高操作手法2008
- Author(s)
  安部武宏, 糸山克寿, 吉井和佳, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第70回全国大会, 2X-7
- Place of Presentation
  筑波大学
- Year and Date
  2008-03-13
[Presentation] 音源定位結果と音声認識結果をHMDに統合呈示する聴覚障害者向け音環境理解支援システム2008
- Author(s)
  徳田浩一, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第70回全国大会, 5ZD-7
- Place of Presentation
  筑波大学
- Year and Date
  2008-03-13
[Presentation] 顔の動作に追従したGUIインタフェースを持つ音環境可視化システム2008
- Author(s)
  久保田祐史, 吉田雅敏, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第70回全国大会, 2ZL-5
- Place of Presentation
  筑波大学
- Year and Date
  2008-03-13
[Presentation] ロボット聴覚システムを用いた口じゃんけん判定ロボット2007
- Author(s)
  中臺一博, 山本俊一, 奥乃博, 中島弘史, 長谷川雄二, 辻野広司
- Organizer
  第8回システムインテグレーション部門講演会(SI2007)
- Place of Presentation
  広島国際大学国際教育センター
- Year and Date
  2007-12-19
[Presentation] ロボットによるビートトラッキングにおける周期性自己発生音の影響評価2007
- Author(s)
  村田和真, 吉井和佳, 奥乃博, 鳥井豊隆, 中臺一博, 長谷川雄二
- Organizer
  第8回システムインテグレーション部門講演会(SI2007), 3K4-4
- Place of Presentation
  広島国際大学国際教育センター
- Year and Date
  2007-12-19
[Presentation] Design and Implementation of A Robot Audition System for Automatic Speech Recognition of Simultaneous Speech2007
- Author(s)
  Shun'ichi Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J-MValin, K. Komatani, T. Ogata, H.G. Okuno
- Organizer
  Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-2007), 111-116
- Place of Presentation
  Kyoto, Japan
- Year and Date
  2007-12-10
[Presentation] Auditory and Visual Integration based Localization and Tracking of Humans in Daily-life Environments2007
- Author(s)
  Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), 2021-2027
- Place of Presentation
  San Diego, CA, USA
- Year and Date
  2007-10-31
[Presentation] Exploiting Known Sound Sources to Improve ICA-based Robot Audition in Speech Separation and Recognition2007
- Author(s)
  Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007)1757-1762
- Place of Presentation
  San Diego, CA, USA
- Year and Date
  2007-10-31
[Presentation] A Biped Robot that Keeps Steps in Time with Musical Beats while Listening to Music with Its Own Ears2007
- Author(s)
  Kazuyoshi Yoshii, K. Nakadai, T. Torii, Y. Hasegawa, H. Tsujino, K. Komatani, T. Ogata, H.G. Okuno
- Organizer
  Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007)1743-1750
- Place of Presentation
  San Diego, CA, USA
- Year and Date
  2007-10-31
[Presentation] Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences2007
- Author(s)
  Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of 8th International Conference on Musical Information Retreival (ISMIR-2007), 89-94
- Place of Presentation
  Vienna, Austria
- Year and Date
  2007-10-16
[Presentation] 独立成分分析に基づく適応フィルタのロボット聴覚への応用2007
- Author(s)
  武田龍, 中臺一博, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  日本ロボット学会第25回大会, 1N6
- Place of Presentation
  千葉工業大学
- Year and Date
  2007-09-13
[Presentation] Auditory and VIsual Integration based Localization and Tracking of Multiple Moving Sounds in Daily-life Environments2007
- Author(s)
  Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of International Workshop on Robot and Human Interaction (Ro-Man 2007) 399-404
- Place of Presentation
  Jeju Island, Korea
- Year and Date
  2007-08-29
[Presentation] 音色特徴量分布の利用による調波・非調波併用モデルのパラメータ推定2007
- Author(s)
  糸山克寿, 後藤真孝, 駒谷和範, 尾形哲也, 奥乃博
- Organizer
  音楽情報処理研究会, 2007-MUS-71, Vol.2007, No.71, pp.161-166, 情報処理学会
- Place of Presentation
  ロワジールホテル長崎
- Year and Date
  2007-08-01
[Presentation] INTEGRATION AND ADAPTATION OF HARMONIC AND INHARMONIC MODELS FOR SEPARATING POLYPHONIC2007
- Author(s)
  Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of 2007 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2007)
- Place of Presentation
  Honolulu, Hawaii, USA
- Year and Date
  2007-04-17
[Presentation] Distance Estimation of Hidden Objects Based on Acoustical Holography by applying Acoustic Diffraction of Audible Sound2007
- Author(s)
  Haruhiko Niwa, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Proceedings of IEEE-RAS International Conference on Robots and Automation (ICRA-2007)423-428
- Place of Presentation
  Rome, Italy
- Year and Date
  2007-04-11
[Book] New Trends in Applied Artificial Intelligence (IEA/AIE-2007)2007
- Author(s)
  Hiroshi G. Okuno, Moonis Ali (Eds.)
- Total Pages
  1194
- Publisher
  Springer-Verlag
[Remarks]
- URL
  http://winnie.kuis.kyoto-u.ac.jp/
[Patent(Industrial Property Rights)] 音源分離システム, 音源分離プログラム及び音源分離装置2007
- Inventor(s)
  糸山克寿, 奥乃博, 後藤真孝
- Industrial Property Rights Holder
  京都大学, 産業技術総合研究所
- Industrial Property Number
  特願2007-106576号
- Filing Date
  2007-04-19
[Patent(Industrial Property Rights)] Robot acoustic device and robot acoustic system2007
- Inventor(s)
  K. Nakadai, H. Okuno, H. Kitano
- Industrial Property Rights Holder
  Japan Science and Technology
- Industrial Property Number
  Patent No. US 7, 215, 786
- Acquisition Date
  2007-05-08
- Overseas
[Patent(Industrial Property Rights)] 楽曲推薦システム、楽曲推薦方法及び楽曲推薦用プログラム2007
- Inventor(s)
  後藤真孝, 吉井和佳, 奥乃博
- Industrial Property Rights Holder
  産業技術総合研究所
- Industrial Property Number
  特願2007-199936号
- Filing Date
  2007-07-31
[Patent(Industrial Property Rights)] 音楽音響信号と歌詞の時間的対応付けを自動で行うシステム及び方法2007
- Inventor(s)
  藤原弘将, 奥乃博, 後藤真孝
- Industrial Property Rights Holder
  京都大学, 産業技術総合研究所
- Industrial Property Number
  特願2007-233682号
- Filing Date
  2007-09-10

2007 Fiscal Year Annual Research Report

音響信号記号変換に基づいたセマンティックインタラクション

Principal Investigator

奥乃 博 Kyoto University, 情報学研究科, 教授 (60318201)

Research Products

[Journal Article] 楽譜情報を援用した多重奏音楽音響信号の音源分離と調波・非調波統合モデルの制約付パラメータ推定の同時実現2008

Author(s)

Journal Title

[Journal Article] Hybrid Collaborative and Content-based Music Recommendation Using Incrementally-trainable Probabilistic Generative Model2008

Author(s)

Journal Title

[Journal Article] ロボットによる人の聴覚処理への構成的アプローチ2008

Author(s)

Journal Title

[Journal Article] 移動型および静止型マイクロホンアレイ統合による複数移動音源追跡2007

Author(s)

Journal Title

[Journal Article] Robust Recognition of Simultaneous Speech By a Mobile Robot2007

Author(s)

Journal Title

[Journal Article] Statistical machine translation using hierarchical phrase alignmen2007

Author(s)

Journal Title

[Journal Article] マルチドメイン音声対話システムにおける対話履歴を利用したドメイン選択2007

Author(s)

Journal Title

[Journal Article] Bus Information System Based on User Models and Dynamic Generation of VoiceXML Scripts2007

Author(s)

Journal Title

[Journal Article] Evaluation of Two Simultaneous Continous Speech Recognition with ICA BSS and MTF-based ASR2007

Author(s)

Journal Title

[Journal Article] Real-Time Auditory and Visual Talker Tracking through integrating EM algorithm and Particle Filter2007

Author(s)

Journal Title

[Journal Article] Meaning Games2007

Author(s)

Journal Title

[Journal Article] ロボット聴覚のための情報統合の現状と課題2007

Author(s)

Journal Title

[Journal Article] ミッシングフィーチャ理論に基づく音声認識を利用した複数話者同時発話認識2007

Author(s)

Journal Title

[Journal Article] ロボット聴覚のための情報統合の現状と課題(韓国語訳)2007

Author(s)

Journal Title

[Journal Article] 楽曲の特徴量抽出と検索技術2007

Author(s)

Journal Title

[Journal Article] 音環境理解コンピューティング2007

Author(s)

Journal Title

[Presentation] 新近性効果の減数曲線を加味した顕現性計算手法に基づく話題遷移の可視化2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 音楽と自分の声を聞き分けながらビートに合わせて発声するロボットの開発2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] ベース音高を考慮したポピュラー音楽に対する和音進行認識2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 複数楽器個体による事前分布を用いた調波・非調波統合モデルのパラメータ推定2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 楽器固有の音響的特徴を考慮した楽器音の音高操作手法2008

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 音源定位結果と音声認識結果をHMDに統合呈示する聴覚障害者向け音環境理解支援システム2008

Author(s)

奥乃博 Kyoto University, 情報学研究科, 教授 (60318201)