2014 Fiscal Year Annual Research Report

Multifaceted Deployment of Robot Audition toward Real-World Environment Understanding (ロボット聴覚の実環境理解に向けた多面的展開)

Research Project

Project/Area Number	24220006
Research Institution	Waseda University
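Principal Investigator	Hiroshi G. Okuno (奥乃博), Waseda University, Faculty of Science and Engineering, Professor (fixed-term) (60318201)
Co-Investigator(Kenkyū-buntansha)
  Kazuyoshi Yoshii (吉井和佳), Kyoto University, Graduate School of Informatics, Lecturer (20510001)
  Katsutoshi Itoyama (糸山克寿), Kyoto University, Graduate School of Informatics, Assistant Professor (60614451)
  Kazuhiro Nakadai (中臺一博), Tokyo Institute of Technology, Graduate School of Information Science and Engineering, Adjunct Professor (70436715)
  Makoto Kumon (公文誠), Kumamoto University, Graduate School of Science and Technology, Associate Professor (70332864)
  Satoshi Kagami (加賀美聡), National Institute of Advanced Industrial Science and Technology (AIST), Digital Human Research Center, Deputy Director (30344196) [Withdrawn]
  Yoko Sasaki (佐々木洋子), National Institute of Advanced Industrial Science and Technology (AIST), Digital Human Research Center, Researcher (00574013)
  Satoshi Tadokoro (田所諭), Tohoku University, Graduate School of Information Sciences, Professor (40171730) [Withdrawn]
  Masashi Konyo (昆陽雅司), Tohoku University, Graduate School of Information Sciences, Associate Professor (20400301)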
Project Period (FY)	2012-05-31 – 2017-03-31
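Keywords	Robot audition / Sound environment understanding / AV-SLAM / Prince Shotoku robot (聖徳太子ロボット) / Sound source localization for UAVs / Hose-shaped rescue robot / Extreme sound environment understanding / Outdoor sound environment understanding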
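Outline of Annual Research Achievements	Building on last year's development of component technologies, this fiscal year focused on system integration and evaluation.
  WP1 [Development of the robot audition software HARK 2.1.1]: ① HARK provides 2 algorithms for sound source localization, 11 for sound source separation, and an ego-noise suppression function. A free tutorial was held at Waseda University in November and a hackathon at the same university in December. ② Developed a binaural hearing module and a 3D sound source localization method based on transfer function interpolation. ③ Developed an interactive support system for tuning HARK parameters. ④ Developed a robot quizmaster as an application of HARK.
  WP2 [Extension from indoor to outdoor]: ⑤ Developed an autonomous mobile robot for exhibitions and collected large-scale real-environment data at two events. Confirmed the effectiveness of classification and identification based on a nested infinite Gaussian mixture model. ⑥ Developed 3D map-building technology and released multiple-moving-object tracking software.
  WP3 [Extension to general sounds]: ⑦ Bayesian nonparametric microphone array processing (BNP-MAP) enables simultaneous sound source localization and separation in environments with an unbounded number of sound sources (event venue recordings, frog choruses, birds calling to one another). Acoustic events were overlaid on a people-tracking visualization at the event venue, so that the movement of people and the state of speakers are displayed together. ⑧ Developed iGSVD-MUSIC-CMS, a highly noise-robust sound source localization method for UAVs, and demonstrated detection of speech sources from distances of 12 m to 14 m. ⑨ Developed Frog Firefly 2 (カエルホタル2), whose response band is split, and succeeded in observing choruses of two frog species in a real field. ⑩ Developed a DNN-based AV-ASR system and confirmed the effectiveness of DNNs. ⑪ Modeled and verified emotion in both directions, recognition and generation.
  WP4 [Extension to extreme environments]: ⑫ Developed a 3D sound source localization method for a cord-shaped (hose-shaped) robot and an RPCA-based speech enhancement method. ⑬ Developed a UAV for outdoor sound recording and a sound-environment visualization system for UAVs. ⑭ Developed modeling for 3D printers in a functional language.
  Research meetings were organized and run at RSJ, JSAI, and other venues, and a public demonstration was held at AIST.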
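Current Status of Research Progress	1: Research has progressed more than originally planned.
Reason	[Points where progress exceeded the plan]
  ① The open-source robot audition software HARK now provides 2 algorithms for sound source localization and 11 for sound source separation, making it a well-rounded research tool. In this term alone it was downloaded just under 30,000 times from Japan and abroad. Received the JSAI Achievement Award (人工知能学会業績賞).
  ② A trilinear interpolation method for transfer functions greatly reduced the prior measurements needed for 3D sound source localization. Received a best paper award.
  ③ Applications developed by third-party users reached field testing: HARKBird (Nagoya University) and sound source localization (HAL Tokyo, Robotics Department).
  ④ Simultaneous sound source localization and separation by Bayesian nonparametric microphone array processing (BNP-MAP) was found to work in real environments as well. BNP-MAP makes localization and separation possible with minimal prior knowledge in situations that the online algorithms provided by HARK cannot handle. Crowd, frog, and wild-bird recordings were in fact analyzed successfully.
  ⑤ Developed iGSVD-MUSIC-CMS, a highly noise-robust sound source localization method for UAVs. It incrementally suppresses rotor noise, wind noise, and similar noise, and uses the CMS method to deal with over-suppression of noise. Detection of speech uttered from about 12 m to 14 m away was demonstrated.
  ⑥ Developed quizmaster robots, broadly divided into a classroom type and a competitive type (競り型), demonstrating the usefulness of HARK.
  ⑦ Devised SIRE as a model usable in both directions, emotion recognition and emotion generation, and realized and verified Multimodal Emotional Intelligence (MEI) on top of it.
  ⑧ Produced many results: top-level journal papers, international conference papers, book chapters, review articles, and patents.
  ⑨ Robot audition in extreme environments was adopted as the "extreme acoustics" (極限音響) theme of the ImPACT "Tough Robotics Challenge".
  ⑩ "Robot audition" has begun to gain recognition in Japan and abroad; "robot audition" has been adopted as a keyword since IROS-2014.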
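Strategy for Future Research Activity	The fourth year will address system evaluation that integrates the WP results, together with the reconstruction of individual component technologies. Two co-investigators will be added to strengthen the extension to natural environments. The plan for each WP is as follows.
  WP1 [Development of the robot audition software HARK]: ① develop a high-speed wireless transmission scheme for RASP-ZX; ② turn HARK-Binaural into an embedded system; ③ move HARK to the cloud; ④ promote HARK by supporting commercially available microphone arrays and holding tutorials and hackathons for the general public.
  WP2 [Extension from indoor to outdoor]: ⑤ refine the sound-map construction method based on AV information integration and the AV-SLAM method, and transfer the technology to Kumamoto University; ⑥ achieve flight that follows a sound source based on source position information; ⑦ develop a multi-class sound source identification method for UAV recordings; ⑧ study sound source search methods using various UAVs (hovering type, kite type).
  WP3 [Extension to general sounds]: ⑨ develop a Bayesian nonparametric method for recognizing animal calls in real recordings, and DNN-based acoustic event recognition and sound-environment visualization; ⑩ evaluate BNP-MAP on real data and improve its performance and speed.
  WP4 [Extension to real and extreme environments]: ⑪ improve posture estimation, sound source localization, separation, and speech enhancement for the hose-shaped robot while it is moving; ⑫ develop methods for presenting to operators the AV information obtained from UAVs and hose-shaped robots; ⑬ evaluate and refine HARKFrog and HARKBird in real environments; ⑭ develop a sound recording device for RC gliders (quiet, fast-moving); ⑮ study sound-map construction and sound search systems based on cooperation between groups of ground robots and UAVs.
  Cross-WP [System integration]: ⑯ using the medium- and long-term data that the WP2 robot collected while driving autonomously in large exhibition venues and similar sites, evaluate the component technologies developed in WP1 and visualize the resulting sound environment with the technology developed in WP4. The sound-map construction method will also be applied to outdoor self-propelled robots and used for cooperation with UAVs.
  In addition, research meetings will be organized and run at RSJ, JSAI, and other venues, and outreach activities such as public demonstrations will be carried out.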
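Remarks	Tadokoro-Konyo Laboratory, Graduate School of Information Sciences, Tohoku University: http://www.rm.is.tohoku.ac.jp
  Major awards: (1) Okuno: JSAI Achievement Award (人工知能学会業績賞); (2) Nakamura et al.: Advanced Robotics Best Paper Award; (3) Bando: SSRR-2014 Best Student Paper Award; (4) Aihara et al.: JSAI SIG Research Award (人工知能学会研究会優秀賞); (5) Kojima et al.: SI2014 Outstanding Presentation Award (優秀講演賞); (6) six conference student encouragement awards (大会学生奨励賞), plus two other awards.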

Research Products
(89 results)

All Journal Article (44 results) (of which Peer Reviewed: 40 results, Acknowledgement Compliant: 38 results, Open Access: 37 results) Presentation (37 results) (of which Invited: 2 results) Book (3 results) Remarks (5 results)

[Journal Article] Preferential Training of Neuro-Dynamical Model Based on Predictability of Target Dynamics2015
- Author(s)
  Shun Nishide, Harumitsu Nobuta, Hiroshi G. Okuno, Tetsuya Ogata
- Journal Title
  
  Advanced Robotics
  
  Volume: 29 Pages: 587-596
- DOI
  10.1080/01691864.2015.1031279
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Bayesian Audio-to-Score Alignment Based on Joint Inference of Timbre, Volume, Tempo, and Performer-Dependent Note Onset Timings2015
- Author(s)
  Akira Maezawa, Hiroshi G. Okuno
- Journal Title
  
  Computer Music Journal
  
  Volume: 39 Pages: 74-87
- DOI
  10.1162/COMJ_A_00286
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] 角度ベース複数仮説を用いたLRFによる複数種類・複数個の移動体追跡手法2015
- Author(s)
  畑尾直孝, 鮫島一平, 加賀美聡
- Journal Title
  
  計測自動制御学会論文集
  
  Volume: 51 Pages: in press
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Scheme による3D図形の構成的制作2015
- Author(s)
  古川孝太郎，糸山克寿，吉井和佳，奥乃博
- Journal Title
  
  コンピュータソフトウエア
  
  Volume: TBD Pages: TBD
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] A Recipe for Empathy: Integrating the mirror system, insula, somatosensory cortex and motherese2015
- Author(s)
  Angelica Lim, Hiroshi G. Okuno
- Journal Title
  
  International Journal of Social Robotics
  
  Volume: 7 Pages: 35-49
- DOI
  10.1007/s12369-014-0262-y
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Automatic Speech Recognition for Mixed Dialect Utterances by Mixing Dialect Language Models2015
- Author(s)
  Naoki Hirayama, Koichiro Yoshino, Katsutoshi Itoyama, Shunsuke Mori, Hiroshi G. Okuno
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech and Language Processing
  
  Volume: 23 Pages: 372-382
- DOI
  10.1109/TASLP.2014.2387414
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Audio-Visual Speech Recognition using Deep Learning2015
- Author(s)
  Kuniaki Noda, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno, Tetsuya Ogata
- Journal Title
  
  Applied Intelligence
  
  Volume: 42 Pages: 722-737
- DOI
  10.1007/s10489-014-0629-7
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Posture Estimation of Hose-shaped Robot by using Active Microphone Array2015
- Author(s)
  Yoshiaki Bando, Takuma Otsuka, Kazuhiro Nakadai, Satoshi Tadokoro, Masashi Konyo, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Advanced Robotics
  
  Volume: 29 Pages: 35-49
- DOI
  10.1080/01691864.2015.1031279
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Improved Sound Source Localization in Horizontal Plane for Binaural Robot Audition2015
- Author(s)
  Ui-Hyun Kim, Kazuhiro Nakadai, Hiroshi G. Okuno
- Journal Title
  
  Applied Intelligence
  
  Volume: 42 Pages: 63-74
- DOI
  10.1007/s10489-014-0544-y
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Developing Robot Emotions through Interaction with Caregivers2015
- Author(s)
  Angelica Lim, Hiroshi G. Okuno
- Journal Title
  
  Synthesizing Human Emotion in Intelligent Systems and Robotics
  
  Volume: 1 Pages: 316-337
- DOI
  10.4018/978-1-4666-7278-9.ch015
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] CHALLENGES IN DEPLOYING A MICROPHONE ARRAY TO LOCALIZE AND SEPARATE SOUND SOURCES IN REAL AUDITORY SCENES2015
- Author(s)
  Yoshiaki Bando, Takuma Otsuka, Katsutoshi Itoyama, Kazuyoshi Yoshii, Yoko Sasaki, Satoshi Kagami, Hiroshi G. Okuno
- Journal Title
  
  Proc. of 2015 Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP 2015)
  
  Volume: 1 Pages: 723-727
- DOI
  10.1109/ICASSP.2015.xxx not decided yet
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] ROBOT AUDITION: ITS RISE AND PERSPECTIVES2015
- Author(s)
  Hiroshi G. Okuno, Kazuhiro Nakadai
- Journal Title
  
  Proc. of 2015 Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP 2015)
  
  Volume: 1 Pages: 5610-5614
- DOI
  10.1109/ICASSP.2015.xxx not decided yet
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Singing Voice Analysis and Editing based on Mutually Dependent F0 Estimation and Source Separation2015
- Author(s)
  Yukara Ikemiya, Katsutoshi Itoyama, Kazuyoshi Yoshii
- Journal Title
  
  Proc. of 2015 Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP 2015)
  
  Volume: 1 Pages: in print
- DOI
  10.1109/ICASSP.2015.xxx not decided yet
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Recognition of In-field Frog Chorusing using Bayesian Nonparametric Microphone Array Processing2015
- Author(s)
  Yoshiaki Bando, Takuma Otsuka, Ikkyu Aihara, Hiromitsu Awano, Katsutoshi Itoyama, Kazuyoshi Yoshii, Hiroshi G. Okuno
- Journal Title
  
  Technical Report of AAAI-2015 Workshop on Computational Sustainability
  
  Volume: 1 Pages: 1-6
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] 統計的音響信号処理の新展開2015
- Author(s)
  吉井和佳, 糸山克寿
- Journal Title
  
  映像情報メディア学会誌
  
  Volume: 69 Pages: 111-116
- Open Access
[Journal Article] Multichannel Sound Source Dereverberation and Separation for Arbitrary Number of Sources based on Bayesian Nonparametrics2014
- Author(s)
  Takuma Otsuka, Katsutoshi Ishiguro, Takuya Yoshioka, Hiroshi Sawada, Hiroshi G. Okuno
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech and Language Processing
  
  Volume: 22 Pages: 2218-2232
- DOI
  10.1109/TASLP.2014.2363790
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes and interpretation2014
- Author(s)
  Akira Maezawa, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech and Language Processing
  
  Volume: 22 Pages: 1918-1930
- DOI
  10.1109/TASLP.2014.2355772
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] 擬似生成した複数方言言語モデル混合による混合方言音声認識2014
- Author(s)
  平山直樹, 吉野幸一郎, 糸山克寿, 森信介, 奥乃博
- Journal Title
  
  情報処理学会論文誌
  
  Volume: 55 Pages: 1681-1930
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] The MEI Robot: Towards Using Motherese to Develop Multimodal Emotional Intelligence2014
- Author(s)
  Angelica Lim, Hiroshi G. Okuno
- Journal Title
  
  IEEE Transactions on Autonomous Mental Development
  
  Volume: 6 Pages: 126-138
- DOI
  10.1109/TAMD.2014.2317513
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Transferring Vocal Expression of F0 Contour using Singing Voice Synthesizer2014
- Author(s)
  Yukara Ikemiya, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Lecture Notes in Computer Science
  
  Volume: 8482 Pages: 250-259
- DOI
  10.1007/978-3-319-07467-2_27
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Development of a Robot Quizmaster with Auditory Functions for Speech-based Multiparty Interaction2014
- Author(s)
  Izaya Nishimuta, Kazuyoshi Yoshii, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of 2014 IEEE/SICE International Symposium on System Integration (SII 2014)
  
  Volume: 1 Pages: 328-333
- DOI
  10.1109/SII.2014.7028059
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] A Method of Localisation and Multi-Layered 2D Mapping using Selective Update for Particle Filter2014
- Author(s)
  Yuma Nihei, Takuro Egawa, Ippei Samejima, Naotaka Hatao, Simon Thompson, Satoshi Kagami, Hiroshi Takemura, Hiroshi Mizoguchi
- Journal Title
  
  Proc. of IEEE/RSJ International Conference on Robotics and Biomimetics (ROBIO-2014)
  
  Volume: 1 Pages: 1398-1405
- DOI
  10.1109/ROBIO.2014.7090529
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] A Robot Quizmaster that can Localize, Separate, and Recognize Simultaneous Utterances for a Fastest-Voice-First Quiz Game2014
- Author(s)
  Izaya Nishimuta, Kazuyoshi Yoshii, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of IEEE-RAS International Conference on Humanoid Robots (Humanoids 2014)
  
  Volume: 1967-962 Pages: 967-972
- DOI
  10.1109/HUMANOIDS.2014.7041480
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Applying Intrinsic motivation for Visuomotor Learning of Robot Arm Motion2014
- Author(s)
  Shun Nishide, Harumitsu Nobuta, Hiroshi G. Okuno, Tetsuya Ogata
- Journal Title
  
  Proceedings of the 11th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI2014)
  
  Volume: 1 Pages: 364-367
- DOI
  10.1109/URAI.2014.7057370
- Peer Reviewed / Open Access
[Journal Article] A Sound-based Online Method for Estimating the Time-Varying Posture of a Hose-shaped Robot2014
- Author(s)
  Yoshiaki Bando, Katsutoshi Itoyama, Satoshi Tadokoro, Masashi Konyo, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of the 12th IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR-2014)
  
  Volume: 1 Pages: 1-6
- DOI
  10.1109/SSRR.2014.7017665
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] A microphone array configuration for an auditory quadrotor helicopter system2014
- Author(s)
  Takihiro Ishiki, Makoto Kumon
- Journal Title
  
  Proceedings of the 12th IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR-2014)
  
  Volume: 1 Pages: 1-6
- DOI
  10.1109/SSRR.2014.7017653
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] BAYESIAN AUDIO ALIGNMENT BASED ON A UNIFIED GENERATIVE MODEL OF MUSIC COMPOSITION AND PERFORMANCE2014
- Author(s)
  Akira Maezawa, Katsutoshi Itoyama, Kazuyoshi Yoshii, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of 2014 International Symposium on Music Information Retrieval (ISMIR 2014)
  
  Volume: 1 Pages: 233-238
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Optimal path planning method with attitude constraints for quadrotor helicopters2014
- Author(s)
  T. Hirata, Makoto Kumon
- Journal Title
  
  Proceedings of 2014 International Conference on Advanced Mechatronic Systems (ICAMechS)
  
  Volume: 1 Pages: 377-381
- DOI
  10.1109/ICAMechS.2014.6911574
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Development of automatic bird-species recognition system from birdsongs in tropical forests2014
- Author(s)
  Akira Maruyama, Motoko S. Fujita, Katsutoshi Itoyama, Hiroshi G. Okuno, Mamoru Kanzaki
- Journal Title
  
  Proceedings of the 26th International Ornithological Congress (IOC2014)
  
  Volume: 1 Pages: 14
- Acknowledgement Compliant
[Journal Article] Parameter Estimation of Virtual Musical Instrumental Synthesizers2014
- Author(s)
  Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of 2014 Joint Conference on 40th International Computer Music Conference (ICMC) and 11th Sound and Music Computing Conference (SMC) (ICMC|SMC 2014)
  
  Volume: 1 Pages: 1-6
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Sound Annotation Tool for Multidirectional Sounds based on spatial information extracted by HARK robot audition software2014
- Author(s)
  Osamu Sugiyama, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2014)
  
  Volume: 1 Pages: 2335-2340
- DOI
  10.1109/SMC.2014.6974275
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Object Identification by 3D LIDAR Using Nested Infinite Gaussian Mixture Model2014
- Author(s)
  Shogo Tsurusaki, Yoko Sasaki, Satoshi Kagami, Hiroshi Mizoguchi
- Journal Title
  
  Proceedings of IEEE International Conference on Systems, Man, and Cybernetics (SMC 2014)
  
  Volume: 1 Pages: 2391-2396
- DOI
  10.1109/SMC.2014.6974279
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Improvement in Outdoor Sound Source Detection Using a Quadrotor-Embedded Microphone Array2014
- Author(s)
  Takuma Ohata, Keisuke Nakamura, Takashi Mizumoto, Taiki Tezuka, Kazuhiro Nakadai
- Journal Title
  
  2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2014)
  
  Volume: 1 Pages: 1902-1907
- DOI
  10.1109/IROS.2014.6942814
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Visualization of auditory awareness based on sound source positions estimated by depth sensor and microphone array2014
- Author(s)
  Takahiro Iyama, Osamu Sugiyama, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2014)
  
  Volume: 1 Pages: 1908-1913
- DOI
  10.1109/IROS.2014.6942814
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Making a Robot Dance to Diverse Musical Genre in Noisy Environments2014
- Author(s)
  Joao Lobato Oliveira, Keisuke Nakamura, Thibault Langlois, Fabien Gouyon, Kazuhiro Nakadai, Angelica Lim, Luis Paulo Reis, Hiroshi G. Okuno
- Journal Title
  
  2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2014)
  
  Volume: 1 Pages: 1896-1901
- DOI
  10.1109/IROS.2014.6942812
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Classification and Identification of Robot Sensing Data Based on Nested Infinite GMM2014
- Author(s)
  Yoko Sasaki, Naotaka Hatao, Shogo Tsurusaki, Satoshi Kagami
- Journal Title
  
  2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2014)
  
  Volume: 1 Pages: 3162-3167
- DOI
  10.1109/IROS.2014.6943000
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Lipreading using Convolutional Neural Network2014
- Author(s)
  Kuniaki Noda, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno, Tetsuya Ogata
- Journal Title
  
  Proceedings of 2014 International Conference on Spoken Language Processing (Interspeech 2014)
  
  Volume: 1 Pages: 1149-1153
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Insertion of Pause in Drawing from Babbling for Robot's Developmental Imitation Learning2014
- Author(s)
  Shun Nishide, Keita Mochizuki, Hiroshi G. Okuno, Tetsuya Ogata
- Journal Title
  
  Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA 2014)
  
  Volume: 1 Pages: 4785-4791
- DOI
  10.1109/ICRA.2014.6907559
- Peer Reviewed / Open Access
[Journal Article] Ego-motion Noise Suppression for Robots Based on Semi-Blind Infinite Non-negative Matrix Factorization2014
- Author(s)
  Taiki Tezuka, Takami Yoshida, Kazuhiro Nakadai
- Journal Title
  
  Proceedings of 2014 IEEE International Conference on Robotics and Automation (ICRA 2014)
  
  Volume: 1 Pages: 6293-6298
- DOI
  10.1109/ICRA.2014.6907787
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Audio Part Mixture Alignment Based on Hierarchical Nonparametric Bayesian Model of Musical Audio Sequence Collection2014
- Author(s)
  Akira Maezawa, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of 2014 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)
  
  Volume: 1 Pages: 5249-5253
- DOI
  10.1109/ICASSP.2014.6854597
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Automatic Transcription of Guitar Tablature from Audio Signals in Accordance with Player's Proficiency2014
- Author(s)
  Kazuki Yazawa, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of 2014 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)
  
  Volume: 1 Pages: 3146-3150
- DOI
  10.1109/ICASSP.2014.6854175
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Transcribing Vocal Expression from Polyphonic Music2014
- Author(s)
  Yukara Ikemiya, Katsutoshi Itoyama, Hiroshi G. Okuno
- Journal Title
  
  Proceedings of 2014 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)
  
  Volume: 1 Pages: 3151-3155
- DOI
  10.1109/ICASSP.2014.6854176
- Peer Reviewed / Open Access
[Journal Article] 研究の水平展開2014
- Author(s)
  奥乃博
- Journal Title
  
  コンピュータソフトウエア
  
  Volume: 31 Pages: 1-1
[Journal Article] マイクロホンアレイのオンライン校正とそのロボット聴覚システムへの応用2014
- Author(s)
  中臺一博, 中村圭佑
- Journal Title
  
  日本音響学会誌
  
  Volume: 70 Pages: 397-402
[Presentation] タイ熱帯林における鳥類の自動音声認識による多様性調査法の開発2015
- Author(s)
  丸山晃央, 藤田素子, 奥乃博, 糸山克寿, Dome Pratumthong, Taksin Artchawacom, 神崎護
- Organizer
  第62回生態学会大会
- Place of Presentation
  鹿児島大学，鹿児島
- Year and Date
  2015-03-18 – 2015-03-22
[Presentation] 分散型マイクロホンアレイを用いた音源分離のための複数移動ロボットの配置最適化2015
- Author(s)
  関口航平, 坂東宣昭, 糸山克寿, 吉井和佳
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 歌声・伴奏音・打楽器音分離に基づく音楽演奏支援システム2015
- Author(s)
  土橋彩香, 池宮由楽, 糸山克寿, 吉井和佳
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 柔軟索状レスキューロボットのためのロバスト主成分分析を用いた走行雑音抑圧2015
- Author(s)
  坂東宣昭, 池宮由楽, 糸山克寿, 昆陽雅司, 田所諭, 中臺一博, 吉井和佳, 奥乃博
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 音楽音響信号に対する相補的な歌声分離と音高推定2015
- Author(s)
  池宮由楽, 糸山克寿, 吉井和佳
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] ダンス共演ロボットのためのマルチモーダルビートトラッキング2015
- Author(s)
  大喜多美里, 坂東宣昭, 池宮由楽, 糸山克寿, 吉井和佳
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 2混合音に対する音源分離の不確実性を考慮した同時発話音声認識2015
- Author(s)
  板倉光佑, 西牟田勇哉, 坂東宜昭, 糸山克寿, 吉井和佳
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 雑音下での音源定位・音源分離に与える伝達関数測定法の影響の評価2015
- Author(s)
  赤堀渉，増田太郎，奥乃博, 森島繁生
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 聴覚アウェアネスの可視化のための深度センサとマイクロフォンアレイを用いた物体認識と音イベント検出2015
- Author(s)
  井山貴裕, 杉山治，坂東宣昭, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] プログラミング基礎教育のための図形言語の3D拡張2015
- Author(s)
  古川孝太郎, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] ユーザの技術に合わせた自動編曲機能をもつピアノ演奏練習システム2015
- Author(s)
  福田翼, 池宮由楽, 糸山克寿, 吉井和佳
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 早言いクイズ司会者ロボットの開発と評価2015
- Author(s)
  西牟田勇哉, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  情報処理学会第77回全国大会
- Place of Presentation
  京都大学，京都
- Year and Date
  2015-03-17 – 2015-03-19
[Presentation] 市販楽曲中の歌声の分離と音高推定に基づく歌唱表現編集システム2015
- Author(s)
  池宮由楽, 糸山克寿, 吉井和佳
- Organizer
  インタラクション2015
- Place of Presentation
  日本未来館，東京
- Year and Date
  2015-03-05 – 2015-03-07
[Presentation] ロボット聴覚オープンソフトウェアHARKの紹介2014
- Author(s)
  中臺一博, 奥乃博
- Organizer
  第15回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  東京ビッグサイト，東京
- Year and Date
  2014-12-14 – 2014-12-16
- Invited
[Presentation] クアドロコプタ搭載マイクロホンアレイを用いた深層学習による音声識別2014
- Author(s)
  上村知史, 杉山治, 小島諒介, 大畑琢磨, 中臺一博
- Organizer
  第15回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  東京ビッグサイト，東京
- Year and Date
  2014-12-14 – 2014-12-16
[Presentation] ユーザとの対話に基づきブログを自動生成するブログロボットの提案2014
- Author(s)
  杉山治, 小島諒介, 中臺一博
- Organizer
  第15回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  東京ビッグサイト，東京
- Year and Date
  2014-12-14 – 2014-12-16
[Presentation] 調理認識を対象としたレシピの確率モデルとその学習2014
- Author(s)
  小島諒介, 杉山治, 中臺一博
- Organizer
  第15回計測自動制御学会システムインテグレーション部門講演会
- Place of Presentation
  東京ビッグサイト，東京
- Year and Date
  2014-12-14 – 2014-12-16
[Presentation] 深度センサとマイクロフォンを用いた聴覚アウェアネスの提示2014
- Author(s)
  井山貴裕, 杉山治, 坂東宣昭, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  人工知能学会第39回 AI チャレンジ研究会
- Place of Presentation
  慶應義塾大学，神奈川
- Year and Date
  2014-11-21 – 2014-11-21
[Presentation] 相関行列スケーリングを用いた屋外音源探索手法の解析2014
- Author(s)
  大畑琢磨, 長峰諒英, 中村圭佑, 石崎孝幸, 水本武志, 中臺一博
- Organizer
  人工知能学会第39回 AI チャレンジ研究会
- Place of Presentation
  慶應義塾大学，神奈川
- Year and Date
  2014-11-21 – 2014-11-21
[Presentation] 屋外音環境理解における音源検出の性能評価と可視化2014
- Author(s)
  長峰諒英, 大畑琢磨, 上村知史, 小島諒介, 杉山治, 中村圭佑, 中臺一博
- Organizer
  人工知能学会第39回 AI チャレンジ研究会
- Place of Presentation
  慶應義塾大学，神奈川
- Year and Date
  2014-11-21 – 2014-11-21
[Presentation] マイクロホンアレイとスピーカをもつ柔軟索状ロボットのための動的スピーカ選択による姿勢推定の高速化2014
- Author(s)
  坂東宣昭, 糸山克寿, 昆陽雅司, 田所諭, 中臺一博, 吉井和佳, 奥乃博
- Organizer
  人工知能学会第39回 AI チャレンジ研究会
- Place of Presentation
  慶應義塾大学，神奈川
- Year and Date
  2014-11-21 – 2014-11-21
[Presentation] Transferring Vocal Expressions of a Professional Singer to Unaccompanied Singing Signals2014
- Author(s)
  Yukara Ikemiya, Katsutoshi Itoyama, Kazuyoshi Yoshii
- Organizer
  The 15th International Society for Music Information Retrieval (ISMIR 2014)
- Place of Presentation
  Taipei, Taiwan
- Year and Date
  2014-10-27 – 2014-10-31
[Presentation] 音環境理解研究の水平展開～マルチエージェントシステムからロボット聴覚・カエルの合唱解明へ～2014
- Author(s)
  奥乃博
- Organizer
  合同エージェントワークショップ＆シンポジウム2014（JAWS2014）
- Place of Presentation
  青島全日空ホテル，宮崎
- Year and Date
  2014-10-27 – 2014-10-29
- Invited
[Presentation] 能動耳介での音源定位に用いる周波数帯域の選択について2014
- Author(s)
  尾堂航, 公文誠
- Organizer
  第32回日本ロボット学会学術講演会
- Place of Presentation
  九州産業大学，福岡
- Year and Date
  2014-09-04 – 2014-09-06
[Presentation] マイクロホンアレイを用いた駆動機構付ホース型ロボットの姿勢推定2014
- Author(s)
  坂東宣昭, 糸山克寿, 昆陽雅司, 田所諭, 中臺一博, 吉井和佳, 奥乃博
- Organizer
  第32回日本ロボット学会学術講演会
- Place of Presentation
  九州産業大学，福岡
- Year and Date
  2014-09-04 – 2014-09-06
[Presentation] 「早言い」合図を識別してインタラクションに活用するロボット司会者2014
- Author(s)
  西牟田勇哉, 吉井和佳, 西出俊, 糸山克寿, 奥乃博
- Organizer
  第32回日本ロボット学会学術講演会
- Place of Presentation
  九州産業大学，福岡
- Year and Date
  2014-09-04 – 2014-09-06
[Presentation] 聴覚アウエアネス可視化に基づくジェスチャ操作インタフェースの開発2014
- Author(s)
  井山貴裕, 杉山治, 坂東宣昭, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  第32回日本ロボット学会学術講演会
- Place of Presentation
  九州産業大学，福岡
- Year and Date
  2014-09-04 – 2014-09-06
[Presentation] Deep Neural Networkを用いたマルチモーダル音声認識2014
- Author(s)
  野田邦明, 山口雄紀, 中臺一博, 奥乃博, 尾形哲也
- Organizer
  第32回日本ロボット学会学術講演会
- Place of Presentation
  九州産業大学，福岡
- Year and Date
  2014-09-04 – 2014-09-06
[Presentation] 人間の描画発達に基づくロボットの描画模倣学習モデルの構築2014
- Author(s)
  西出俊, 望月敬太, 奥乃博, 尾形哲也
- Organizer
  第32回日本ロボット学会学術講演会
- Place of Presentation
  九州産業大学，福岡
- Year and Date
  2014-09-04 – 2014-09-06
[Presentation] 混合音中の歌声F0軌跡に対する歌唱表現転写システム2014
- Author(s)
  池宮由楽, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  第104回情報処理学会音楽情報科学研究会(夏のシンポジウム) 音楽情報科学研究会
- Place of Presentation
  京都大学，京都
- Year and Date
  2014-08-25 – 2014-08-25
[Presentation] 演奏間相対テンポの結合力学モデルに基づく音響信号間アライメント2014
- Author(s)
  前澤陽, 糸山克寿, 吉井和佳, 奥乃博, 河原達也
- Organizer
  第104回情報処理学会音楽情報科学研究会(夏のシンポジウム) 音楽情報科学研究会
- Place of Presentation
  京都大学，京都
- Year and Date
  2014-08-25 – 2014-08-25
[Presentation] HARKを用いた多方向音声のアノテーションツールの開発2014
- Author(s)
  杉山治, 糸山克寿, 中臺一博, 奥乃博
- Organizer
  クラウドネットワークロボット研究会（CNR）, 電子情報通信学会
- Place of Presentation
  慶應義塾大学，神奈川
- Year and Date
  2014-06-14 – 2014-06-14
[Presentation] ビジュアルオドメトリと多層型レーザスキャナによる2次元地図作成と位置推定手法2014
- Author(s)
  大里章人, 加賀美聡, 溝口博
- Organizer
  JSME ロボティクス・メカトロニクス講演会2014 (ROBOMECH2014)
- Place of Presentation
  富山国際会議場，富山
- Year and Date
  2014-05-25 – 2014-05-29
[Presentation] パーティクルフィルタの選択的更新手法を用いた多層型2次元地図生成及び位置推定2014
- Author(s)
  仁瓶雄真, 江川拓良, 鮫島一平, 畑尾直孝, Simon Thompson, 加賀美聡, 竹村裕, 溝口博
- Organizer
  JSME ロボティクス・メカトロニクス講演会2014 (ROBOMECH2014)
- Place of Presentation
  富山国際会議場，富山
- Year and Date
  2014-05-25 – 2014-05-29
[Presentation] LRFを用いた高汎用性移動体追跡プラットフォームの設計2014
- Author(s)
  畑尾直孝, 加賀美聡
- Organizer
  JSME ロボティクス・メカトロニクス講演会2014 (ROBOMECH2014)
- Place of Presentation
  富山国際会議場，富山
- Year and Date
  2014-05-25 – 2014-05-29
[Presentation] 潜在共通構造モデルに基づく音響信号間アライメント2014
- Author(s)
  前澤陽, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  音学シンポジウム, 音楽情報科学研究会，情報処理学会
- Place of Presentation
  日本大学，東京
- Year and Date
  2014-05-25 – 2014-05-25
[Presentation] 市販楽曲からの歌い方ライブラリの作成2014
- Author(s)
  池宮由楽, 糸山克寿, 吉井和佳, 奥乃博
- Organizer
  音学シンポジウム, 音楽情報科学研究会，情報処理学会
- Place of Presentation
  日本大学，東京
- Year and Date
  2014-05-25 – 2014-05-25
[Book] 『パワーアシスト・ロボットに関する材料，電子機器，制御と実用化，その最新技術』2015
- Author(s)
  中臺一博, 奥乃博
- Total Pages
  590
- Publisher
  技術情報協会
[Book] 『感覚デバイス開発-- 機器が担うヒト感覚の生成・拡張・代替技術』2014
- Author(s)
  奥乃博
- Total Pages
  418
- Publisher
  NTS Inc.
[Book] 『感覚デバイス開発-- 機器が担うヒト感覚の生成・拡張・代替技術』2014
- Author(s)
  奥乃博, 中臺一博
- Total Pages
  418
- Publisher
  NTS Inc.
[Remarks] Open-source robot audition software HARK
- URL
  http://www.hark.jp/
[Remarks] Robot audition demos (Speech Media Laboratory (音声メディア研究室), Graduate School of Informatics, Kyoto University)
- URL
  http://winnie.kuis.kyoto-u.ac.jp/
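[Remarks] Software for the multiple-moving-object tracking method (National Institute of Advanced Industrial Science and Technology, AIST)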
- URL
  https://github.com/nhatao/mo_tracker
[Remarks] Nakadai Laboratory, Graduate School of Information Science and Engineering, Tokyo Institute of Technology
- URL
  http://www.cyb.mei.titech.ac.jp/nakadai/
[Remarks] Kumon Laboratory, Graduate School of Science and Technology, Kumamoto University
- URL
  http://as.mech.kumamoto-u.ac.jp/ja/research
