Audio-Visual Integration to Target Recognition by Drone Audition

Research Project

Project/Area Number	17K00365
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	General
Research Field	Intelligent robotics
Research Institution	Kumamoto University
Principal Investigator	Kumon Makoto, Kumamoto University, Faculty of Advanced Science and Technology (Engineering), Associate Professor (70332864)
Co-Investigator(Kenkyū-buntansha)	Nakadai Kazuhiro, Tokyo Institute of Technology, School of Engineering, Specially Appointed Professor (70436715)
Project Period (FY)	2017-04-01 – 2020-03-31
Project Status	Completed (Fiscal Year 2019)
Budget Amount	¥4,550,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥1,050,000); Fiscal Year 2019: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000); Fiscal Year 2018: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000); Fiscal Year 2017: ¥1,820,000 (Direct Cost: ¥1,400,000, Indirect Cost: ¥420,000)
Keywords	robot audition / audio-visual integration / sensor fusion / drone audition / sound source localization / multirotor helicopter / intelligent robotics
Outline of Final Research Achievements	This study addresses the recognition of targets on the ground from drones equipped with microphones. The target's acoustic signal captured at the drone is generally heavily distorted by ego-noise, so it is difficult to recognize the target from acoustic signals alone. The study aims to develop technology that compensates for this difficulty by incorporating visual sensor information. Acoustic features, which contain pauses, are fused with visual features, which are normally provided sequentially, in a setting where associating the visual information with the acoustic target is not trivial. Using the developed methods, the study shows that audio-visual integration improves acoustic target recognition under noisy conditions and, as an example, achieves three-dimensional position estimation of multiple moving targets by a drone with microphones.
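Academic Significance and Societal Importance of the Research Achievements	Sound sources, which are inherently characterized by sound, can now be recognized under certain conditions through a different kind of sensor information, namely images, even when the appearance of the source is not known. We consider this significant because it opens the way toward autonomously integrating diverse sensor information. The technology can also be used in applications needed by society at large, such as anomaly detection and crime prevention.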

Report

(4 results)

2019 Annual Research Report, Final Research Report (PDF)
2018 Research-status Report
2017 Research-status Report

Research Products
(40 results)


Int'l Joint Research (3 results), Journal Article (11 results, of which Int'l Joint Research: 1 result, Peer Reviewed: 9 results, Open Access: 3 results), Presentation (26 results, of which Int'l Joint Research: 4 results, Invited: 3 results)

[Int'l Joint Research] Virginia Tech / University of Virginia (USA)
- Related Report
  2019 Annual Research Report
[Int'l Joint Research] Virginia Tech (USA)
- Related Report
  2018 Research-status Report
[Int'l Joint Research] Virginia Tech (USA)
- Related Report
  2017 Research-status Report
[Journal Article] Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification (2020)
- Author(s)
  Mizuho Wakabayashi, Hiroshi G. Okuno, Makoto Kumon
- Journal Title
  
  IEEE Robotics and Automation Letters
  
  Volume: 5 Issue: 2 Pages: 782-789
- DOI
  10.1109/lra.2020.2965417
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Design and Implementation of Real-Time Visualization of Sound Source Positions by Drone Audition (2020)
- Author(s)
  Mizuho Wakabayashi, Kai Washizaki, Kotaro Hoshiba, Kazuhiro Nakadai, Hiroshi G. Okuno, Makoto Kumon
- Journal Title
  
  Proceedings of Int. Symposium on System Integration (SII2020)
  
  Volume: 1 Pages: 814-819
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Sound Source Tracking by Incorporating Target Motion Estimated by Visual Trackers (2020)
- Author(s)
  Yuto Kokusho, Makoto Kumon
- Journal Title
  
  Proceedings of Int. Symposium on System Integration (SII2020)
  
  Volume: 1 Pages: 652-657
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Sound Source Tracking by Drones with Microphone Arrays (2020)
- Author(s)
  Yamada Taiki、Itoyama Katsutoshi、Nishida Kenji、Nakadai Kazuhiro
- Journal Title
  
  2020 IEEE/SICE International Symposium on System Integration (SII2020)
  
  Volume: 1 Pages: 796-801
- DOI
  10.1109/sii46433.2020.9026185
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Belief-Driven Control Policy of a Drone with Microphones for Multiple Sound Source Search (2019)
- Author(s)
  Kenshiro Yamada, Makoto Kumon, Tomonari Furukawa
- Journal Title
  
  Proceedings of 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2019)
  
  Volume: - Pages: 5326-5332
- DOI
  10.1109/iros40897.2019.8968119
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Environmental sound segmentation utilizing Mask U-Net (2019)
- Author(s)
  Sudo Yui、Itoyama Katsutoshi、Nishida Kenji、Nakadai Kazuhiro
- Journal Title
  
  2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)
  
  Volume: 1 Pages: 5340-5345
- DOI
  10.1109/iros40897.2019.8967954
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Recent R&D Technologies and Future Prospective of Flying Robot in Tough Robotics Challenge (2019)
- Author(s)
  Kenzo Nonami, Kotaro Hoshiba, Kazuhiro Nakadai, Makoto Kumon, Hiroshi G Okuno, Yasutada Tanabe, Koichi Yonezawa, Hiroshi Tokutake, Satoshi Suzuki, Kohei Yamaguchi, Shigeru Sunada, Toshiyuki Nakata, Ryusuke Noda, Hao Liu
- Journal Title
  
  Disaster Robotics, Springer
  
  Volume: 128 Pages: 77-142
- DOI
  10.1007/978-3-030-05321-5_3
- ISBN
  9783030053208, 9783030053215
- Related Report
  2018 Research-status Report
- Peer Reviewed
[Journal Article] Transition and the current technologies in acoustic signal processing: From the viewpoint of robot audition (2018)
- Author(s)
  中臺一博
- Journal Title
  
  THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN
  
  Volume: 74 Issue: 7 Pages: 394-400
- DOI
  10.20697/jasj.74.7_394
- NAID
  130007541848
- ISSN
  0369-4232, 2432-2040
- Year and Date
  2018-07-01
- Related Report
  2018 Research-status Report
- Peer Reviewed
[Journal Article] Introduction to Sound Source Localization and Separation Software Using Microphone Arrays [マイクロホンアレイを用いた音源定位・分離ソフトウェア入門] (2018)
- Author(s)
  中臺一博
- Journal Title
  
  Systems, Control and Information (システム/制御/情報)
  
  Volume: 62-2 Pages: 42-49
- NAID
  130007433961
- Related Report
  2017 Research-status Report
[Journal Article] Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments (2017)
- Author(s)
  Hoshiba Kotaro、Washizaki Kai、Wakabayashi Mizuho、Ishiki Takahiro、Kumon Makoto、Bando Yoshiaki、Gabriel Daniel、Nakadai Kazuhiro、Okuno Hiroshi
- Journal Title
  
  Sensors
  
  Volume: 17 Issue: 11 Pages: 2535-2535
- DOI
  10.3390/s17112535
- NAID
  120006501320
- Related Report
  2017 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] What It Means to Contribute to the Open Source Community [オープンソースコミュニティーに貢献するということ] (2017)
- Author(s)
  中臺一博
- Journal Title
  
  The Journal of the Institute of Image Information and Television Engineers (映像情報メディア学会誌)
  
  Volume: 71-5 Pages: 647-653
- NAID
  130007699413
- Related Report
  2017 Research-status Report
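[Presentation] Expectations for Cross-Modal Research from the Perspective of Robot Audition [ロボット聴覚からのクロスモーダルへの期待] (2020)
- Author(s)
  中臺一博
- Organizer
  The 220th Computer Vision and Image Media (CVIM) Research Meeting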
- Related Report
  2019 Annual Research Report
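[Presentation] Moving Sound Source Tracking by Integrating Likelihood Distributions from Multiple Microphone Arrays [複数マイクロホンアレイを用いた尤度分布統合による移動音源追跡] (2020)
- Author(s)
  山田泰基, 糸山克寿, 西田健次, 中臺一博
- Organizer
  The 82nd National Convention of IPSJ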
- Related Report
  2019 Annual Research Report
[Presentation] Robot Audition and Drone Audition (2019)
- Author(s)
  Kazuhiro NAKADAI, Hiroshi G. OKUNO
- Organizer
  ICRA 2019 Workshop on Sound Source Localization and Its Applications for Robots
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Sound Source Tracking Using Multiple Microphone Arrays Mounted to an Unmanned Aerial Vehicle (2019)
- Author(s)
  Taiki YAMADA, Katsutoshi ITOYAMA, Kenji NISHIDA, Kazuhiro NAKADAI
- Organizer
  ICRA 2019 Workshop on Sound Source Localization and Its Applications for Robots
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
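[Presentation] A Proposal for Three-Dimensional Reconstruction in Dynamic Environments Using Audio-Visual Integration [視聴覚統合による動的環境下における三次元再構成の提案] (2019)
- Author(s)
  紺野隆志, 西田健次, 糸山克寿, 中臺一博
- Organizer
  The 55th Meeting of the JSAI Special Interest Group on AI Challenges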
- Related Report
  2019 Annual Research Report
[Presentation] Close Sound Source Localization Incorporating Semi-Supervised Variational Bayesian NMF (2019)
- Author(s)
  Makoto Kumon, Kai Washizaki, Kazuhiro Nakadai
- Organizer
  Int. Symposium on System Integration (SII2019)
- Related Report
  2018 Research-status Report
- Int'l Joint Research
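[Presentation] Three-Dimensional Tracking of Moving Sound Sources by Multiple UAVs Equipped with Multiple Microphone Arrays [複数のマイクロホンアレイを搭載した複数のUAVによる移動音源の三次元追跡] (2019)
- Author(s)
  山田泰基, Daniel Gabriel, 糸山克寿, 西田健次, 中臺一博
- Organizer
  The 81st National Convention of IPSJ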
- Related Report
  2018 Research-status Report
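[Presentation] A Study on Integrating Video and Audio Signals for Outdoor Moving Sound Source Tracking [屋外での移動音源追跡における動画像と音信号の統合の検討] (2018)
- Author(s)
  國生悠斗, 公文誠
- Organizer
  The 36th Annual Conference of the Robotics Society of Japan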
- Related Report
  2018 Research-status Report
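[Presentation] Speaker Position Estimation by Audio-Visual Integration Using Recursive Bayesian Estimation [繰り返しベイズ推定を用いた視聴覚統合による話者位置推定] (2018)
- Author(s)
  公文誠，鷲﨑海，Tomonari Furukawa
- Organizer
  SICE System Integration Division Annual Conference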
- Related Report
  2018 Research-status Report
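[Presentation] Three-Dimensional Localization of Moving Sound Sources Using Multiple Moving Microphone Arrays [複数の移動マイクロホンアレイによる移動音源の3次元定位] (2018)
- Author(s)
  山田泰基, ガブリエルダニエル, 糸山克寿, 西田健次, 中臺一博
- Organizer
  SICE System Integration Division Annual Conference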
- Related Report
  2018 Research-status Report
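[Presentation] Robot Audition Research and Its Development: Toward Drone Audition Technology for Rapidly Finding People in Need of Rescue in Disasters [ロボット聴覚研究とその展開 - 災害時の迅速な要救助者発見に向けたドローン聴覚技術開発に至るまで] (2018)
- Author(s)
  中臺一博
- Organizer
  Kuramae Professional Engineers Association (蔵前技術士会)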
- Related Report
  2018 Research-status Report
- Invited
[Presentation] Drone Audition: Toward Rapid Search for People in Need of Rescue at Disaster Sites [ドローン聴覚～災害地での迅速な要救助者の捜索を目指して～] (2018)
- Author(s)
  中臺一博
- Organizer
  ImPACT Symposium
- Related Report
  2018 Research-status Report
- Invited
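[Presentation] Music Audio Signal Restoration Using Quad-directional LSTM and Its Evaluation [Quad-directional LSTMを用いた音楽音響信号修復とその評価] (2018)
- Author(s)
  谷口亮輔, 干場功太郎, 中臺一博
- Organizer
  The 80th National Convention of IPSJ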
- Related Report
  2017 Research-status Report
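[Presentation] A Study of a Distance Estimation Method Based on Automatic Frequency Selection Using Audible Sound [可聴音を用いた周波数自動選択に基づく距離推定法の検討] (2018)
- Author(s)
  高尾麻衣子, 干場功太郎, 中臺一博
- Organizer
  The 80th National Convention of IPSJ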
- Related Report
  2017 Research-status Report
[Presentation] Evaluation of 2D bird localization algorithm using microphone arrays (2018)
- Author(s)
  Daniel Gabriel, Ryosuke Kojima, Kotaro Hoshiba, Kazuhiro Nakadai
- Organizer
  The 80th National Convention of IPSJ
- Related Report
  2017 Research-status Report
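[Presentation] A Proposal of a Noise-Robust Sound Source Localization Method Using an Active Frequency Range Filter [アクティブ周波数レンジフィルタを用いた雑音にロバストな音源定位手法の提案] (2017)
- Author(s)
  干場功太郎, 中臺一博, 公文誠, 奥乃博
- Organizer
  The 49th Meeting of the JSAI Special Interest Group on AI Challenges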
- Related Report
  2017 Research-status Report
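[Presentation] On Position Estimation of Multiple Ground Sound Sources Using a Multirotor Helicopter with a Microphone Array [マイクロホンアレイを有するマルチロータヘリコプタを用いた地上の複数音源の位置推定について] (2017)
- Author(s)
  若林瑞保, 公文誠
- Organizer
  The 49th Meeting of the JSAI Special Interest Group on AI Challenges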
- Related Report
  2017 Research-status Report
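[Presentation] Evaluation of Sound Source Search Performance of an Embedded System Using a UAV-Mounted Microphone Array [UAV搭載マイクロホンアレイを用いた組み込みシステムによる音源探査性能の評価] (2017)
- Author(s)
  干場功太郎，中臺一博，公文誠，奥乃博
- Organizer
  The 35th Annual Conference of the Robotics Society of Japan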
- Related Report
  2017 Research-status Report
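[Presentation] On System Parameters and Separation Performance in Sound Source Separation of Multirotor Helicopter Recordings: A Comparison of GHDSS and BNP-MAP [マルチロータヘリコプタ収録音の音源分離におけるシステムパラメータと分離性能について-GHDSSとBNP-MAPの比較] (2017)
- Author(s)
  鷲崎海, 公文誠, 大塚琢馬, 奥乃博, 干場功太郎, 中臺一博
- Organizer
  The 35th Annual Conference of the Robotics Society of Japan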
- Related Report
  2017 Research-status Report
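[Presentation] Map Management in Sound Source Search by a Multirotor Helicopter Based on a Grid-Based Recursive Bayes Filter [Grid based Recursive Bayes Filterに基づくマルチロータヘリコプタによる音源探査における地図管理] (2017)
- Author(s)
  山田健志郎, 公文誠
- Organizer
  The 35th Annual Conference of the Robotics Society of Japan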
- Related Report
  2017 Research-status Report
[Presentation] Development of Microphone-Array-Embedded UAV for Search and Rescue Task (2017)
- Author(s)
  Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno, Kotaro Hoshiba, Mizuho Wakabayashi, Kai Washizaki, Takahiro Ishiki, Daniel Gabriel, Yoshiaki Bando, Takayuki Morito, Ryosuke Kojima, Osamu Sugiyama
- Organizer
  International Conference on Intelligent Robots and Systems
- Related Report
  2017 Research-status Report
- Int'l Joint Research
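[Presentation] A Proposal of a Music Audio Signal Restoration Method Using Bi-directional LSTM [Bi-directional LSTM を用いた音楽音響信号修復法の提案] (2017)
- Author(s)
  谷口亮輔, 干場功太郎, 中臺一博
- Organizer
  The 35th Annual Conference of the Robotics Society of Japan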
- Related Report
  2017 Research-status Report
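[Presentation] A Study of a Distance Estimation Method Based on Frequency Selection Using Audible Sound [可聴音を用いた周波数選択に基づく距離推定法の検討] (2017)
- Author(s)
  高尾麻衣子, 干場功太郎, 中臺一博
- Organizer
  The 35th Annual Conference of the Robotics Society of Japan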
- Related Report
  2017 Research-status Report
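[Presentation] A Proposal of a Music Audio Signal Restoration Method Using Quad-directional LSTM [Quad-directional LSTMを用いた音楽音響信号修復法の提案] (2017)
- Author(s)
  谷口亮輔, 干場功太郎, 中臺一博
- Organizer
  The 49th Meeting of the JSAI Special Interest Group on AI Challenges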
- Related Report
  2017 Research-status Report
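[Presentation] Evaluation of a Distance Estimation Method Based on Frequency Selection Using Audible Sound Toward Real-Environment Use [可聴音を用いた周波数選択に基づく距離推定法の実環境利用に向けた評価] (2017)
- Author(s)
  高尾麻衣子, 干場功太郎, 中臺一博
- Organizer
  The 49th Meeting of the JSAI Special Interest Group on AI Challenges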
- Related Report
  2017 Research-status Report
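[Presentation] Technical Introduction to the Robot Audition Open-Source Software HARK and Its Development [ロボット聴覚オープンソースソフトウェアHARK の技術紹介とその展開] (2017)
- Author(s)
  中臺一博
- Organizer
  Society of Automotive Engineers of Japan, Electronics Division (自動車技術会エレクトロニクス部門)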
- Related Report
  2017 Research-status Report
- Invited
