• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Audio-Visual Integration to Target Recognition by Drone Audition

Research Project

Project/Area Number 17K00365
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Intelligent robotics
Research InstitutionKumamoto University

Principal Investigator

Kumon Makoto  熊本大学, 大学院先端科学研究部(工), 准教授 (70332864)

Co-Investigator(Kenkyū-buntansha) 中臺 一博  東京工業大学, 工学院, 特任教授 (70436715)
Project Period (FY) 2017-04-01 – 2020-03-31
Project Status Completed (Fiscal Year 2019)
Budget Amount *help
¥4,550,000 (Direct Cost: ¥3,500,000、Indirect Cost: ¥1,050,000)
Fiscal Year 2019: ¥910,000 (Direct Cost: ¥700,000、Indirect Cost: ¥210,000)
Fiscal Year 2018: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
Fiscal Year 2017: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
Keywordsロボット聴覚 / 視聴覚統合 / センサフュージョン / ドローン聴覚 / 音源定位 / マルチロータヘリコプタ / 知能ロボティックス
Outline of Final Research Achievements

In this study, it is considered to recognize targets on the ground from drones with microphones. The target acoustic signal obtained at the drone is generally significantly distorted by the ego-noise, and, hence, it is difficult to recognize the target only by acoustic signals. This study aims to develop the technology to compensate this difficulty by incorporating visual sensor information.
Acoustic features that contain pauses is fused with visual features that are normally provided sequentially where it is not trivial to associate the visual information with the acoustic target.
Based on the developed methods, it is shown that audio-visual integration improves the audio target recognition under noisy situation, and as an example, three-dimensional position estimation of moving plural targets by the drone with microphones was achieved.

Academic Significance and Societal Importance of the Research Achievements

本来音によって特徴づけられる音源について、一定の条件の下で画像という異なるセンサ情報(外見の分からない音源)を通じて認識する出来るようになったことを通じ、様々なセンサ情報を自律的に統合する方向へと展開が可能で意義があると考えている。異常検知や、防犯等、社会一般で必要とされる技術としても利用可能である。

Report

(4 results)
  • 2019 Annual Research Report   Final Research Report ( PDF )
  • 2018 Research-status Report
  • 2017 Research-status Report
  • Research Products

    (40 results)

All 2020 2019 2018 2017 Other

All Int'l Joint Research (3 results) Journal Article (11 results) (of which Int'l Joint Research: 1 results,  Peer Reviewed: 9 results,  Open Access: 3 results) Presentation (26 results) (of which Int'l Joint Research: 4 results,  Invited: 3 results)

  • [Int'l Joint Research] バージニア工科大学/バージニア大学(米国)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] バージニア工科大学(米国)

    • Related Report
      2018 Research-status Report
  • [Int'l Joint Research] バージニア工科大学(米国)

    • Related Report
      2017 Research-status Report
  • [Journal Article] Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification2020

    • Author(s)
      Mizuho Wakabayashi, Hiroshi G. Okuno , Makoto Kumon
    • Journal Title

      IEEE Robotics and Automation Letters

      Volume: 5 Issue: 2 Pages: 782-789

    • DOI

      10.1109/lra.2020.2965417

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Design and Implementation of Real-Time Visualization of Sound Source Positions by Drone Audition2020

    • Author(s)
      Mizuho Wakabayashi, Kai Washizaka, Kotaro Hoshiba, Kazuhiro Nakadai, Hiroshi G. Okuno, Makoto Kumon
    • Journal Title

      Proceedings of Int. Symposium on System Integration (SII2020)

      Volume: 1 Pages: 814-819

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Sound Source Tracking by Incorporating Target Motion Estimated by Visual Trackers2020

    • Author(s)
      Yuto Kokusho, Makoto Kumon
    • Journal Title

      Proceedings of Int. Symposium on System Integration (SII2020)

      Volume: 1 Pages: 652-657

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Sound Source Tracking by Drones with Microphone Arrays2020

    • Author(s)
      Yamada Taiki、Itoyama Katsutoshi、Nishida Kenji、Nakadai Kazuhiro
    • Journal Title

      2020 IEEE/SICE International Symposium on System Integration (SII2020)

      Volume: 1 Pages: 796-801

    • DOI

      10.1109/sii46433.2020.9026185

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Belief-Driven Control Policy of a Drone with Microphones for Multiple Sound Source Search2019

    • Author(s)
      Kenshiro Yamada, Makoto Kumon, Tomonari Furukawa
    • Journal Title

      Proceedings of 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2019)

      Volume: - Pages: 5326-5332

    • DOI

      10.1109/iros40897.2019.8968119

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Environmental sound segmentation utilizing Mask U-Net2019

    • Author(s)
      Sudo Yui、Itoyama Katsutoshi、Nishida Kenji、Nakadai Kazuhiro
    • Journal Title

      2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

      Volume: 1 Pages: 5340-5345

    • DOI

      10.1109/iros40897.2019.8967954

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Recent R&D Technologies and Future Prospective of Flying Robot in Tough Robotics Challenge2019

    • Author(s)
      Kenzo Nonami, Kotaro Hoshiba, Kazuhiro Nakadai, Makoto Kumon, Hiroshi G Okuno, Yasutada Tanabe, Koichi Yonezawa, Hiroshi Tokutake, Satoshi Suzuki, Kohei Yamaguchi, Shigeru Sunada, Toshiyuki Nakata, Ryusuke Noda, Hao Liu
    • Journal Title

      Disaster Robotics, Springer

      Volume: 128 Pages: 77-142

    • DOI

      10.1007/978-3-030-05321-5_3

    • ISBN
      9783030053208, 9783030053215
    • Related Report
      2018 Research-status Report
    • Peer Reviewed
  • [Journal Article] Transition and the current technologies in acoustic signal processing: From the viewpoint of robot audition2018

    • Author(s)
      中臺 一博
    • Journal Title

      THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN

      Volume: 74 Issue: 7 Pages: 394-400

    • DOI

      10.20697/jasj.74.7_394

    • NAID

      130007541848

    • ISSN
      0369-4232, 2432-2040
    • Year and Date
      2018-07-01
    • Related Report
      2018 Research-status Report
    • Peer Reviewed
  • [Journal Article] マイクロホンアレイを用いた音源定位・分離ソフトウェア入門2018

    • Author(s)
      中臺 一博
    • Journal Title

      システム/制御/情報

      Volume: 62-2 Pages: 42-49

    • NAID

      130007433961

    • Related Report
      2017 Research-status Report
  • [Journal Article] Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments2017

    • Author(s)
      Hoshiba Kotaro、Washizaki Kai、Wakabayashi Mizuho、Ishiki Takahiro、Kumon Makoto、Bando Yoshiaki、Gabriel Daniel、Nakadai Kazuhiro、Okuno Hiroshi
    • Journal Title

      Sensors

      Volume: 17 Issue: 11 Pages: 2535-2535

    • DOI

      10.3390/s17112535

    • NAID

      120006501320

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] オープンソースコミュニティーに貢献するということ2017

    • Author(s)
      中臺 一博
    • Journal Title

      映像情報メディア学会誌

      Volume: 71-5 Pages: 647-653

    • NAID

      130007699413

    • Related Report
      2017 Research-status Report
  • [Presentation] ロボット聴覚からのクロスモーダルへの期待2020

    • Author(s)
      中臺 一博
    • Organizer
      第220回コンピュータビジョンとイメージメディア研究発表会
    • Related Report
      2019 Annual Research Report
  • [Presentation] 複数マイクロホンアレイを用いた尤度分布統合による移動音源追跡2020

    • Author(s)
      山田 泰基, 糸山 克寿, 西田 健次, 中臺 一博
    • Organizer
      情報処理学会第82回全国大会
    • Related Report
      2019 Annual Research Report
  • [Presentation] Robot Audition and Drone Audition2019

    • Author(s)
      Kazuhiro NAKADAI, Hiroshi G. OKUNO
    • Organizer
      ICRA 2019 Workshop on Sound Source Localization and Its Applications for Robots
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] ound Source Tracking Using Multiple Microphone Arrays Mounted to an Unmanned Aerial Vehicle2019

    • Author(s)
      Taiki YAMADA, Katsutoshi ITOYAMA, Kenji NISHIDA, Kazuhiro NAKADAI
    • Organizer
      ICRA 2019 Workshop on Sound Source Localization and Its Applications for Robots
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 視聴覚統合による動的環境下における三次元再構成の提案2019

    • Author(s)
      紺野 隆志, 西田 健次, 糸山 克寿, 中臺 一博
    • Organizer
      第55回人工知能学会 AIチャレンジ研究会
    • Related Report
      2019 Annual Research Report
  • [Presentation] Close Sound Source Localization Inroporating Semi-Supervised Variational Bayesian NMF2019

    • Author(s)
      Makoto Kumon, Kai Washizaki, Kazuhiro Nakadai
    • Organizer
      Int. Symposium on System Integration (SII2019)
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] 複数のマイクロホンアレイを搭載した複数のUAVによる移動音源の三次元追跡2019

    • Author(s)
      山田 泰基, Daniel Gabriel, 糸山 克寿, 西田 健次, 中臺 一博
    • Organizer
      情報処理学会第81回全国大会
    • Related Report
      2018 Research-status Report
  • [Presentation] 屋外での移動音源追跡における動画像と音信号の統合の検討2018

    • Author(s)
      國生悠斗, 公文誠
    • Organizer
      第36回日本ロボット学会学術講演会
    • Related Report
      2018 Research-status Report
  • [Presentation] 繰り返しベイズ推定を用いた視聴覚統合による話者位置推定2018

    • Author(s)
      公文誠,鷲﨑海,Tomonari Furukawa
    • Organizer
      計測自動制御学会システムインテグレーション部門講演会
    • Related Report
      2018 Research-status Report
  • [Presentation] 複数の移動マイクロホンアレイによる移動音源の3次元定位2018

    • Author(s)
      山田 泰基, ガブリエル ダニエル, 糸山 克寿, 西田 健次, 中臺 一博
    • Organizer
      計測自動制御学会システムインテグレーション部門講演会
    • Related Report
      2018 Research-status Report
  • [Presentation] ロボット聴覚研究とその展開 - 災害時の迅速な要救助者発見 に向けたドローン聴覚技術開発に至るまで2018

    • Author(s)
      中臺一博
    • Organizer
      蔵前技術士会
    • Related Report
      2018 Research-status Report
    • Invited
  • [Presentation] ドローン聴覚~災害地での迅速な要救助者の捜索を目指して~2018

    • Author(s)
      中臺一博
    • Organizer
      ImPACTシンポジウム
    • Related Report
      2018 Research-status Report
    • Invited
  • [Presentation] Quad-directional LSTMを用いた音楽音響信号修復とその評価2018

    • Author(s)
      谷口 亮輔, 干場 功太郎, 中臺 一博
    • Organizer
      第80回情報処理学会全国大会
    • Related Report
      2017 Research-status Report
  • [Presentation] 可聴音を用いた周波数自動選択に基づく距離推定法の検討2018

    • Author(s)
      高尾 麻衣子, 干場 功太郎, 中臺 一博
    • Organizer
      第80回情報処理学会全国大会
    • Related Report
      2017 Research-status Report
  • [Presentation] Evaluation of 2D bird localization algorithm using microphone arrays2018

    • Author(s)
      Daniel Gabriel, Ryosuke Kojima, Kotaro Hoshiba, Kazuhiro Nakadai
    • Organizer
      The 80th National Convention of IPSJ
    • Related Report
      2017 Research-status Report
  • [Presentation] アクティブ周波数レンジフィルタを用いた雑音にロバストな音源定位手法の提案2017

    • Author(s)
      干場功太郎, 中臺一博, 公文誠, 奥乃博
    • Organizer
      人工知能学会 第49回AIチャレンジ研究会
    • Related Report
      2017 Research-status Report
  • [Presentation] マイクロホンアレイを有するマルチロータヘリコプタを用いた地上の複数音源の 位置推定について2017

    • Author(s)
      若林瑞保, 公文誠
    • Organizer
      人工知能学会 第49回AIチャレンジ研究会
    • Related Report
      2017 Research-status Report
  • [Presentation] UAV搭載マイクロホンアレイを用いた組み込みシステムによる音源探査性能の評価2017

    • Author(s)
      干場功太郎,中臺一博,公文誠,奥乃博
    • Organizer
      第35回日本ロボット学会学術講演会
    • Related Report
      2017 Research-status Report
  • [Presentation] マルチロータヘリコプタ収録音の音源分離におけるシステムパラメータと分離性能について-GHDSSとBNP-MAPの比較2017

    • Author(s)
      鷲崎海, 公文誠, 大塚琢馬, 奥乃博, 干場功太郎, 中臺一博
    • Organizer
      第35回日本ロボット学会学術講演会
    • Related Report
      2017 Research-status Report
  • [Presentation] Grid based Recursive Bayes Filterに基づくマルチロータヘリコプタによる音源探査における地図管理2017

    • Author(s)
      山田健志郎, 公文誠
    • Organizer
      第35回日本ロボット学会学術講演会
    • Related Report
      2017 Research-status Report
  • [Presentation] Development of Microphone-Array-Embedded UAV for Search and Rescue Task2017

    • Author(s)
      Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno, Kotaro Hoshiba, Mizuho Wakabayashi, Kai Washizaki, Takahiro Ishiki, Daniel Gabriel, Yoshiaki Bando, Takayuki Morito, Ryosuke Kojima, Osamu Sugiyama
    • Organizer
      International Conference on Intelligent Robots and Systems
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Bi-directional LSTM を用いた音楽音響信号修復法の提案2017

    • Author(s)
      谷口 亮輔, 干場 功太郎, 中臺 一博
    • Organizer
      第35回日本ロボット学会学術講演会
    • Related Report
      2017 Research-status Report
  • [Presentation] 可聴音を用いた周波数選択に基づく距離推定法の検討2017

    • Author(s)
      高尾 麻衣子, 干場 功太郎, 中臺 一博
    • Organizer
      第35回日本ロボット学会学術講演会
    • Related Report
      2017 Research-status Report
  • [Presentation] Quad-directional LSTMを用いた音楽音響信号修復法の提案2017

    • Author(s)
      谷口 亮輔, 干場 功太郎, 中臺 一博
    • Organizer
      人工知能学会 第49回AIチャレンジ研究会
    • Related Report
      2017 Research-status Report
  • [Presentation] 可聴音を用いた周波数選択に基づく距離推定法の実環境利用に向けた評価2017

    • Author(s)
      高尾 麻衣子, 干場 功太郎, 中臺 一博
    • Organizer
      人工知能学会 第49回AIチャレンジ研究会
    • Related Report
      2017 Research-status Report
  • [Presentation] ロボット聴覚オープンソースソフトウェアHARK の技術紹介とその展開2017

    • Author(s)
      中臺 一博
    • Organizer
      自動車技術会 エレクトロニクス部門
    • Related Report
      2017 Research-status Report
    • Invited

URL: 

Published: 2017-04-28   Modified: 2022-02-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi