2019 Fiscal Year Annual Research Report

Audio-Visual Integration to Target Recognition by Drone Audition

Research Project

Project/Area Number	17K00365
Research Institution	Kumamoto University
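Principal Investigator	Makoto Kumon, Kumamoto University, Faculty of Advanced Science and Technology (Engineering), Associate Professor (70332864)
Co-Investigator (Kenkyū-buntansha)	Kazuhiro Nakadai, Tokyo Institute of Technology, School of Engineering, Specially Appointed Professor (70436715)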
Project Period (FY)	2017-04-01 – 2020-03-31
Keywords	Robot audition / Drone audition / Audio-visual integration / Sensor fusion
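Outline of Annual Research Achievements	This project focuses on sound source tracking by a drone that uses both the acoustic information of the target source and other information, such as features in images. Under the assumption that correspondences between these multimodal cues are available, we have previously shown that tasks such as estimating the depth of a sound source are possible, and we expect that recent machine learning methods will be useful for finding correspondences between features. In general, however, the relationship between modalities is difficult to specify in advance, so we concentrated on relaxing this assumption. First, we developed an audio-visual integration technique that automatically detects sound-image pairs in test data recorded with a microphone array and camera sensor pair and supplies them for learning. We also proposed a method that tracks multiple sound sources stably by combining sound source direction with different kinds of features, such as timbre; image features can be incorporated in the future. In addition, the signal recorded by a drone is a mixture of the signals emitted by multiple targets, so separating and identifying them properly is an important element of sound source tracking. In this project we developed a method for distinguishing sound sources that combines sound source separation based on robot audition technology with a bag-of-words approach, and a deep-learning-based method for segmenting the signal of each source. Furthermore, we showed that when multiple sound sources are present, the uncertainty of the estimated source positions becomes non-Gaussian, and on this basis we developed an active sound source search method that plans the drone's flight path.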
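The idea of combining sound source direction with a second feature such as timbre can be illustrated with a small sketch. It is not the project's method: the cost function, the weights `w_dir` and `w_feat`, the two-dimensional "timbre" vectors, and the brute-force assignment are all assumptions chosen for illustration. It shows only that two sources at nearly the same azimuth, which direction alone cannot tell apart, become separable once a feature distance is added to the association cost.

```python
import itertools
import math


def angle_diff(a, b):
    """Smallest absolute difference between two angles, in radians."""
    d = (a - b + math.pi) % (2 * math.pi) - math.pi
    return abs(d)


def feature_dist(u, v):
    """Euclidean distance between two feature vectors (hypothetical timbre features)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def associate(tracks, detections, w_dir=1.0, w_feat=1.0):
    """Assign one detection to each track by minimizing a combined cost.

    tracks, detections: equal-length lists of (azimuth_rad, feature_vector).
    Every permutation is tried, so this is only suitable for a few sources.
    Returns a list whose i-th entry is the detection index given to track i.
    """
    best_perm, best_cost = None, math.inf
    for perm in itertools.permutations(range(len(tracks))):
        cost = 0.0
        for i, j in enumerate(perm):
            cost += w_dir * angle_diff(tracks[i][0], detections[j][0])
            cost += w_feat * feature_dist(tracks[i][1], detections[j][1])
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return list(best_perm)


# Two tracked sources at almost the same azimuth but with different features.
tracks = [(0.50, [1.0, 0.0]), (0.52, [0.0, 1.0])]
# New detections arrive in swapped order, both at azimuth 0.51 rad.
detections = [(0.51, [0.1, 0.9]), (0.51, [0.9, 0.1])]

with_timbre = associate(tracks, detections)
print(with_timbre)  # track 0 takes detection 1, track 1 takes detection 0
```

With `w_feat=0.0` both assignments have essentially the same cost, so the result is arbitrary; the feature term is what resolves the ambiguity. For more than a handful of sources, the permutation search would normally be replaced by an assignment solver such as `scipy.optimize.linear_sum_assignment`.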

Research Products
(12 results)


All Int'l Joint Research (1 results) Journal Article (6 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 6 results, Open Access: 1 results) Presentation (5 results) (of which Int'l Joint Research: 2 results)

[Int'l Joint Research] Virginia Tech/University of Virginia (U.S.A.)
- Country Name
  U.S.A.
- Counterpart Institution
  Virginia Tech/University of Virginia
[Journal Article] Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification (2020)
- Author(s)
  Mizuho Wakabayashi, Hiroshi G. Okuno, Makoto Kumon
- Journal Title
  
  IEEE Robotics and Automation Letters
  
  Volume: 5 Pages: 782-789
- DOI
  https://doi.org/10.1109/LRA.2020.2965417
- Peer Reviewed / Open Access
[Journal Article] Design and Implementation of Real-Time Visualization of Sound Source Positions by Drone Audition (2020)
- Author(s)
  Mizuho Wakabayashi, Kai Washizaka, Kotaro Hoshiba, Kazuhiro Nakadai, Hiroshi G. Okuno, Makoto Kumon
- Journal Title
  
  Proceedings of Int. Symposium on System Integration (SII2020)
  
  Volume: 1 Pages: 814-819
- Peer Reviewed
[Journal Article] Sound Source Tracking by Incorporating Target Motion Estimated by Visual Trackers (2020)
- Author(s)
  Yuto Kokusho, Makoto Kumon
- Journal Title
  
  Proceedings of Int. Symposium on System Integration (SII2020)
  
  Volume: 1 Pages: 652-657
- Peer Reviewed
[Journal Article] Sound Source Tracking by Drones with Microphone Arrays (2020)
- Author(s)
  Yamada Taiki, Itoyama Katsutoshi, Nishida Kenji, Nakadai Kazuhiro
- Journal Title
  
  2020 IEEE/SICE International Symposium on System Integration (SII2020)
  
  Volume: 1 Pages: 796-801
- DOI
  https://doi.org/10.1109/SII46433.2020.9026185
- Peer Reviewed
[Journal Article] Belief-Driven Control Policy of a Drone with Microphones for Multiple Sound Source Search (2019)
- Author(s)
  Yamada Kenshiro, Kumon Makoto, Furukawa Tomonari
- Journal Title
  
  2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2019)
  
  Volume: 1 Pages: 5326-5332
- DOI
  https://doi.org/10.1109/IROS40897.2019.8968119
- Peer Reviewed / Int'l Joint Research
[Journal Article] Environmental sound segmentation utilizing Mask U-Net (2019)
- Author(s)
  Sudo Yui, Itoyama Katsutoshi, Nishida Kenji, Nakadai Kazuhiro
- Journal Title
  
  2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)
  
  Volume: 1 Pages: 5340-5345
- DOI
  https://doi.org/10.1109/IROS40897.2019.8967954
- Peer Reviewed
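[Presentation] Expectations for Cross-Modal Research from Robot Audition [ロボット聴覚からのクロスモーダルへの期待] (2020)
- Author(s)
  Kazuhiro Nakadai
- Organizer
  220th Research Meeting on Computer Vision and Image Media (CVIM)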
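[Presentation] Moving Sound Source Tracking by Integrating Likelihood Distributions from Multiple Microphone Arrays [複数マイクロホンアレイを用いた尤度分布統合による移動音源追跡] (2020)
- Author(s)
  Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai
- Organizer
  82nd National Convention of the Information Processing Society of Japan (IPSJ)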
[Presentation] Robot Audition and Drone Audition (2019)
- Author(s)
  Kazuhiro NAKADAI, Hiroshi G. OKUNO
- Organizer
  ICRA 2019 Workshop on Sound Source Localization and Its Applications for Robots
- Int'l Joint Research
[Presentation] Sound Source Tracking Using Multiple Microphone Arrays Mounted to an Unmanned Aerial Vehicle (2019)
- Author(s)
  Taiki YAMADA, Katsutoshi ITOYAMA, Kenji NISHIDA, Kazuhiro NAKADAI
- Organizer
  ICRA 2019 Workshop on Sound Source Localization and Its Applications for Robots
- Int'l Joint Research
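[Presentation] Proposal of Three-Dimensional Reconstruction in Dynamic Environments by Audio-Visual Integration [視聴覚統合による動的環境下における三次元再構成の提案] (2019)
- Author(s)
  Takashi Konno (紺野隆志), Kenji Nishida, Katsutoshi Itoyama, Kazuhiro Nakadai
- Organizer
  55th Meeting of the Japanese Society for Artificial Intelligence (JSAI) Special Interest Group on AI Challenges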
