Realization of deep reinforcement learning in real environment and its application to swarm robots

Research Project

Project/Area Number	19K12147
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Review Section	Basic Section 61040:Soft computing-related
Research Institution	Matsue National College of Technology
Principal Investigator	HORIUCHI Tadashi 松江工業高等専門学校, 電子制御工学科, 教授 (50294129)
Co-Investigator(Kenkyū-buntansha)	青代敏行東京都立産業技術高等専門学校, ものづくり工学科, 准教授 (40571849)
Project Period (FY)	2019-04-01 – 2024-03-31
Project Status	Completed (Fiscal Year 2023)
Budget Amount *help	¥4,420,000 (Direct Cost: ¥3,400,000、Indirect Cost: ¥1,020,000) Fiscal Year 2021: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2020: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000) Fiscal Year 2019: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
Keywords	深層強化学習 / 知能ロボティクス / 群ロボット / 行動獲得
Outline of Research at the Start	本研究では，実環境での深層強化学習の実現と群ロボットの協調行動の獲得を目指す．実環境での深層強化学習を実現するために，「シミュレーションと実機の差異の低減手法」を導入する．その差異の低減手法として，実機ロボットに搭載したカメラの画像に対してセグメンテーション（領域分割）と低解像度化をすることで，実環境での見え方をシミュレーション環境での見え方に近づけて，実環境の複雑さを低減する．そのうえで，複数のロボットが存在する環境を想定し，群ロボットの協調行動の獲得を実現する．さらに，実社会での応用において重要となる「深層強化学習における行動選択の説明性の向上」を実現する．
Outline of Final Research Achievements	In this study, we realized deep reinforcement learning in real environment and applied it to acquiring cooperative behavior of swarm robots. First, we investigated the method to reduce the difference between the simulation environment and the real robot environment. Using this method, we achieved behavior acquisition based on visual information for a single mobile robot using deep reinforcement learning. Next, assuming the environment in which multiple robots exist, we achieved the acquisition of cooperative behavior for real swarm robots. More concretely, we realized to acquire following and overtaking behaviors as cooperative behaviors in swarm robot environment by deep reinforcement learning to enable robot. Finally, in order to improve the explainability of robot action selection using deep reinforcement learning, we confirmed the effectiveness of visualizing the attention area, that indicates which area in the camera image the robot focused on when selecting an action.
Academic Significance and Societal Importance of the Research Achievements	本研究は，人工知能の中核である機械学習の技術を実機ロボットに応用したものである．深層学習と強化学習を組み合わせた深層強化学習という手法を用いて，実機ロボットが視覚情報に基づいて行動を自分自身で学習することを実現した．また，複数のロボットが存在する環境を想定し，実機の群ロボットを対象とした協調行動の獲得を実現した．具体的には，群ロボットの環境において，追従行動および追い抜き行動の獲得を実現し，深層強化学習による協調行動の学習が可能であることを明らかにした．これらの成果は，物流倉庫や福祉施設などにおける搬送ロボット群の行動制御などにつながると期待できる．

Report

(6 results)

2023 Annual Research Report Final Research Report ( PDF )
2022 Research-status Report
2021 Research-status Report
2020 Research-status Report
2019 Research-status Report

Research Products
(31 results)

All 2024 2023 2022 2021 2020 2019

All Journal Article (3 results) (of which Peer Reviewed: 3 results, Open Access: 1 results) Presentation (28 results) (of which Int'l Joint Research: 5 results, Invited: 1 results)

[Journal Article] モーションセンサを用いたオールの動きを可視化する試み～ローイング運動時のブレードワークに着目して～2021
- Author(s)
  一箭フェルナンドヒロシ，堀内匡
- Journal Title
  
  スポーツパフォーマンス研究
  
  Volume: 13 Pages: 418-427
- Related Report
  2021 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Generation of Hand Contour Images using Colored Glove and Its Application to Hand Pattern Recognition2020
- Author(s)
  藤嶋教彰，北尾樹，高橋歩武，堀内匡
- Journal Title
  
  IEEJ Transactions on Electronics, Information and Systems
  
  Volume: 140 Issue: 1 Pages: 61-67
- DOI
  10.1541/ieejeiss.140.61
- NAID
  130007779189
- ISSN
  0385-4221, 1348-8155
- Year and Date
  2020-01-01
- Related Report
  2019 Research-status Report
- Peer Reviewed
[Journal Article] Vision-based Behavior Acquisition by Deep Reinforcement Learning in Multi-Robot Environment2020
- Author(s)
  R. Watanuki, T. Horiuchi and T. Aodai
- Journal Title
  
  ICIC Express Letters, Part B: Applications
  
  Volume: 11 Pages: 237-244
- NAID
  40022249302
- Related Report
  2019 Research-status Report
- Peer Reviewed
[Presentation] A Study on Bayesian Optimization of Hyper-Parameters and Reward Settings in Deep Reinforcement Learning2024
- Author(s)
  R. Sota, T. Nishimura and T. Horiuchi
- Organizer
  The 2nd International Symposium on Data-Driven Intelligent Optimization for Decision Making
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] 画像処理技術を用いた板金加工部品の自動寸法検査システムに関する検討2024
- Author(s)
  佃結衣，伊藤皓史郎，外谷昭洋，堀内匡
- Organizer
  2024年電子情報通信学会総合大会
- Related Report
  2023 Annual Research Report
[Presentation] Vision Transformerによる画像認識と判断根拠の可視化2024
- Author(s)
  原歩睦，堀内匡
- Organizer
  第28回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2023 Annual Research Report
[Presentation] 深層強化学習を用いた群ロボットの協調行動の獲得2024
- Author(s)
  生和直央，堀内匡
- Organizer
  第28回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2023 Annual Research Report
[Presentation] A Study on Optimization of Hyper-Parameters in Deep Reinforcement Learning by Bayesian Optimization2023
- Author(s)
  R. Sota, T. Nishimura and T. Horiuchi
- Organizer
  The 17th International Conference on Innovative Computing, Information and Control
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] ベイズ最適化を用いた深層強化学習のハイパーパラメータの最適化に関する検討2023
- Author(s)
  曽田涼介，西村拓人，堀内匡
- Organizer
  2023年電気学会電子・情報・システム部門大会
- Related Report
  2023 Annual Research Report
[Presentation] ベイズ最適化による深層強化学習のハイパーパラメータと報酬関数の最適化の試み2023
- Author(s)
  曽田涼介，西村拓人，堀内匡
- Organizer
  電気学会システム/制御合同研究会
- Related Report
  2023 Annual Research Report
[Presentation] 深層強化学習を用いた群ロボットの追い抜き行動の獲得2023
- Author(s)
  生和直央，堀内匡
- Organizer
  第27回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2022 Research-status Report
[Presentation] ベイズ最適化を用いた深層強化学習のハイパーパラメータの最適化2023
- Author(s)
  曽田涼介，西村拓人，堀内匡
- Organizer
  第27回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2022 Research-status Report
[Presentation] 深層強化学習を用いた群ロボットの協調行動の獲得に関する検討2022
- Author(s)
  福島英，曽田涼介，堀内匡
- Organizer
  2022年電気学会電子・情報・システム部門大会
- Related Report
  2022 Research-status Report
[Presentation] 深層強化学習を用いた移動ロボットの行動獲得における改良手法の検討2022
- Author(s)
  曽田涼介，福島英，堀内匡
- Organizer
  2022年電気学会電子・情報・システム部門大会
- Related Report
  2022 Research-status Report
[Presentation] 深層強化学習を用いた群ロボットの協調行動の獲得2022
- Author(s)
  福島英，綿貫零真，堀内匡
- Organizer
  第26回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2021 Research-status Report
[Presentation] IMUセンサを用いた漕艇動作可視化システムに関する研究2022
- Author(s)
  石原悠登，石川晃太朗，一箭フェルナンドヒロシ，堀内匡
- Organizer
  第26回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2021 Research-status Report
[Presentation] 深層強化学習を用いた群ロボットの行動獲得に関する実験的考察2021
- Author(s)
  福島英，綿貫零真，堀内匡
- Organizer
  2021年電気学会電子・情報・システム部門大会
- Related Report
  2021 Research-status Report
[Presentation] ボート競技における漕艇動作可視化システムの基礎的検討2021
- Author(s)
  石原悠登，石川晃太朗，一箭フェルナンドヒロシ，堀内匡
- Organizer
  2021年電気学会電子・情報・システム部門大会
- Related Report
  2021 Research-status Report
[Presentation] A PBL-Oriented English Class to Foster Globally Competent Future Engineers2021
- Author(s)
  S. Matsuda, T. Horiuchi, M. Higa, A. Ashraful and J. Salsgiver
- Organizer
  The 14th International Symposium on Advances in Technology Education
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] 深層強化学習を用いた群ロボットの行動獲得に関する検討2021
- Author(s)
  福島英，山田航平，堀内匡
- Organizer
  第25回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2020 Research-status Report
[Presentation] ボート競技における漕艇動作可視化システムに関する検討2021
- Author(s)
  石原悠登，石川晃太朗，堀内匡，一箭フェルナンドヒロシ
- Organizer
  第25回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2020 Research-status Report
[Presentation] 深層強化学習を用いた群ロボットの実機環境での行動獲得の試み2020
- Author(s)
  綿貫零真，カンサリュー，福島英，堀内匡
- Organizer
  計測自動制御学会システム・情報部門学術講演会2020
- Related Report
  2020 Research-status Report
[Presentation] 深層強化学習における移動ロボットの注視領域の可視化手法に関する検討2020
- Author(s)
  福田隼也，綿貫零真，堀内匡
- Organizer
  電気学会システム研究会
- Related Report
  2020 Research-status Report
[Presentation] 人工知能・機械学習の活用とロボティクス2020
- Author(s)
  堀内匡
- Organizer
  技術コミュニティラボ第9回ミーティング
- Related Report
  2019 Research-status Report
- Invited
[Presentation] A Study on Vision-based Behavior Acquisition of Multi-Robot System by Deep Reinforcement Learning2019
- Author(s)
  R. Watanuki, T. Horiuchi and T. Aodai
- Organizer
  SICE Annual Conference 2019
- Related Report
  2019 Research-status Report
- Int'l Joint Research
[Presentation] Vision-based Behavior Acquisition by Deep Reinforcement Learning in Multi-Robot Environment2019
- Author(s)
  R. Watanuki, T. Horiuchi and T. Aodai
- Organizer
  The 14th International Conference on Innovative Computing, Information and Control
- Related Report
  2019 Research-status Report
- Int'l Joint Research
[Presentation] 深層強化学習を用いた実機ロボットの室内環境での行動獲得の試み2019
- Author(s)
  栗山海渡，綿貫零真，堀内匡，青代敏行
- Organizer
  2019年電気学会電子・情報・システム部門大会
- Related Report
  2019 Research-status Report
[Presentation] A Study on Behavior Acquisition by Deep Reinforcement Learning in Multi-Robot Environment2019
- Author(s)
  R. Watanuki, T. Horiuchi and T. Aodai
- Organizer
  2019年電気学会電子・情報・システム部門大会
- Related Report
  2019 Research-status Report
[Presentation] 深層強化学習における移動ロボットの注視領域の可視化の試み2019
- Author(s)
  西田吉克，福田隼也，綿貫零真，堀内匡，青代敏行
- Organizer
  計測自動制御学会システム・情報部門学術講演会2019
- Related Report
  2019 Research-status Report
[Presentation] 深層強化学習における移動ロボットの注視領域の可視化に関する検討2019
- Author(s)
  福田隼也，綿貫零真，堀内匡，青代敏行
- Organizer
  第24回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2019 Research-status Report
[Presentation] 深層強化学習の群ロボットへの応用：移動ロボット群の行動獲得2019
- Author(s)
  綿貫零真，カンサリュー，堀内匡，青代敏行
- Organizer
  第24回日本知能情報ファジィ学会中国・四国支部大会
- Related Report
  2019 Research-status Report

Realization of deep reinforcement learning in real environment and its application to swarm robots

Principal Investigator

HORIUCHI Tadashi 松江工業高等専門学校, 電子制御工学科, 教授 (50294129)

¥4,420,000 (Direct Cost: ¥3,400,000、Indirect Cost: ¥1,020,000)

Report

Research Products

[Journal Article] モーションセンサを用いたオールの動きを可視化する試み～ローイング運動時のブレードワークに着目して～2021

Author(s)

Journal Title

Related Report

[Journal Article] Generation of Hand Contour Images using Colored Glove and Its Application to Hand Pattern Recognition2020

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Vision-based Behavior Acquisition by Deep Reinforcement Learning in Multi-Robot Environment2020

Author(s)

Journal Title

NAID

Related Report

[Presentation] A Study on Bayesian Optimization of Hyper-Parameters and Reward Settings in Deep Reinforcement Learning2024

Author(s)

Organizer

Related Report

[Presentation] 画像処理技術を用いた板金加工部品の自動寸法検査システムに関する検討2024

Author(s)

Organizer

Related Report

[Presentation] Vision Transformerによる画像認識と判断根拠の可視化2024

Author(s)

Organizer

Related Report

[Presentation] 深層強化学習を用いた群ロボットの協調行動の獲得2024

Author(s)

Organizer

Related Report

[Presentation] A Study on Optimization of Hyper-Parameters in Deep Reinforcement Learning by Bayesian Optimization2023

Author(s)

Organizer

Related Report

[Presentation] ベイズ最適化を用いた深層強化学習のハイパーパラメータの最適化に関する検討2023

Author(s)

Organizer

Related Report

[Presentation] ベイズ最適化による深層強化学習のハイパーパラメータと報酬関数の最適化の試み2023

Author(s)

Organizer

Related Report

[Presentation] 深層強化学習を用いた群ロボットの追い抜き行動の獲得2023

Author(s)

Organizer

Related Report

[Presentation] ベイズ最適化を用いた深層強化学習のハイパーパラメータの最適化2023

Author(s)

Organizer

Related Report

[Presentation] 深層強化学習を用いた群ロボットの協調行動の獲得に関する検討2022

Author(s)

Organizer

Related Report

[Presentation] 深層強化学習を用いた移動ロボットの行動獲得における改良手法の検討2022

Author(s)

Organizer

Related Report

[Presentation] 深層強化学習を用いた群ロボットの協調行動の獲得2022

Author(s)

Organizer

Related Report

[Presentation] IMUセンサを用いた漕艇動作可視化システムに関する研究2022

Author(s)

Organizer

Related Report

[Presentation] 深層強化学習を用いた群ロボットの行動獲得に関する実験的考察2021

Author(s)

Organizer

Related Report

[Presentation] ボート競技における漕艇動作可視化システムの基礎的検討2021