2020 Fiscal Year Annual Research Report

Studies on autonomous learning of agents' organizational formation for system efficiency

Research Project

Project/Area Number	20H04245
Research Institution	Waseda University
Principal Investigator	菅原俊治 (Toshiharu Sugawara), Waseda University, Faculty of Science and Engineering, Professor (70396133)
Project Period (FY)	2020-04-01 – 2024-03-31
Keywords	Multi-agent systems / Organizational behavior / Social learning / Machine learning / Organization formation / Deep reinforcement learning / Multi-agent planning
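Outline of Annual Research Achievements	First, we addressed the type of task identified as an issue the year before last, in which multiple agents with different capabilities carry out their individual subtasks in sequence. We proposed a method by which each agent learns the subtask it should take charge of, and by which coordination behavior emerges with the agents working immediately before and after it, even in more complex tasks that require more agents. At the same time, we found that sequential work involves time lags and that differences in learning progress among agents become an obstacle, so a more flexible autonomous learning method is needed. Second, we proposed a mechanism that makes explicit which objects within its observable range an agent that has learned cooperative behavior attends to, and that clarifies how its degree of attention to nearby cooperating and non-cooperating agents changes. Initial experiments showed that the degree of attention changes depending on the partner with whom the agent should form an organization or group, but this still needs to be confirmed for more complex environment and organizational structures and for cases where agents differ in processing speed. We also found that the effect of noise in the data cannot be ignored; we have started working on improvements and obtained initial results. Separately, we devised movement algorithms for agents (assumed to be self-driving robots) that rely on sound algorithmic design rather than on learning. Specifically, (1) for cases such as destinations being concentrated in one area, which existing algorithms could not handle, we achieved efficient behavior in which agents wait near their destinations and then proceed to them in turn, and (2) we proposed and evaluated a method that flexibly adapts agents' behavior when their task execution times differ or fluctuate. The former was accepted at AAMAS, a top conference in the field, and the latter at COMPSAC, an important conference on applications. In Japan, related papers also received a best paper award and an excellent paper award at a special interest group meeting of the Information Processing Society of Japan (IPSJ).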
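Current Status of Research Progress	3: Progress in research has been slightly delayed. Reason: Because of the restrictions on research and conference activities caused by COVID-19, which have continued from FY2020 to the present, we were forced to suspend or postpone parts of the research, and we consider progress to be slightly behind plan. The research content itself is proceeding according to the original plan. Our papers were accepted as full papers at important international conferences on artificial intelligence and application systems, such as AAMAS 2022 and COMPSAC 2022, and also received best and excellent paper awards at IPSJ special interest group meetings, so we judge that the research content has been well received. On the other hand, we found issues that had not been anticipated at the planning stage. For example, we examined the effect of noise in the data and proposed a basic method for improving noise robustness, for which we have initial results. Although there is some delay relative to the plan, taking into account the progress on these additional research items we consider the project to be on track overall, and we plan to continue the research in line with the plan.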
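Strategy for Future Research Activity	This fiscal year, we will advance Objective 2, "realizing optimal organization through autonomous selection of behavioral norms," which was delayed by COVID-19, and will begin work on "forming collaborative groups that achieve mutual balance and complementarity of capabilities," which was delayed for the same reason. In addition, as new research items that emerged from the work so far, we will address the effects of behavioral fluctuation and noise on cooperative and organizational behavior, as well as learning methods that change behavior according to the situation, and their explainability, toward "realizing optimal organization." After that, we plan to investigate reward structures suited to eliciting cooperative behavior and group formation, and the information they require. In particular, we will examine how the environment can give rewards while recognizing differences in learning progress among agents, and what coordination and organizational structures arise when the number of agents required for joint work is increased.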

Research Products
(12 results)

All Journal Article (7 results) (of which Peer Reviewed: 7 results, Open Access: 5 results) Presentation (5 results)

[Journal Article] Standby-Based Deadlock Avoidance Method for Multi-Agent Pickup and Delivery Tasks 2022
- Author(s)
  Tomoki Yamauchi, Yuki Miyashita and Toshiharu Sugawara
- Journal Title
  
  Proceedings of the 21st International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 2022)
  
  Volume: - Pages: 1427-1435
- DOI
  10.5555/3535850.3536009
- Peer Reviewed / Open Access
[Journal Article] Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise 2022
- Author(s)
  Yoshinari Motokawa and Toshiharu Sugawara
- Journal Title
  
  Proceedings of The 2022 International Joint Conference on Neural Networks (IJCNN 2022)
  
  Volume: 1 Pages: Accepted for publication
- Peer Reviewed
[Journal Article] Distributed and Asynchronous Planning and Execution for Multi-agent Systems through Short-Sighted Conflict Resolution 2022
- Author(s)
  Yuki Miyashita, Tomoki Yamauchi and Toshiharu Sugawara
- Journal Title
  
  Proceedings of the 2022 IEEE 46th Annual Computers, Software, and Applications Conference (COMPSAC 2022)
  
  Volume: 1 Pages: Accepted for publication
- Peer Reviewed
[Journal Article] Impact of Monetary Rewards on Users' Behavior in Social Media 2022
- Author(s)
  Yutaro Usui, Fujio Toriumi and Toshiharu Sugawara
- Journal Title
  
  Proceedings of the 10th International Conference on Complex Networks and their Applications X (Complex Networks 2021), Studies in Computational Intelligence
  
  Volume: 1015 Pages: 632-644
- DOI
  10.1007/978-3-030-93409-5_52
- Peer Reviewed / Open Access
[Journal Article] Understanding How Retweets Influence the Behaviors of Social Networking Service Users via Agent-Based Simulation 2021
- Author(s)
  Yizhou Yan, Fujio Toriumi and Toshiharu Sugawara
- Journal Title
  
  Computational Social Networks (Springer-Nature)
  
  Volume: 8 Pages: Article 18
- DOI
  10.1186/s40649-021-00099-8
- Peer Reviewed / Open Access
[Journal Article] MAT-DQN: Towards Interpretable Multi-Agent Deep Reinforcement Learning 2021
- Author(s)
  Yoshinari Motokawa and Toshiharu Sugawara
- Journal Title
  
  Proceedings of the 30th International Conference on Artificial Neural Networks (ICANN 2021)
  
  Volume: LNCS 12894 Pages: 556-567
- DOI
  10.1007/978-3-030-86380-7_45
- Peer Reviewed / Open Access
[Journal Article] Multi-agent Task Allocation Based on Reciprocal Trust in Distributed Environments 2021
- Author(s)
  Koki Sato and Toshiharu Sugawara
- Journal Title
  
  Smart Innovation, Systems and Technologies Series (Springer-Nature)
  
  Volume: 241 Pages: 477-488
- DOI
  10.1007/978-981-16-2994-5_40
- Peer Reviewed / Open Access
[Presentation] DA3: Establishing Interpretability of Cooperative Behavior and Verifying Noise Robustness in Multi-Agent Deep Reinforcement Learning 2022
- Author(s)
  元川善就, 菅原俊治
- Organizer
  Special Interest Group on Intelligent Systems (Information Processing Society of Japan)
[Presentation] Proposal of Incremental Reward Design for Learning Cooperative Behavior in Time-Limited Partially Ordered Tasks 2022
- Author(s)
  小國祥寛, 宮下裕貴, 菅原俊治
- Organizer
  Special Interest Group on Intelligent Systems (Information Processing Society of Japan)
[Presentation] 3D Formation Transition of Multiple UAVs Based on Ant Colony Optimization Considering Load Balance 2022
- Author(s)
  鈴木嘉恵, 菅原俊治
- Organizer
  Technical Committee on Artificial Intelligence and Knowledge-Based Processing (IEICE)
[Presentation] Introducing Prioritized Experience Replay into Multi-Agent Reinforcement Learning with Sparse Rewards 2022
- Author(s)
  李宗岳, 菅原俊治
- Organizer
  Technical Committee on Artificial Intelligence and Knowledge-Based Processing (IEICE)
[Presentation] Proposal of an Activation Method Based on Article Introduction in SNS 2021
- Author(s)
  臼井佑太郎, 鳥海不二夫, 菅原俊治
- Organizer
  The 35th Annual Conference of the Japanese Society for Artificial Intelligence
