MULTI-AGENT REINFORCEMENT LEARNING WITH NEUROEVOLUTION

Research Project

Project/Area Number	14580421
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Intelligent informatics
Research Institution	The University of Tokushima
Principal Investigator	ONO Norihiko THE UNIVERSITY OF TOKUSHIMA, FACULTY OF ENGINEERING, PROFESSOR, 工学部, 教授 (60194594)
Co-Investigator(Kenkyū-buntansha)	ONO Isao THE UNIVERSITY OF TOKUSHIMA, FACULTY OF ENGINEERING, ASSOCIATE PROFESSOR, 工学部, 助教授 (00304551)
Project Period (FY)	2002 – 2003
Project Status	Completed (Fiscal Year 2003)
Budget Amount *help	¥3,600,000 (Direct Cost: ¥3,600,000) Fiscal Year 2003: ¥1,400,000 (Direct Cost: ¥1,400,000) Fiscal Year 2002: ¥2,200,000 (Direct Cost: ¥2,200,000)
Keywords	MULTI-AGENT SYSTEMS / MULTI-AGENT LEARNING / REINFORCEMENT LEARNING / MACHINE LEARNING / EVOLUTIONARY COMPUTATION / NEURAL NETWORKS / NEURO-EVOLUTION / REAL-CODED GA
Research Abstract	Several attempts have been reported to let multiple monolithic reinforcement learning (RL) agents synthesize highly coordinated behavior needed to accomplish their common goal effectively. Most of these straightforward application of RL scale poorly to more complex multi-agent learning problems, because the state space for each RL agent grows exponentially with the number of its partner agents engaged in the joint task. To cope with the exponentially large state space in multi-agent RL (MARL), we previously proposed a MARL scheme, based on neural network representation of the decision policy for an agent and its optimization with a real-coded GA, and showed the effectiveness of the scheme through its application to those multi-agent learning problems that can not be solved appropriately using any other conventional MARL scheme, such as the asynchronous multi-agent seesaw balancing problem and the dynamic channel allocation problem in cellular telephone systems. However, we can not apply the scheme directly to the design problems of large-scale multi-agent systems (MASs), because the scheme needs a huge amount of computation resources. To remedy the drawback, we propose a hierarchical design scheme of a large-scale MAS, which simply decomposes the whole task of the MAS into its subtasks hierarchically and optimizes each of the subtasks with the above-mentioned MARL scheme. The effectiveness of the design scheme is shown through its application to the RoboCup soccer team design problem where the task of the team is decomposed into the primitive actions by the soccer agents, interaction among the actions and coordination by the agents.

Report

(3 results)

2003 Annual Research Report Final Research Report Summary
2002 Annual Research Report

Research Products
(34 results)

All 2004 2003 2002 Other

All Journal Article (22 results) Publications (12 results)

[Journal Article] A grid-Oriented Genetic Algorithm Framework for Bioinformatics2004
- Author(s)
  Hiroaki Imade
- Journal Title
  
  New Generation Computing 22
  
  Pages: 177-186
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] 対戦型ゲーム戦略の共進化的獲得に関する実験的考察2004
- Author(s)
  橋勇人
- Journal Title
  
  第48回システム制御情報学会研究発表講演会講演論文集
  
  Pages: 595-596
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] 非同期的マルチエージェント系の政策獲得に関する実験的考察2004
- Author(s)
  清水雅也
- Journal Title
  
  第48回システム制御情報学会研究発表講演会講演論文集
  
  Pages: 615-616
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] A Grid-Oriented Genetic Algorithm Framework for Bioinformatics2004
- Author(s)
  Hiroaki Imade, Ryohei Morishita, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  New Generation Computing Vol.22
  
  Pages: 177-186
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] On Emergent Design Scheme of Asynchronous Multi-agent Systems (in Japanese)2004
- Author(s)
  Masaya Shimizu, Norihiko Ono, Isao Ono
- Journal Title
  
  Proc.the 48th Annual Conference of the Institute of Systems, Control and Information Engineers (ISCIE), Kyoto, May 19-21,2004
  
  Pages: 615-616
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] "Gridifying" A Parallel NSS-EA using The Improved GOGA Framework And Its Performance Evaluation on OBI Grid2004
- Author(s)
  Hiroaki Imade, Naoaki Mizuguchi, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  First International Workshop on Life Science Grid (LSGRID2004)
  
  Pages: 159-160
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] An Evolutionary Algorithm Taking Account of Mutual Interactions among Substances for Inference of Genetic Networks2004
- Author(s)
  Isao Ono, Yoshiaki Seike, Ryohei Morishita, Norihiko Ono, Masahiko Nakatsui, Masahiro Okamoto
- Journal Title
  
  Proc.the 2004 Congress on Evolutionary Computation (CEC2004)
  
  Pages: 2060-2067
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] "Gridifying" A Parallel NSS-EA using The improved GOGA Framework And Its Performance Evaluation on OBIGrid,2004
- Author(s)
  Hiroaki Imade, Naoaki Mizuguchi, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  Proc.the First International Workshop of Life Science Grid
  
  Pages: 159-160
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] マルチエージェントによる対戦型ゲームへの共進化的接近に関する実験的考察2003
- Author(s)
  伊吹大介
- Journal Title
  
  第47回システム制御情報学会研究発表講演会講演論文集
  
  Pages: 497-498
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] 異種エージェントによる対戦型ゲームへの共進化的接近に関する実験的考察2003
- Author(s)
  寺坂和也
- Journal Title
  
  第47回システム制御情報学会研究発表講演会講演論文集
  
  Pages: 499-500
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] A Genetic Hill Climbing for Function Optimization Using A Neighborhood Based on Interaction among Prameters2003
- Author(s)
  Hiroshi Takeichi
- Journal Title
  
  Proceedings of the 2003 Congress on Evolutionary Computation (CEC2003)
  
  Pages: 1251-1258
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] A Framework of Grid-Oriented Genetic Algorithms for Large-Scale Optimization in Bioinformatics2003
- Author(s)
  Hiroaki Imade, Ryohei Morishita, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  Proc.the 2003 Congress on Evolutionary Computation (CEC2003)
  
  Pages: 623-630
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] Finding Multiple Solutions Based on An Evolutionary Algorithm for Inference of Genetic Networks by S-system2003
- Author(s)
  Ryohei Morishita, Hiroaki Imade, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  Proc.the 2003 Congress on Evolutionary Computation (CEC2003)
  
  Pages: 615-622
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] A Genetic Hill Climbing for Function Optimization Using A Neighborhood Based on Interaction among Parameters2003
- Author(s)
  Hiroshi Takeichi, Isao Ono, Norihiko Ono
- Journal Title
  
  Proc.the 2003 Congress on Evolutionary Computation (CEC2003)
  
  Pages: 1251-1258
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] A Grid-Oriented Genetic Algorithm for Estimating Genetic Networks by S-Systems2003
- Author(s)
  Hiroaki Imade, Ryohei Morishita, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  Proc.of SICE Annual Conference 2003 in Fukui
  
  Pages: 3317-3322
- NAID
  130005440510
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] Multi-agent Reinforcement Learning with Evolutionary Neural Networks (in Japanese)2002
- Author(s)
  Takayuki Yamamoto, Souya Michitsuji, Isao Ono, Norihiko Ono
- Journal Title
  
  Proc.the 2002 SICE Symposium on Systems and Information (SSI-2002)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] An Evolutionary Algorithm for Inference of Genetic Networks on A Grid Computing Environment
- Author(s)
  Hiroaki Imade, Naoaki Mizuguchi, Isao Ono, Norihiko Ono, Masahiro Okamoto
- Journal Title
  
  Proc.the Fifth International Conference on Simulated Evolution And Learning, Busan, Oct.2004 (CD-ROM)
  
  Pages: 1-6
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] On Co-Evolutionary Optimization of Competitive Game Strategies (in Japanese)
- Author(s)
  Hayato Hashi, Daisuke Ibuki, Norihiko Ono, Isao Ono
- Journal Title
  
  Proc.the 48th Annual Conference of the Institute of Systems, Control and Information Engineers (ISCIE), Kyoto, May 19-21,2004
  
  Pages: 595-596
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] On Co-Evolutionary Acquisition of Competitive Polices by Two Kinds of Agents, (in Japanese)
- Author(s)
  Kazuya Terasala, Souya Michitsuji, Norihiko Ono, Isao Ono
- Journal Title
  
  Proc.the 47th Annual Conference of the Institute of Systems, Control and Information Engineers (ISCIE), Kyoto, May 14-16,2003
  
  Pages: 499-500
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] On Co-Evolutionary Approach to Competitive Games by Multiple Agent (in Japanese)
- Author(s)
  Daisuke Ibuki, Souya Michitsuji, Norihiko Ono, Isao Ono
- Journal Title
  
  Proc.the 47th Annual Conference of the Institute of Systems, Control and Information Engineers (ISCIE), Kyoto, May 14-16,2003
  
  Pages: 497-498
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] On Designing Soccer Agents with Evolutionary Neural Networks (in Japanese)
- Author(s)
  Souya Michitsuji, Norihiko Ono, Isao Ono
- Journal Title
  
  Proc.the 46th Annual Conference of the Institute of Systems, Control and Information Engineers (ISCIE), Kobe, May 15-17,2002
  
  Pages: 545-546
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Journal Article] Global Optimization of Protein 3-Dimensional Structures in NMR by A Genetic Algorithm
- Author(s)
  Isao Ono, Hiroshi Fujiki, Masaki Ootsuka, Naoto Nakashima, Norihiko Ono, Shin-ich Tate
- Journal Title
  
  Proc.the 2002 Congress on Evolutionary Computation, Honolulu, May 2002
  
  Pages: 303-308
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2003 Final Research Report Summary
[Publications] 橋勇人: "対戦型ゲーム戦略の共進化的獲得に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)
- Related Report
  2003 Annual Research Report
[Publications] 石本匡哉: "ニューラルネットエージェントの例題の共進化に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)
- Related Report
  2003 Annual Research Report
[Publications] 清水雅也: "非同期的マルチエージェント系の政策獲得に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)
- Related Report
  2003 Annual Research Report
[Publications] 北村章: "リカレントニューラルネットの構造と重みの進化的最適化に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)
- Related Report
  2003 Annual Research Report
[Publications] Hiroaki Imada: "A Grid-Oriented Genetic Algorithm Framework for Bioinformatics"New Generation Computing. 22・2. 177-186 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Hiroshi Takeichi: "A Genetic Hill Climbing for Function Optimization Using A Neighborhood Based on Interaction among Parameters"Proc.2003 Congress on Evolutionary Computation (CEC2003). 1251-1258 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 道辻壮哉: "進化型ニューラルネットによるサッカーエージェントの創発的設計"第46回システム制御情報学会研究発表講演会講演論文集. 545-546 (2002)
- Related Report
  2002 Annual Research Report
[Publications] 間口将行: "対戦型ゲームにおける行動政策の共進化的獲得のための世代交代モデル"第46回システム制御情報学会研究発表講演会講演論文集. 577-578 (2002)
- Related Report
  2002 Annual Research Report
[Publications] 山元隆行: "マルチエージェント強化学習への進化型ニューラルネットによる接近"計測自動制御学会システム・情報部門学術講演会予講集. (2002)
- Related Report
  2002 Annual Research Report
[Publications] 伊吹大介: "マルチエージェントによる対戦型ゲームヘの共進化的接近に関する実験的考察"第47回システム制御情報学会研究発表講演会講演論文集. (2003)
- Related Report
  2002 Annual Research Report
[Publications] 寺坂和也: "異種エージェントによる対戦型ゲームヘの共進化的接近に関する実験的考察"第47回システム制御情報学会研究発表講演会講演論文集. (2003)
- Related Report
  2002 Annual Research Report
[Publications] 阿部哲: "リカレントニューラルネットの構造と重みの進化的最適化のための世代交代モデルの提案"第47回システム制御情報学会研究発表講演会講演論文集. (2003)
- Related Report
  2002 Annual Research Report

MULTI-AGENT REINFORCEMENT LEARNING WITH NEUROEVOLUTION

Principal Investigator

ONO Norihiko THE UNIVERSITY OF TOKUSHIMA, FACULTY OF ENGINEERING, PROFESSOR, 工学部, 教授 (60194594)

¥3,600,000 (Direct Cost: ¥3,600,000)

Report

Research Products

[Journal Article] A grid-Oriented Genetic Algorithm Framework for Bioinformatics2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] 対戦型ゲーム戦略の共進化的獲得に関する実験的考察2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] 非同期的マルチエージェント系の政策獲得に関する実験的考察2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Grid-Oriented Genetic Algorithm Framework for Bioinformatics2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] On Emergent Design Scheme of Asynchronous Multi-agent Systems (in Japanese)2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] "Gridifying" A Parallel NSS-EA using The Improved GOGA Framework And Its Performance Evaluation on OBI Grid2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] An Evolutionary Algorithm Taking Account of Mutual Interactions among Substances for Inference of Genetic Networks2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] "Gridifying" A Parallel NSS-EA using The improved GOGA Framework And Its Performance Evaluation on OBIGrid,2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] マルチエージェントによる対戦型ゲームへの共進化的接近に関する実験的考察2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] 異種エージェントによる対戦型ゲームへの共進化的接近に関する実験的考察2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Genetic Hill Climbing for Function Optimization Using A Neighborhood Based on Interaction among Prameters2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Framework of Grid-Oriented Genetic Algorithms for Large-Scale Optimization in Bioinformatics2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Finding Multiple Solutions Based on An Evolutionary Algorithm for Inference of Genetic Networks by S-system2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Genetic Hill Climbing for Function Optimization Using A Neighborhood Based on Interaction among Parameters2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Grid-Oriented Genetic Algorithm for Estimating Genetic Networks by S-Systems2003

Author(s)

Journal Title

NAID

[Publications] 橋勇人: "対戦型ゲーム戦略の共進化的獲得に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)

[Publications] 石本匡哉: "ニューラルネットエージェントの例題の共進化に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)

[Publications] 清水雅也: "非同期的マルチエージェント系の政策獲得に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)

[Publications] 北村章: "リカレントニューラルネットの構造と重みの進化的最適化に関する実験的考察"第48回システム制御情報学会研究発表講演会講演論文集. (2004)

[Publications] 道辻壮哉: "進化型ニューラルネットによるサッカーエージェントの創発的設計"第46回システム制御情報学会研究発表講演会講演論文集. 545-546 (2002)

[Publications] 間口将行: "対戦型ゲームにおける行動政策の共進化的獲得のための世代交代モデル"第46回システム制御情報学会研究発表講演会講演論文集. 577-578 (2002)

[Publications] 山元隆行: "マルチエージェント強化学習への進化型ニューラルネットによる接近"計測自動制御学会システム・情報部門学術講演会予講集. (2002)

[Publications] 伊吹大介: "マルチエージェントによる対戦型ゲームヘの共進化的接近に関する実験的考察"第47回システム制御情報学会研究発表講演会講演論文集. (2003)

[Publications] 寺坂和也: "異種エージェントによる対戦型ゲームヘの共進化的接近に関する実験的考察"第47回システム制御情報学会研究発表講演会講演論文集. (2003)

[Publications] 阿部哲: "リカレントニューラルネットの構造と重みの進化的最適化のための世代交代モデルの提案"第47回システム制御情報学会研究発表講演会講演論文集. (2003)