群強化学習法の開発

Research Project

Project/Area Number	19650031
Research Category	Grant-in-Aid for Exploratory Research
Allocation Type	Single-year Grants
Research Field	Intelligent informatics
Research Institution	Kyoto Institute of Technology
Principal Investigator	飯間等 Kyoto Institute of Technology, 工芸科学研究科, 准教授 (70273547)
Co-Investigator(Kenkyū-buntansha)	黒江康明京都工芸繊維大学, 工芸科学研究科, 教授 (10153397)
Project Period (FY)	2007 – 2008
Project Status	Completed (Fiscal Year 2008)
Budget Amount *help	¥1,800,000 (Direct Cost: ¥1,800,000) Fiscal Year 2008: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2007: ¥1,000,000 (Direct Cost: ¥1,000,000)
Keywords	強化学習 / Particle Swarm Optimization / アントコロニー最適化法 / 群知能
Research Abstract	通常の強化学習では一つのエージェントのみを用いて学習を行うので複雑な問題では学習に時間がかかりすぎるという欠点がある。したがって、強化学習の実用化に向けて学習を高速に行う新しい方法を開発することが必要不可欠である。本研究では、短時間で学習を行うために複数のエージェントを用意し、各エージェントが通常の強化学習法で学習を行うとともに、エージェント間の情報交換により他のエージェントの学習成果を参照して学習を行う群強化学習法を提案した。本年度は、鳥の群れ行動にヒントを得た最適化手法であるParticle Swarm Optimizationを用いた群強化学習法におけるエージェント間の情報交換方法を提案した。また、各エージェントが行う個別学習法として、SarsaやActor-Criticを用いた方法を提案した。また、より複雑な問題に対する群強化学習法の有効性を検証するために、倒立振子制御問題、サッカーゲーム問題、マルチエージェント環境の問題に群強化学習法を適用し、これらの問題に対しても短時間に良い方策を獲得できることを確認した。さらに、蟻の群れ行動にヒントを得た最適化手法であるアントコロニー最適化法を用いた群強化学習法を提案した。この群強化学習法では他のエージェントの学習成果を行動選択に利用する新しい枠組みを用いている。以上の成果より、従来の1エージェント強化学習法より短時間に良い方策を獲得できる群強化学習法を開発することができた。

Report

(2 results)

2008 Annual Research Report
2007 Annual Research Report

Research Products
(14 results)

All 2009 2008 2007

All Journal Article (5 results) (of which Peer Reviewed: 5 results) Presentation (9 results)

[Journal Article] Swarm Reinforcement Learning Algorithms Based on Sarsa Method2008
- Author(s)
  Hitoshi Iima, Yasuaki Kuroe
- Journal Title
  
  Proceedings of SICE Annual Conference 2008
  
  Pages: 2045-2049
- Related Report
  2008 Annual Research Report
- Peer Reviewed
[Journal Article] Swarm Reinforceient Learning Algori thms Based on Particle Swarm Optimization2008
- Author(s)
  Hitoshi Iima, Yasuaki Kuroe
- Journal Title
  
  Proceedings of 2008 International Conference on Systerns, Man and Cybernetics
  
  Pages: 1110-1115
- Related Report
  2008 Annual Research Report
- Peer Reviewed
[Journal Article] 各個体の自律探索機能を強化したParticle Swarm Optimization2008
- Author(s)
  飯間等
- Journal Title
  
  計測自動制御学会論文集 44
  
  Pages: 61-70
- NAID
  10020126978
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] Swarm Reinforcement Learning Algorithms-Exchange of Informationamong Multiple Agents2007
- Author(s)
  Hitoshi Iima
- Journal Title
  
  SICE Annual Conference 2007 Proceedings
  
  Pages: 2779-2784
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] Particle Swarm Optimization with Enhanced Autonomous Search Ability2007
- Author(s)
  Hitoshi Iima
- Journal Title
  
  Proceedings of the 7th International Conference on Optimization: Techniques and Applications
- NAID
  130006980491
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Presentation] 群強化学習法のサッカーゲーム問題への適用2009
- Author(s)
  馬場口豊, 黒江康明, 飯間等
- Organizer
  第36回知能システムシンポジウム
- Place of Presentation
  京都
- Year and Date
  2009-03-18
- Related Report
  2008 Annual Research Report
[Presentation] マルチエージェントタスクに対する群強化学習法の適用2009
- Author(s)
  野口直起, 黒江康明, 飯間等
- Organizer
  第36回知能システムシンポジウム
- Place of Presentation
  京都
- Year and Date
  2009-03-17
- Related Report
  2008 Annual Research Report
[Presentation] Actor-Criticを用いた群強化学習法-情報交換の方法とその性能評価-2008
- Author(s)
  飯間等, 黒江康明
- Organizer
  計測自動制御学会システム・情報部門学術講演会2008
- Place of Presentation
  姫路
- Year and Date
  2008-11-28
- Related Report
  2008 Annual Research Report
[Presentation] フェロモンに基づく行動選択手法を用いた群強化学習法とその性能評価2008
- Author(s)
  松田祥子, 黒江康明, 飯間等
- Organizer
  計測自動制御学会システム・情報部門学術講演会2008
- Place of Presentation
  姫路
- Year and Date
  2008-11-26
- Related Report
  2008 Annual Research Report
[Presentation] 連続状態行動空間におけるActor-Criticを用いた群強化学習法2008
- Author(s)
  飯間等, 黒江康明
- Organizer
  第52回システム制御情報学会研究発表講演会
- Place of Presentation
  京都
- Year and Date
  2008-05-17
- Related Report
  2008 Annual Research Report
[Presentation] Actor-Criticを用いた群強化学習法2008
- Author(s)
  飯間等
- Organizer
  第35回知能システムシンポジウム
- Place of Presentation
  東京
- Year and Date
  2008-03-17
- Related Report
  2007 Annual Research Report
[Presentation] フェロモンに基づく行動選択手法を用いた群強化学習法2008
- Author(s)
  松田祥子
- Organizer
  第35回知能システムシンポジウム
- Place of Presentation
  東京
- Year and Date
  2008-03-17
- Related Report
  2007 Annual Research Report
[Presentation] アントコロニー最適化法に基づく群強化学習法とその性能評価2007
- Author(s)
  松田祥子
- Organizer
  システム・情報部門学術講演会2007
- Place of Presentation
  東京
- Year and Date
  2007-11-26
- Related Report
  2007 Annual Research Report
[Presentation] アントコロニー最適化法に基づく群強化学習法2007
- Author(s)
  松田祥子
- Organizer
  第51回システム制御情報学会研究発表講演会
- Place of Presentation
  京都
- Year and Date
  2007-05-16
- Related Report
  2007 Annual Research Report

群強化学習法の開発

Principal Investigator

飯間 等 Kyoto Institute of Technology, 工芸科学研究科, 准教授 (70273547)

¥1,800,000 (Direct Cost: ¥1,800,000)

Report

Research Products

[Journal Article] Swarm Reinforcement Learning Algorithms Based on Sarsa Method2008

Author(s)

Journal Title

Related Report

[Journal Article] Swarm Reinforceient Learning Algori thms Based on Particle Swarm Optimization2008

Author(s)

Journal Title

Related Report

[Journal Article] 各個体の自律探索機能を強化したParticle Swarm Optimization2008

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Swarm Reinforcement Learning Algorithms-Exchange of Informationamong Multiple Agents2007

Author(s)

Journal Title

Related Report

[Journal Article] Particle Swarm Optimization with Enhanced Autonomous Search Ability2007

Author(s)

Journal Title

NAID

Related Report

[Presentation] 群強化学習法のサッカーゲーム問題への適用2009

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] マルチエージェントタスクに対する群強化学習法の適用2009

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Actor-Criticを用いた群強化学習法-情報交換の方法とその性能評価-2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] フェロモンに基づく行動選択手法を用いた群強化学習法とその性能評価2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 連続状態行動空間におけるActor-Criticを用いた群強化学習法2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Actor-Criticを用いた群強化学習法2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] フェロモンに基づく行動選択手法を用いた群強化学習法2008

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] アントコロニー最適化法に基づく群強化学習法とその性能評価2007

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] アントコロニー最適化法に基づく群強化学習法2007

Author(s)

Organizer

Place of Presentation

飯間等 Kyoto Institute of Technology, 工芸科学研究科, 准教授 (70273547)