2016 Fiscal Year Annual Research Report

Theory and Applications of Exploitation-oriented Learning XoL in Multi-agent Systems

Research Project

Project/Area Number	26330267
Research Institution	National Institution for Academic Degrees and Quality Enhancement of Higher Education
Principal Investigator	宮崎和光独立行政法人大学改革支援・学位授与機構, 研究開発部, 准教授 (20282866)
Project Period (FY)	2014-04-01 – 2017-03-31
Keywords	マルチエージェントシステム / 経験強化型学習 / 強化学習 / 機械学習 / 人工知能 / 深層学習 / ソフトコンピューティング
Outline of Annual Research Achievements	本研究課題では、マルチエージェント学習における経験強化型学習XoLに関する発展的研究を行った。特に、平成２８年度においては、当初の予定通り、マルチエージェント学習におけるXoLの有効性を示すための応用例の拡充を図った。特に、Keepaway Taskとして呼ばれるサッカーゲームを模したマルチエージェント環境への応用を行った。具体的には、「International Workshop on Multiagent Learning: Theory and Applications」において、サッカーロボットに関するプロトタイプ的な手法を提案するとともに、「14th European Conference on Multi-Agent Systems」においては、それをさらに発展・拡充させた「Proposal of an Action Selection Strategy with Expected Failure Probability and its Evaluation in Multi-agent Reinforcement Learning」という題目の発表を行った。また、近年、深層学習が大きな広がりをみせているので、深層学習を利用した応用例として、「電気学会システム研究会」において、ATARI 2600ゲームを題材とした研究発表を行った。そこでは、Deep Q-Networkとして知られる既存手法にXoLの一手法であるProfit Sharingを組み合わせことで、大幅なる性能の改善を実現した。なお、本成果はシングルエージェント環境下のものであるが、現在は、本研究課題の目的であるマルチエージェント環境下への適用を進めている。特に、Palro、Rapiro、LEGOロボット等の異なる種類のロボットにより構成されるマルチロボット環境下への意識的意思決定システムの実装を中心に研究を進めている。

Research Products
(8 results)

All 2016

All Journal Article (1 results) (of which Peer Reviewed: 1 results, Open Access: 1 results, Acknowledgement Compliant: 1 results) Presentation (6 results) (of which Int'l Joint Research: 2 results, Invited: 1 results) Book (1 results)

[Journal Article] A Study of an Indirect Reward on Multi-agent Environments2016
- Author(s)
  Kazuteru Miyazaki
- Journal Title
  
  Procedia Computer Science
  
  Volume: 88 Pages: 94-101
- DOI
  https://doi.org/10.1016/j.procs.2016.07.411
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Presentation] Proposal of an Action Selection Strategy with Expected Failure Probability and its Evaluation in Multi-agent Reinforcement Learning2016
- Author(s)
  Kazuteru Miyazaki, Koudai Furukawa and Hiroaki Kobayashi
- Organizer
  14th European Conference on Multi-Agent Systems
- Place of Presentation
  ヴァレンシア（スペイン）
- Year and Date
  2016-12-15 – 2016-12-16
- Int'l Joint Research
[Presentation] 深層学習を組み込んだ経験強化型学習に関する実験的考察2016
- Author(s)
  宮崎和光
- Organizer
  電気学会システム研究会「機械学習研究の最新動向」
- Place of Presentation
  伊豆高原　五景館
- Year and Date
  2016-12-02
[Presentation] 経験強化型学習XoL －強化学習における試行錯誤回数の低減をめざして－2016
- Author(s)
  宮崎和光
- Organizer
  第6回知能工学部会研究会「賢さの先端研究会」，第54 回システム工学部会研究会機械学習の最先端研究- 理論および応用研究 -
- Place of Presentation
  フォーラムミカサエコ 7Fホール（東京都千代田区神田）
- Year and Date
  2016-11-07
- Invited
[Presentation] Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multiagent Learning2016
- Author(s)
  Kazuteru Miyazaki, Koudai Furukawa and Hiroaki Kobayashi
- Organizer
  International Workshop on Multiagent Learning: Theory and Applications
- Place of Presentation
  Kunibikimesse, Matsue, Shimane
- Year and Date
  2016-09-30
- Int'l Joint Research
[Presentation] 深層学習を組み込んだ経験強化型学習XoL：deep Q-networkとの比較2016
- Author(s)
  宮崎和光
- Organizer
  電気学会システム研究会「機械学習応用研究の最前線」
- Place of Presentation
  星陵会館（東京都千代田区永田町）
- Year and Date
  2016-07-09
[Presentation] 2報酬PS強化学習法の提案とその有効性の検証2016
- Author(s)
  小玉直樹、宮崎和光、小林博明
- Organizer
  電気学会システム研究会「機械学習応用研究の最前線」
- Place of Presentation
  星陵会館（東京都千代田区永田町）
- Year and Date
  2016-07-09
[Book] これからの強化学習2016
- Author(s)
  牧野貴樹、澁谷長史、白川真一、浅田稔、麻生英樹、荒井幸代、飯間等、伊藤真、大倉和博、黒江康明、杉本徳和、坪井祐太、銅谷賢治、前田新一、松井藤五郎、南泰浩、宮崎和光、目黒豊美、森村哲郎、森本淳、保田俊行、吉本潤一郎
- Total Pages
  320
- Publisher
  森北出版

2016 Fiscal Year Annual Research Report

Theory and Applications of Exploitation-oriented Learning XoL in Multi-agent Systems

Principal Investigator

宮崎 和光 独立行政法人大学改革支援・学位授与機構, 研究開発部, 准教授 (20282866)

Research Products

[Journal Article] A Study of an Indirect Reward on Multi-agent Environments2016

Author(s)

Journal Title

DOI

[Presentation] Proposal of an Action Selection Strategy with Expected Failure Probability and its Evaluation in Multi-agent Reinforcement Learning2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 深層学習を組み込んだ経験強化型学習に関する実験的考察2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 経験強化型学習XoL －強化学習における試行錯誤回数の低減をめざして－2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multiagent Learning2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 深層学習を組み込んだ経験強化型学習XoL：deep Q-networkとの比較2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 2報酬PS強化学習法の提案とその有効性の検証2016

Author(s)

Organizer

Place of Presentation

Year and Date

[Book] これからの強化学習2016

Author(s)

Total Pages

Publisher

宮崎和光独立行政法人大学改革支援・学位授与機構, 研究開発部, 准教授 (20282866)