Theory and Applications of Exploitation-oriented Learning XoL in Multi-agent Systems

Research Project

Project/Area Number	26330267
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	General
Research Field	Intelligent informatics
Research Institution	National Institution for Academic Degrees and Quality Enhancement of Higher Education
Principal Investigator	Miyazaki Kazuteru National Institution for Academic Degrees and Quality Enhancement of Higher Education, Research Department, Associate Professor (20282866)
Project Period (FY)	2014-04-01 – 2017-03-31
Project Status	Completed (Fiscal Year 2016)
Budget Amount	¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000); Fiscal Year 2016: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000); Fiscal Year 2015: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000); Fiscal Year 2014: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Keywords	Multi-agent systems / Exploitation-oriented learning / Reinforcement learning / Machine learning / Artificial intelligence / Soft computing / Deep learning
Outline of Final Research Achievements	This research advanced the theory and applications of Exploitation-oriented Learning (XoL) in multi-agent learning. In multi-agent learning, it is important to avoid the concurrent learning problem, which occurs when multiple agents learn simultaneously. First, we proposed a method to avoid this problem. Second, we focused on the positive effect of an indirect reward, that is, a reward given to an agent that does not receive a reward directly. In particular, we proposed a method to reduce the perceptual aliasing problem caused by imperfect perception, and we described the relationship between our previous multi-agent learning theorem and this positive effect. Finally, we extended the application areas of XoL and showed its effectiveness in multi-agent learning through experiments on Keepaway tasks, which resemble soccer games. We believe these results support the claim that XoL surpasses traditional reinforcement learning methods in multi-agent learning.

Report

(4 results)

2016 Annual Research Report; Final Research Report (PDF)
2015 Research-status Report
2014 Research-status Report

Research Products
(22 results)

Journal Article (4 results; of which Peer Reviewed: 4, Open Access: 3, Acknowledgement Compliant: 2), Presentation (16 results; of which Int'l Joint Research: 3, Invited: 1), Book (1 result), Remarks (1 result)

[Journal Article] A Study of an Indirect Reward on Multi-agent Environments (2016)
- Author(s)
  Kazuteru Miyazaki
- Journal Title
  Procedia Computer Science
  Volume: 88 Pages: 94-101
- DOI
  10.1016/j.procs.2016.07.411
- NAID
  40020804252
- Related Report
  2016 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Proposal of a Propagation Algorithm of the Expected Failure Probability and the Effectiveness on Multi-agent Environments (2016)
- Author(s)
  Hiroki Muraoka, Kazuteru Miyazaki, Hiroaki Kobayashi
- Journal Title
  IEEJ Transactions on Electronics, Information and Systems
  Volume: 136 Issue: 3 Pages: 273-281
- DOI
  10.1541/ieejeiss.136.273
- NAID
  130005132275
- ISSN
  0385-4221, 1348-8155
- Related Report
  2015 Research-status Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] Proposed Decision-Making System Based on Consciousness in Multiple Rewards and Penalties Environments (2015)
- Author(s)
  Kazuteru Miyazaki
- Journal Title
  International Journal of Machine Learning and Computing
  Volume: 5 Issue: 2 Pages: 121-126
- DOI
  10.7763/ijmlc.2015.v5.494
- Related Report
  2014 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] The Necessity of a Secondary System in Machine Consciousness (2014)
- Author(s)
  Kazuteru Miyazaki and Junichi Takeno
- Journal Title
  Procedia Computer Science
  Volume: 41 Pages: 15-22
- DOI
  10.1016/j.procs.2014.11.079
- Related Report
  2014 Research-status Report
- Peer Reviewed / Open Access
[Presentation] Proposal of an Action Selection Strategy with Expected Failure Probability and its Evaluation in Multi-agent Reinforcement Learning (2016)
- Author(s)
  Kazuteru Miyazaki, Koudai Furukawa and Hiroaki Kobayashi
- Organizer
  14th European Conference on Multi-Agent Systems
- Place of Presentation
  Valencia, Spain
- Year and Date
  2016-12-15
- Related Report
  2016 Annual Research Report
- Int'l Joint Research
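[Presentation] An Experimental Study of Exploitation-oriented Learning Incorporating Deep Learning (2016)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  IEEJ Technical Meeting on Systems, "Latest Trends in Machine Learning Research"
- Place of Presentation
  Gokeikan, Izu-Kogen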
- Year and Date
  2016-12-02
- Related Report
  2016 Annual Research Report
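[Presentation] Exploitation-oriented Learning XoL: Toward Reducing the Number of Trial-and-Error Searches in Reinforcement Learning (2016)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  6th Technical Meeting of the Intelligent Engineering Division, "Frontier Research on Intelligence"; 54th Technical Meeting of the Systems Engineering Division, "Cutting-edge Research in Machine Learning: Theory and Applications"
- Place of Presentation
  Forum Mikasa Eco, 7F Hall (Kanda, Chiyoda-ku, Tokyo)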
- Year and Date
  2016-11-07
- Related Report
  2016 Annual Research Report
- Invited
[Presentation] Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multiagent Learning (2016)
- Author(s)
  Kazuteru Miyazaki, Koudai Furukawa and Hiroaki Kobayashi
- Organizer
  International Workshop on Multiagent Learning: Theory and Applications
- Place of Presentation
  Kunibikimesse, Matsue, Shimane
- Year and Date
  2016-09-30
- Related Report
  2016 Annual Research Report
- Int'l Joint Research
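[Presentation] Exploitation-oriented Learning XoL Incorporating Deep Learning: A Comparison with Deep Q-network (2016)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  IEEJ Technical Meeting on Systems, "Frontiers of Applied Machine Learning Research"
- Place of Presentation
  Seiryo Kaikan (Nagatacho, Chiyoda-ku, Tokyo)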
- Year and Date
  2016-07-09
- Related Report
  2016 Annual Research Report
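[Presentation] Proposal of a Two-Reward PS Reinforcement Learning Method and Verification of Its Effectiveness (2016)
- Author(s)
  Naoki Kodama, Kazuteru Miyazaki, Hiroaki Kobayashi
- Organizer
  IEEJ Technical Meeting on Systems, "Frontiers of Applied Machine Learning Research"
- Place of Presentation
  Seiryo Kaikan (Nagatacho, Chiyoda-ku, Tokyo)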
- Year and Date
  2016-07-09
- Related Report
  2016 Annual Research Report
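[Presentation] A Study of an Indirect Reward on Multi-agent Environments (2016)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  IEEJ Technical Meeting on Systems, "Frontiers of Applied Machine Learning Research"
- Place of Presentation
  Tokyo Truck Jigyo Kenpo Kaikan (Chiyoda-ku, Tokyo)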
- Year and Date
  2016-03-08
- Related Report
  2015 Research-status Report
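[Presentation] Proposal of an Action Selection Strategy Incorporating Expected Failure Probability and Verification of Its Effectiveness in Multi-agent Environments (2016)
- Author(s)
  Koudai Furukawa, Kazuteru Miyazaki, Hiroaki Kobayashi
- Organizer
  IEEJ Technical Meeting on Systems, "Frontiers of Applied Machine Learning Research"
- Place of Presentation
  Tokyo Truck Jigyo Kenpo Kaikan (Chiyoda-ku, Tokyo)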
- Year and Date
  2016-03-08
- Related Report
  2015 Research-status Report
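[Presentation] Proposal of a Direct Policy Search Method Emphasizing Policy Diversity (2015)
- Author(s)
  Fumiaki Tokuhisa, Isao Ono, Kazuteru Miyazaki
- Organizer
  SICE (Society of Instrument and Control Engineers) Systems and Information Division Annual Conference 2015
- Place of Presentation
  Hakodate Arena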
- Year and Date
  2015-11-18
- Related Report
  2015 Research-status Report
[Presentation] The Necessity of a Secondary System in Multi-agent Learning (2015)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  The First International Symposium on Swarm Behavior and Bio-Inspired Robotics
- Place of Presentation
  Kyoto University
- Year and Date
  2015-10-28
- Related Report
  2015 Research-status Report
- Int'l Joint Research
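[Presentation] Analysis of a Questionnaire Survey of Degree Recipients (2015)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  IEEJ Joint Technical Meeting on Systems
- Place of Presentation
  Central Research Institute of Electric Power Industry (Chiyoda-ku, Tokyo)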
- Year and Date
  2015-06-20
- Related Report
  2015 Research-status Report
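[Presentation] A Study on the Necessity of a Secondary System in Multi-agent Learning (2015)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  IEEJ Technical Meeting on Systems, "Frontiers of Applied Machine Learning Research"
- Place of Presentation
  Aoyama Gakuin University, Sagamihara Campus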
- Year and Date
  2015-03-11
- Related Report
  2014 Research-status Report
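[Presentation] A Study on Introducing Expected Failure Probability into Profit Sharing Reinforcement Learning and Its Effectiveness (2015)
- Author(s)
  Koudai Furukawa, Kazuteru Miyazaki, Hiroaki Kobayashi
- Organizer
  27th Symposium on Autonomous Decentralized Systems
- Place of Presentation
  Tokyo University of Science, Kagurazaka Campus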
- Year and Date
  2015-01-22
- Related Report
  2014 Research-status Report
[Presentation] Proposed Decision-Making System Based on Consciousness in Multiple Rewards and Penalties Environments (2014)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  2014 International Conference on Artificial Intelligence
- Place of Presentation
  Barcelona, Spain
- Year and Date
  2014-12-23
- Related Report
  2014 Research-status Report
[Presentation] The Necessity of a Secondary System in Machine Consciousness (2014)
- Author(s)
  Kazuteru Miyazaki and Junichi Takeno
- Organizer
  2014 Annual International Conference on Biologically Inspired Cognitive Architectures
- Place of Presentation
  Massachusetts Institute of Technology, USA
- Year and Date
  2014-11-08
- Related Report
  2014 Research-status Report
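[Presentation] Proposal of a Conscious Decision-Making System for Multiple Types of Rewards and Penalties (2014)
- Author(s)
  Kazuteru Miyazaki
- Organizer
  13th Forum on Information Technology
- Place of Presentation
  University of Tsukuba, Tsukuba Campus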
- Year and Date
  2014-09-05
- Related Report
  2014 Research-status Report
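[Book] これからの強化学習 (The Future of Reinforcement Learning) (2016)
- Author(s)
  Takaki Makino, Takeshi Shibuya, Shinichi Shirakawa, Minoru Asada, Hideki Asoh, Sachiyo Arai, Hitoshi Iima, Makoto Ito, Kazuhiro Ohkura, Yasuaki Kuroe, Norikazu Sugimoto, Yuta Tsuboi, Kenji Doya, Shin-ichi Maeda, Tohgoroh Matsui, Yasuhiro Minami, Kazuteru Miyazaki, Toyomi Meguro, Tetsuro Morimura, Jun Morimoto, Toshiyuki Yasuda, Junichiro Yoshimoto
- Total Pages
  320
- Publisher
  Morikita Publishing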
- Related Report
  2016 Annual Research Report
[Remarks]
- URL
  http://www7b.biglobe.ne.jp/~kazuteru/indexj.html
- Related Report
  2014 Research-status Report
