• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Theory and Applications of Exploitation-oriented Learning XoL in Multi-agent Systems

Research Project

Project/Area Number 26330267
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Intelligent informatics
Research InstitutionNational Institution for Academic Degrees and Quality Enhancement of Higher Education

Principal Investigator

Miyazaki Kazuteru  独立行政法人大学改革支援・学位授与機構, 研究開発部, 准教授 (20282866)

Project Period (FY) 2014-04-01 – 2017-03-31
Project Status Completed (Fiscal Year 2016)
Budget Amount *help
¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)
Fiscal Year 2016: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2015: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2014: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Keywordsマルチエージェントシステム / 経験強化型学習 / 強化学習 / 機械学習 / 人工知能 / ソフトコンピューティング / 深層学習
Outline of Final Research Achievements

This research has achieved several progresses about theory and applications of Exploitation-oriented Learning XoL in multi-agent learning. In multi-agent learning, it is important to avoid the concurrent learning problem that occurs when multiple agents learn simultaneously. Firstly, we have proposed a method to avoid the problem. Secondly, we have focused on a positive effect of an indirect reward which is given to the agent that does not receive a reward directly. Especially, we have proposed a method to reduce the perceptual aliasing problem caused by imperfect perception. We have also described the relationship between our previous multi-agent learning theorem and the positive effect. Lastly, we have extended application areas to show the effectiveness of XoL in multi-agent learning through experiments to Keepaway tasks like soccer games. We believe that these results contribute to claim that XoL surpasses traditional reinforcement learning methods in multi-agent learning.

Report

(4 results)
  • 2016 Annual Research Report   Final Research Report ( PDF )
  • 2015 Research-status Report
  • 2014 Research-status Report
  • Research Products

    (22 results)

All 2016 2015 2014 Other

All Journal Article (4 results) (of which Peer Reviewed: 4 results,  Open Access: 3 results,  Acknowledgement Compliant: 2 results) Presentation (16 results) (of which Int'l Joint Research: 3 results,  Invited: 1 results) Book (1 results) Remarks (1 results)

  • [Journal Article] A Study of an Indirect Reward on Multi-agent Environments2016

    • Author(s)
      Kazuteru Miyazaki
    • Journal Title

      Procedia Computer Science

      Volume: 88 Pages: 94-101

    • DOI

      10.1016/j.procs.2016.07.411

    • NAID

      40020804252

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed / Open Access / Acknowledgement Compliant
  • [Journal Article] Proposal of a Propagation Algorithm of the Expected Failure Probability and the Effectiveness on Multi-agent Environments2016

    • Author(s)
      村岡宏紀、宮崎和光、小林博明
    • Journal Title

      IEEJ Transactions on Electronics, Information and Systems

      Volume: 136 Issue: 3 Pages: 273-281

    • DOI

      10.1541/ieejeiss.136.273

    • NAID

      130005132275

    • ISSN
      0385-4221, 1348-8155
    • Related Report
      2015 Research-status Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Proposed Decision-Making System Based on Consciousness in Multiple Rewards and Penalties Environments2015

    • Author(s)
      Kazuteru Miyazaki
    • Journal Title

      International Journal of Machine Learning and Computing

      Volume: 5 (2) Issue: 2 Pages: 121-126

    • DOI

      10.7763/ijmlc.2015.v5.494

    • Related Report
      2014 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] The Necessity of a Secondary System in Machine Consciousness2014

    • Author(s)
      Kazuteru Miyazaki and Junichi Takeno
    • Journal Title

      Procedia Computer Science

      Volume: 41 Pages: 15-22

    • DOI

      10.1016/j.procs.2014.11.079

    • Related Report
      2014 Research-status Report
    • Peer Reviewed / Open Access
  • [Presentation] Proposal of an Action Selection Strategy with Expected Failure Probability and its Evaluation in Multi-agent Reinforcement Learning2016

    • Author(s)
      Kazuteru Miyazaki, Koudai Furukawa and Hiroaki Kobayashi
    • Organizer
      14th European Conference on Multi-Agent Systems
    • Place of Presentation
      ヴァレンシア(スペイン)
    • Year and Date
      2016-12-15
    • Related Report
      2016 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 深層学習を組み込んだ経験強化型学習に関する実験的考察2016

    • Author(s)
      宮崎和光
    • Organizer
      電気学会 システム研究会 「機械学習研究の最新動向」
    • Place of Presentation
      伊豆高原 五景館
    • Year and Date
      2016-12-02
    • Related Report
      2016 Annual Research Report
  • [Presentation] 経験強化型学習XoL -強化学習における試行錯誤回数の低減をめざして-2016

    • Author(s)
      宮崎和光
    • Organizer
      第6回知能工学部会研究会「賢さの先端研究会」,第54 回システム工学部会研究会 機械学習の最 先端研究- 理論および応用研究 -
    • Place of Presentation
      フォーラムミカサ エコ 7Fホール(東京都千代田区神田)
    • Year and Date
      2016-11-07
    • Related Report
      2016 Annual Research Report
    • Invited
  • [Presentation] Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multiagent Learning2016

    • Author(s)
      Kazuteru Miyazaki, Koudai Furukawa and Hiroaki Kobayashi
    • Organizer
      International Workshop on Multiagent Learning: Theory and Applications
    • Place of Presentation
      Kunibikimesse, Matsue, Shimane
    • Year and Date
      2016-09-30
    • Related Report
      2016 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 深層学習を組み込んだ経験強化型学習XoL:deep Q-networkとの比較2016

    • Author(s)
      宮崎和光
    • Organizer
      電気学会 システム研究会 「機械学習応用研究の最前線」
    • Place of Presentation
      星陵会館(東京都千代田区永田町)
    • Year and Date
      2016-07-09
    • Related Report
      2016 Annual Research Report
  • [Presentation] 2報酬PS強化学習法の提案とその有効性の検証2016

    • Author(s)
      小玉直樹、宮崎和光、小林博明
    • Organizer
      電気学会 システム研究会 「機械学習応用研究の最前線」
    • Place of Presentation
      星陵会館(東京都千代田区永田町)
    • Year and Date
      2016-07-09
    • Related Report
      2016 Annual Research Report
  • [Presentation] マルチエージェント環境における間接報酬に関する一考察2016

    • Author(s)
      宮崎和光
    • Organizer
      電気学会 システム研究会 「機械学習応用研究の最前線」
    • Place of Presentation
      東京トラック事業健保会館(東京都千代田区)
    • Year and Date
      2016-03-08
    • Related Report
      2015 Research-status Report
  • [Presentation] 予想失敗確率を組み込んだ行動選択戦略の提案とマルチエージェント環境下での有効性の検証2016

    • Author(s)
      古川耕大、宮崎和光、小林博明
    • Organizer
      電気学会 システム研究会 「機械学習応用研究の最前線」
    • Place of Presentation
      東京トラック事業健保会館(東京都千代田区)
    • Year and Date
      2016-03-08
    • Related Report
      2015 Research-status Report
  • [Presentation] 政策の多様性を重視した直接政策探索法の提案2015

    • Author(s)
      徳久文彬,小野功,宮崎和光
    • Organizer
      計測自動制御学会 システム・情報部門 学術講演会 2015
    • Place of Presentation
      函館アリーナ
    • Year and Date
      2015-11-18
    • Related Report
      2015 Research-status Report
  • [Presentation] The Necessity of a Secondary System in Multi-agent Learning2015

    • Author(s)
      Kazuteru Miyazaki
    • Organizer
      The First International Symposium on Swarm Behavior and Bio-Inspired Robotics
    • Place of Presentation
      Kyoto University
    • Year and Date
      2015-10-28
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] 学位取得者に対するアンケート調査の分析2015

    • Author(s)
      宮崎和光
    • Organizer
      電気学会 合同システム研究会
    • Place of Presentation
      電力中央研究所(東京都千代田区)
    • Year and Date
      2015-06-20
    • Related Report
      2015 Research-status Report
  • [Presentation] マルチエージェント学習における2次系の必要性に関する研究2015

    • Author(s)
      宮崎和光
    • Organizer
      電気学会 システム研究会 機械学習応用研究の最前線
    • Place of Presentation
      青山学院大学 相模原キャンパス
    • Year and Date
      2015-03-11
    • Related Report
      2014 Research-status Report
  • [Presentation] Profit Sharing強化学習への予想失敗確率の導入とその有効性に関する研究2015

    • Author(s)
      古川耕大, 宮崎和光小林博明
    • Organizer
      第27回自律分散システムシンポジウム
    • Place of Presentation
      東京理科大学 神楽坂キャンパス
    • Year and Date
      2015-01-22
    • Related Report
      2014 Research-status Report
  • [Presentation] Proposed Decision-Making System Based on Consciousness in Multiple Rewards and Penalties Environments2014

    • Author(s)
      Kazuteru Miyazaki
    • Organizer
      2014 International Conference on Artificial Intelligence
    • Place of Presentation
      バルセロナ(スペイン)
    • Year and Date
      2014-12-23
    • Related Report
      2014 Research-status Report
  • [Presentation] The Necessity of a Secondary System in Machine Consciousness2014

    • Author(s)
      Kazuteru Miyazaki and Junichi Takeno
    • Organizer
      2014 Annual International Conference on Biologically Inspired Cognitive Architectures
    • Place of Presentation
      マサチューセッツ工科大学(アメリカ)
    • Year and Date
      2014-11-08
    • Related Report
      2014 Research-status Report
  • [Presentation] 複数種類の報酬と罰に対応した意識的意思決定システムの提案2014

    • Author(s)
      宮崎和光
    • Organizer
      第13回情報科学技術フォーラム
    • Place of Presentation
      筑波大学 筑波キャンパス
    • Year and Date
      2014-09-05
    • Related Report
      2014 Research-status Report
  • [Book] これからの強化学習2016

    • Author(s)
      牧野貴樹、澁谷長史、白川真一、浅田稔、麻生英樹、荒井幸代、飯間等、伊藤真、大倉和博、黒江康明、杉本徳和、坪井祐太、銅谷賢治、前田新一、松井藤五郎、南泰浩、宮崎和光、目黒豊美、森村哲郎、森本淳、保田俊行、吉本潤一郎
    • Total Pages
      320
    • Publisher
      森北出版
    • Related Report
      2016 Annual Research Report
  • [Remarks]

    • URL

      http://www7b.biglobe.ne.jp/~kazuteru/indexj.html

    • Related Report
      2014 Research-status Report

URL: 

Published: 2014-04-04   Modified: 2018-03-22  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi