• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2016 Fiscal Year Final Research Report

Theoretical research of the policy gradient reinforcement learning without Markov properties and its application to games

Research Project

  • PDF
Project/Area Number 26330419
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Entertainment and game informatics 1
Research InstitutionShibaura Institute of Technology

Principal Investigator

Harukazu Igarashi  芝浦工業大学, 工学部, 教授 (80288886)

Co-Investigator(Renkei-kenkyūsha) ISHIHARA Seiji  東京電機大学, 理工学部, 准教授 (50351656)
Research Collaborator MORIOKA Yuichi  
YAMAMOTO Kazumasa  
Project Period (FY) 2014-04-01 – 2017-03-31
Keywords強化学習 / 方策勾配法 / マルチエージェント / コンピュータ将棋 / ロボカップ / ソフトマックス探索
Outline of Final Research Achievements

In this research project, we have made theoretical and practical research for developing expressions of policy functions and learning methods in the policy gradient reinforcement learning algorithms. Our final goal is constructing a general methodology that can be applied to computer games and engineering fields. The results of this project are as follows.
(1)Theoretical research on the policy gradient reinforcement learning: we proposed new methods in ①hierarchical reinforcement learning to learn higher strategies of agents, ②learning with separated knowledge of environmental dynamics and action-values in agent policies, and ③learning with a fuzzy controller for policies.
(2) Practical application of the policy gradient reinforcement learning: we applied the proposed learning methods to pursuit games, robot soccer games and computer shogi and examines the efficiency of our methods.

Free Research Field

人工知能

URL: 

Published: 2018-03-22  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi