2017 Fiscal Year Final Research Report

A policy selection method based on the priming effect in the cognitive psychology for reinforcement learning agent

Project/Area Number	16K12493
Research Category	Grant-in-Aid for Challenging Exploratory Research
Allocation Type	Multi-year Fund
Research Field	Intelligent informatics
Research Institution	Tokyo Denki University
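Principal Investigator	SUZUKI Tsuyoshi, Tokyo Denki University, School of Engineering, Professor (00349789)
Co-Investigator (Kenkyū-buntansha)	WEN Wen (温文), The University of Tokyo, Graduate School of Engineering (Faculty of Engineering), Research Fellow (50646601); KONO Hitoshi (河野仁), Tokyo Polytechnic University, Faculty of Engineering, Assistant Professor (70758367)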
Project Period (FY)	2016-04-01 – 2018-03-31
Keywords	knowledge selection / spreading activation model / transfer learning / reinforcement learning
Outline of Final Research Achievements	This research proposes a policy transfer method, based on the spreading activation model from cognitive psychology, that lets a reinforcement learning agent learn appropriately in unknown or dynamic environments. The agent stores policies learned in various environments and learns flexibly by partially reusing whichever policies suit the current environment. In the proposed method, the stored policies are linked by undirected edges to form a network. The agent updates each policy's activation value according to the environment by repeating the processes of recall, activation, spreading, and attenuation, and it uses this network for transfer learning. Simulation experiments comparing the proposed method with several existing methods were conducted to confirm its usefulness. The results show that, with the proposed method, the agent accomplishes the task by selecting the optimal policy from those it has stored.
Free Research Field	Robotics, information and communication engineering
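
The outline names four repeated processes (recall, activation, spreading, attenuation) over an undirected network of stored policies, but gives no update rules. The following is a minimal illustrative sketch of how such a cycle might look. The class name `PolicyNetwork`, the policy names, the edge weights, and the `spread_rate` and `decay` constants are all assumptions made for illustration, not the formulas used in the report.

```python
# Hypothetical sketch: names, update rules, and constants are assumptions,
# not the formulas from the report.
from collections import defaultdict


class PolicyNetwork:
    """Undirected graph of stored policies, each with an activation value."""

    def __init__(self, spread_rate=0.5, decay=0.9):
        self.activation = {}                 # policy name -> activation value
        self.neighbors = defaultdict(dict)   # policy -> {neighbor: edge weight}
        self.spread_rate = spread_rate       # assumed share passed to neighbors
        self.decay = decay                   # assumed attenuation factor

    def add_policy(self, name, activation=0.0):
        self.activation[name] = activation

    def connect(self, a, b, weight=1.0):
        # Undirected edge: store the weight in both directions.
        self.neighbors[a][b] = weight
        self.neighbors[b][a] = weight

    def recall(self):
        # Recall: pick the currently most active policy.
        return max(self.activation, key=self.activation.get)

    def activate(self, name, amount):
        # Activation: raise the value of a policy that suited the environment.
        self.activation[name] += amount

    def spread(self):
        # Spreading: each policy passes a weighted share of its activation to
        # its neighbors. A snapshot is used so the update order does not matter.
        snapshot = dict(self.activation)
        for src, edges in self.neighbors.items():
            for dst, weight in edges.items():
                self.activation[dst] += self.spread_rate * weight * snapshot[src]

    def attenuate(self):
        # Attenuation: every activation value decays toward zero.
        for name in self.activation:
            self.activation[name] *= self.decay


# Usage with made-up policies: "maze_a" proved useful, so it is activated,
# and part of that activation reaches the linked policy "maze_b".
net = PolicyNetwork()
for name in ("maze_a", "maze_b", "open_field"):
    net.add_policy(name)
net.connect("maze_a", "maze_b", weight=0.8)
net.activate("maze_a", 1.0)
net.spread()
net.attenuate()
selected = net.recall()  # the policy the agent would reuse next
```

In this sketch a policy that is never activated and has no active neighbors stays at zero, so only policies related to recently useful ones become candidates for transfer. How the report actually scores suitability and weights the edges is not stated in the outline.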