2017 Fiscal Year Final Research Report
A policy selection method based on the priming effect in the cognitive psychology for reinforcement learning agent
Project/Area Number |
16K12493
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Intelligent informatics
|
Research Institution | Tokyo Denki University |
Principal Investigator |
|
Co-Investigator(Kenkyū-buntansha) |
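温 文 The University of Tokyo, Graduate School of Engineering (Faculty of Engineering), Research Fellow (50646601)
河野 仁 Tokyo Polytechnic University, Faculty of Engineering, Assistant Professor (70758367)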
|
Project Period (FY) |
2016-04-01 – 2018-03-31
|
Keywords | knowledge selection / spreading activation model / transfer learning / reinforcement learning |
Outline of Final Research Achievements |
This research proposes a policy transfer method that lets a reinforcement learning agent learn effectively in unknown or dynamic environments, based on the spreading activation model from cognitive psychology. The agent stores policies learned in various environments and learns flexibly by partially reusing whichever stored policy suits the current environment. In the proposed method, the stored policies are linked by an undirected graph to form a network. While repeating the processes of recall, activation, spreading, and attenuation, the agent updates each policy's activation value according to the environment and learns based on the network, which it then uses for transfer learning. Simulation experiments comparing the proposed method with several existing methods were conducted to confirm its usefulness. The results show that, with the proposed method, the agent accomplishes the task by selecting the optimal policy from among the stored policies.
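The recall, activation, spreading, and attenuation cycle described above can be illustrated with a minimal Python sketch. The outline gives no update equations, so everything below is an assumption made for illustration: the class name PolicyNetwork, the spread_rate, decay_rate, and boost parameters, the additive update rule, and the choice of the highest-activation policy as the one to transfer. It is not the authors' implementation.

```python
from collections import defaultdict


class PolicyNetwork:
    """Toy spreading-activation network over stored policies (illustrative only)."""

    def __init__(self, spread_rate=0.5, decay_rate=0.1):
        self.spread_rate = spread_rate      # assumed fraction passed to each neighbour
        self.decay_rate = decay_rate        # assumed attenuation applied every step
        self.activation = {}                # policy name -> activation value
        self.neighbors = defaultdict(set)   # undirected adjacency between policies

    def add_policy(self, name):
        # Register a policy with zero activation if it is not known yet.
        self.activation.setdefault(name, 0.0)

    def connect(self, a, b):
        # Undirected edge: store the link in both directions.
        self.add_policy(a)
        self.add_policy(b)
        self.neighbors[a].add(b)
        self.neighbors[b].add(a)

    def step(self, recalled, boost=1.0):
        # Recall is assumed to happen outside this class: the caller passes
        # the name of the policy recalled for the current environment.
        self.add_policy(recalled)
        # Activation: raise the value of the recalled policy.
        self.activation[recalled] += boost
        # Spreading: each neighbour receives a share of that activation.
        share = self.spread_rate * self.activation[recalled]
        for name in self.neighbors[recalled]:
            self.activation[name] += share
        # Attenuation: every activation value decays.
        for name in self.activation:
            self.activation[name] *= 1.0 - self.decay_rate

    def select(self):
        # Assumed selection rule: transfer the most activated policy.
        return max(self.activation, key=self.activation.get)
```

As a usage example under the same assumptions, connecting hypothetical policies "maze_a" and "maze_b", then "maze_b" and "open_field", and calling step("maze_a") once leaves "maze_a" with the highest activation, "maze_b" with a smaller spread-in value, and "open_field" untouched, so select() returns "maze_a".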
|
Free Research Field |
Robotics, information and communication engineering
|