2006 Fiscal Year Final Research Report Summary
Autonomous, Harmonious and Purposive Acquisition of Various Functions of Robots by Reinforcement Learning and the Relation to the Intelligence Formation
Project/Area Number |
15300064
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Oita University |
Principal Investigator |
SHIBATA Katsunari Oita University, Faculty of Engineering, Associate Professor, 工学部, 助教授 (10260522)
|
Project Period (FY) |
2003 – 2006
|
Keywords | Reinforcement Learning / Recurrent Neural Network / Symbol / Robot / Spatial Abstraction / Intelligent Exploration / Temporal Abstraction / Emergence of Intelligence |
Research Abstract |
This research was aimed to show that by the learning using the training signals that are derived by reinforcement learning, various functions emerge according to the necessity in a neural network to which sensor signals are directly entered and whose outputs are motor commands. The main fruits are as follows. 1. It is said that neural networks are not good at symbol processing. However, it was shown that the output representation of a neural network became binary only by reinforcement learning. 2. It was shown that a real robot could learn box-pushing behavior using neural network without giving any informatio a about image processing, image recognition, or the given task. 3. It was shown that a real robot could learn to reach an object in some degree even in a quasi-real world where various objects and colorful leaflets exist. 4. It was shown that a recurrent neural network trained by reinforcement learning could learn some tasks that are thought to be relevant to the spatial or temporal abstraction.
|