2016 Fiscal Year Final Research Report
Elucidation of communication emergence mechanism based on action time series in reinforcement learning agents.
Project/Area Number |
25871049
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Multi-year Fund |
Research Field |
Soft computing
Intelligent informatics
|
Research Institution | Okinawa National College of Technology |
Principal Investigator |
Sato Takashi 沖縄工業高等専門学校, メディア情報工学科, 准教授 (70426576)
|
Research Collaborator |
HASHIMOTO Takashi 北陸先端科学技術大学院大学, 知識科学系知識マネジメント領域, 教授 (90313709)
|
Project Period (FY) |
2013-04-01 – 2017-03-31
|
Keywords | ジェスチャー理論 / 原始的コミュニケーションの創発 / Q-learning / Neural Q-learning / Recurrent Q-learning / マルチエージェント・システム / 拡張版SOM / 暗示的フィードバック |
Outline of Final Research Achievements |
Based on the gesture theory, we discussed an individual's ability and other factors necessary for emergence of proto-communication in a primitive society in which the communication was not established among the individuals. To verify the individual's ability aspect, we adopted a collision avoidance game and a reinforcement learning agents who can learn their action history as the game players. Our simulation showed that, by evaluating various models including a hybrid model between the Q-learning and the recurrent neural network, the abilities to learn and predict the past action history and its order can be played an important role in the emergence of communication. Also, to examine an element contributed to the formation of communication, we adopted a communication game with extended SOM learning agents. The second simulations suggested that "implicit feedback" obtained from situations other than individuals, which is proposed by us, can be improved the communication success rate.
|
Free Research Field |
複雑系、人工生命、進化言語学、進化的計算論
|