2011 Fiscal Year Final Research Report
Researches on System Stabilization based on Learning Process of Multiple Agents.
Project/Area Number |
21500153
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Intelligent informatics
|
Research Institution | National Institute of Advanced Industrial Science and Technology |
Principal Investigator |
NODA Itsuki National Institute of Advanced Industrial Science and Technology, Center for Service Research, Research Team Leader (40357744)
|
Co-Investigator(Kenkyū-buntansha) |
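YAMASHITA Tomohisa National Institute of Advanced Industrial Science and Technology, Center for Service Research, Service Design Support Technology Research Team, Researcher (50415759)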
|
Project Period (FY) |
2009 – 2011
|
Keywords | Intelligent agents / Multi-agent social simulation / Reinforcement learning / Learning parameters |
Research Abstract |
To overcome the instability caused by the dilemma of simultaneous learning among multiple agents, we proposed several methods for controlling the agents' insensitivity to environmental changes and the flow of information among them. RASP is a method that automatically adjusts the step-size parameter of reinforcement learning according to the degree of change in the environment. We also investigated a method for controlling the exploration/exploitation ratio to stabilize the learning process.
|
Research Products
(22 results)