Researches on System Stabilization based on Learning Process of Multiple Agents.
Project/Area Number |
21500153
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | National Institute of Advanced Industrial Science and Technology |
Principal Investigator |
NODA Itsuki 独立行政法人産業技術総合研究所, サービス工学研究センター, 研究チーム長 (40357744)
|
Co-Investigator(Kenkyū-buntansha) |
YAMASHITA Tomohisa 独立行政法人産業技術総合研究所, サービス工学研究センターサービス設計支援技術研究チーム, 研究員 (50415759)
|
Project Period (FY) |
2009 – 2011
|
Project Status |
Completed (Fiscal Year 2011)
|
Budget Amount *help |
¥4,420,000 (Direct Cost: ¥3,400,000、Indirect Cost: ¥1,020,000)
Fiscal Year 2011: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000)
Fiscal Year 2010: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2009: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
|
Keywords | 知的エージェント / マルチエージェント社会シミュレーション / 強化学習 / 学習パラメータ / マルチエージェント / デマンドパスシミュレーション / デマンドバスシミュレーション |
Research Abstract |
In order to overcome difficulties of unstable situation caused by dilemma of simul-taneous learning of multiple agents, we proposed several methods to control insensitiveness to environmental changes and information flow for the multiple agents. RASP is a method to control stepsize parameter of reinforcement learning auto-matically according to the degree of changes of environment. We also investigate a method to control Exploration/Exploitation ratio to stabilize learning processes.
|
Report
(4 results)
Research Products
(39 results)