2009 Fiscal Year Annual Research Report

マルチエージェントの学習過程に注目した系安定化・全体最適化に関する研究

Research Project

Project/Area Number	21500153
Research Institution	National Institute of Advanced Industrial Science and Technology
Principal Investigator	野田五十樹 National Institute of Advanced Industrial Science and Technology, 情報技術研究部門, 主任研究員 (40357744)
Co-Investigator(Kenkyū-buntansha)	山下倫央産業技術総合研究所, 情報技術研究部門, 研究員 (50415759)
Keywords	マルチエージェント / 強化学習 / デマンドバスシミュレーション / 学習パラメータ
Research Abstract	本年度は、これまで開発を進めてきた手法の一般化を行い、理論的な裏づけのある枠組みの構築を行った。まず学習・行動選択パラメータについては、強化学習のステップサイズパラメータを周りの環境に合わせて調整する方法を構築し、Recursive Adaptation of Stepsize Parameter(RASP)と名づけて具体的な学習アルゴリズムを定式化した。この方法は強化学習で用いる指数移動平均(EMA)を再帰的に求めることで、ステップサイズパラメータによる学習対象値の高次導関数を効率よく求めるというものであり、数学的に裏づけされた汎用性の高い手法となっている。このため、さまざまな学習課題に適用でき、実際に金融データや気象データなどを用いた適応実験を進め、これらの系の分析に用いることができることがわかってきている。また、学習性能についても、いくつかの数値実験を行い、Optimal Stepsize Algorithmなどの既存手法より適切にパラメータ学習ができることが示された。また行動選択については、デマンドバスなどのシミュレーションを行う環境を整備し、具体的事例からエージェントの行動選択やそれによる系の最適化・サービス安定化手法などを探る実験を進めた。これについては今後実験を重ね、公共交通など規模の大きい社会システムでの系の安定分析の例として用いることができるよう、整備を進める予定である。

Research Products
(11 results)

All 2010 2009

All Journal Article (2 results) (of which Peer Reviewed: 1 results) Presentation (8 results) Book (1 results)

[Journal Article] ロボカップ12年2010
- Author(s)
  野田五十樹, 松原仁
- Journal Title
  
  人工知能学会誌 25
  
  Pages: 183-188
[Journal Article] Recursive Adaptation of Stepsize Parameter for Non-stationary Environments2009
- Author(s)
  Itsuki Noda
- Journal Title
  
  Principles of Practice in Multi-Agent Systems 1
  
  Pages: 525-533
- Peer Reviewed
[Presentation] ニュートン法よる強化学習ステップサイズパラメータの調整法2010
- Author(s)
  野田五十樹
- Organizer
  情報処理学会全国大会
- Place of Presentation
  東京
- Year and Date
  2010-03-09
[Presentation] 強化学習と社会の安定化2010
- Author(s)
  野田五十樹
- Organizer
  FUN-AI 2010
- Place of Presentation
  函館
- Year and Date
  2010-02-22
[Presentation] 指数的移動平均2乗誤差の最小化によるステップサイズパラメータの調整法2009
- Author(s)
  野田五十樹
- Organizer
  JAWS-2009
- Place of Presentation
  蔵王
- Year and Date
  2009-10-28
[Presentation] Adaptation of Stepsize Parameter for Non-Stationary Environments by Recursive Exponential Moving Average2009
- Author(s)
  Itsuki Noda
- Organizer
  ECML 2009 LNIID Workshop
- Place of Presentation
  スロベニア、ブレッド
- Year and Date
  2009-09-07
[Presentation] 機械学習における再帰的ステップサイズパラメータ調整法を用いた価格変動の分析2009
- Author(s)
  松井宏樹, 野田五十樹, 尹煕元
- Organizer
  人工知能学会全国大会
- Place of Presentation
  高松
- Year and Date
  2009-06-19
[Presentation] テマンド型交通導入に関する仮想社会実験2009
- Author(s)
  舟山和男, 吉村忍, 野田五十樹, 藤井秀樹, 狩野宏和
- Organizer
  人工知能学会全国大会
- Place of Presentation
  高松
- Year and Date
  2009-06-19
[Presentation] 強化学習エージェントと報酬頻度に関する考察2009
- Author(s)
  野田五十樹
- Organizer
  人工知能学会全国大会
- Place of Presentation
  高松
- Year and Date
  2009-06-17
[Presentation] Recursive Adaptation of Stepsize Parameter for Unstable Environments2009
- Author(s)
  Itsuki Noda
- Organizer
  ALA-2009
- Place of Presentation
  ハンガリー、ブタペスト
- Year and Date
  2009-05-12
[Book] "Recursive Adaptation of Stepsize Parameter for Non-Stationary Environments" in Adaptive Learning Agents2010
- Author(s)
  Itsuki Noda
- Total Pages
  18
- Publisher
  Springer

2009 Fiscal Year Annual Research Report

マルチエージェントの学習過程に注目した系安定化・全体最適化に関する研究

Principal Investigator

野田 五十樹 National Institute of Advanced Industrial Science and Technology, 情報技術研究部門, 主任研究員 (40357744)

Research Products

[Journal Article] ロボカップ12年2010

Author(s)

Journal Title

[Journal Article] Recursive Adaptation of Stepsize Parameter for Non-stationary Environments2009

Author(s)

Journal Title

[Presentation] ニュートン法よる強化学習ステップサイズパラメータの調整法2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 強化学習と社会の安定化2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 指数的移動平均2乗誤差の最小化によるステップサイズパラメータの調整法2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Adaptation of Stepsize Parameter for Non-Stationary Environments by Recursive Exponential Moving Average2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 機械学習における再帰的ステップサイズパラメータ調整法を用いた価格変動の分析2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] テマンド型交通導入に関する仮想社会実験2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 強化学習エージェントと報酬頻度に関する考察2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Recursive Adaptation of Stepsize Parameter for Unstable Environments2009

Author(s)

Organizer

Place of Presentation

Year and Date

[Book] "Recursive Adaptation of Stepsize Parameter for Non-Stationary Environments" in Adaptive Learning Agents2010

Author(s)

Total Pages

Publisher

野田五十樹 National Institute of Advanced Industrial Science and Technology, 情報技術研究部門, 主任研究員 (40357744)