2009 Fiscal Year Annual Research Report

Study of Modular Decision-Making Models in Ambiguous and Changing Environments

Research Project

Project/Area Number	21300113
Research Institution	Kyoto University
Principal Investigator	石井信 Kyoto University, Graduate School of Informatics, Professor (90294280)
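Co-Investigator (Kenkyū-buntansha)	中村泰 Osaka University, Graduate School of Engineering, Assistant Professor (70403334) / 前田新一 Kyoto University, Graduate School of Informatics, Assistant Professor (20379530)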
Keywords	Reinforcement learning / Modular architecture / Computational neuroscience / Robot / Non-invasive brain measurement
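Research Abstract	Development of multi-module learning algorithms: We formulated a multi-module control algorithm as statistical learning of a mixture of experts and turned it into an online learning algorithm. For online learning of a value function represented as a linear combination of multiple basis functions, we derived an algorithm that places the basis functions dynamically and showed that it is more efficient than conventional methods (Mori and Ishii, 2009). We also derived a general form of value-function-based reinforcement learning from semiparametric statistics and obtained an online algorithm that minimizes the asymptotic estimation variance (Ueno, et al., 2009; Ueno, et al., to appear).

Cognitive-science study of inference in multi-level tasks: We gave subjects a complex rule-inference task with two hierarchical levels and used functional magnetic resonance imaging (fMRI), a non-invasive measure of brain activity, to examine the neural basis of inference at the lower and the higher level. The results suggest that a different region of the prefrontal cortex may be involved in inference at each level (Yoshida, et al., 2009).

Control of a multi-joint robot: We extended the control system of a multi-joint robot driven by 26 artificial muscles. The robot learned by imitation the trajectories produced when a person held its hand and moved it, and separately learned its inverse kinematics under free control. We developed a model-identification control method that combines the two. Applying it to the generation of a simple handshake motion showed that the method can control a complex multi-joint robot (Nishioka, et al., 2010).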
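As background for the value-function learning mentioned in the abstract, the following is a minimal textbook sketch of online TD(0) learning with a value function that is linear in Gaussian basis functions. It is not the algorithm of Mori and Ishii (2009): the basis centers here are fixed rather than placed dynamically, and the toy environment, the function names (`rbf_features`, `td0_update`, `run_demo`, `value`) and all parameter values are assumptions made only for illustration.

```python
import math
import random


def rbf_features(state, centers, width):
    """Evaluate Gaussian basis functions at a scalar state."""
    return [math.exp(-((state - c) ** 2) / (2.0 * width ** 2)) for c in centers]


def td0_update(weights, phi, reward, phi_next, alpha, gamma):
    """One online TD(0) step for V(s) = sum_i w_i * phi_i(s)."""
    v = sum(w * p for w, p in zip(weights, phi))
    v_next = sum(w * p for w, p in zip(weights, phi_next))
    td_error = reward + gamma * v_next - v
    return [w + alpha * td_error * p for w, p in zip(weights, phi)]


def run_demo(steps=20000, seed=0, width=0.2, alpha=0.05, gamma=0.9):
    """Learn a value function on a toy problem (an assumption, not from the report)."""
    rng = random.Random(seed)
    centers = [0.0, 0.25, 0.5, 0.75, 1.0]  # fixed centers, hypothetical choice
    weights = [0.0] * len(centers)
    state = rng.random()
    for _ in range(steps):
        next_state = rng.random()  # toy dynamics: next state uniform on [0, 1]
        reward = state             # toy reward: equal to the current state
        weights = td0_update(
            weights,
            rbf_features(state, centers, width),
            reward,
            rbf_features(next_state, centers, width),
            alpha,
            gamma,
        )
        state = next_state
    return centers, weights


def value(state, centers, weights, width=0.2):
    """Evaluate the learned linear value function at a state."""
    return sum(w * p for w, p in zip(weights, rbf_features(state, centers, width)))


if __name__ == "__main__":
    centers, weights = run_demo()
    for s in (0.1, 0.5, 0.9):
        print(s, round(value(s, centers, weights), 3))
```

In this toy problem the exact value function is V(s) = s + 4.5, so the learned estimate should increase with the state. A dynamic placement scheme would additionally move or add entries of `centers` during learning, which this sketch does not attempt.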

Research Products

(9 results)

All: Journal Article (3 results, of which Peer Reviewed: 3 results) / Presentation (5 results) / Book (1 result)

[Journal Article] Hierarchical rule switching in prefrontal cortex (2010)
- Author(s)
  W.Yoshida, H.Funakoshi, S.Ishii
- Journal Title
  NeuroImage 50(1)
  Pages: 314-322
- Peer Reviewed
[Journal Article] Boosting perceptual learning by fake feedback (2009)
- Author(s)
  K.Shibata, N.Yamagishi, S.Ishii, M.Kawato
- Journal Title
  Vision Research 49(21)
  Pages: 2574-2585
- Peer Reviewed
[Journal Article] Adaptive particle allocation for multifocal visual attention based on particle filtering (2009)
- Author(s)
  N.Yano, T.Shibata, S.Ishii
- Journal Title
  Journal of Artificial Life and Robotics 13
  Pages: 522-535
- Peer Reviewed
[Presentation] Robust approximation in decomposed reinforcement learning (2009)
- Author(s)
  T.Mori, S.Ishii
- Organizer
  International Conference on Neural Information Processing
- Place of Presentation
  Bangkok, Thailand
- Year and Date
  2009-12-02
[Presentation] An additive reinforcement learning (2009)
- Author(s)
  T.Mori, S.Ishii
- Organizer
  International Conference on Artificial Neural Networks
- Place of Presentation
  Limassol, Cyprus
- Year and Date
  2009-09-14
[Presentation] Optimal online learning procedures for model-free policy evaluation (2009)
- Author(s)
  T.Ueno, M.Kawanabe, S.Maeda, S.Ishii
- Organizer
  European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
- Place of Presentation
  Ljubljana, Slovenia
- Year and Date
  2009-09-08
[Presentation] Visual attention model involving feature-based inhibition of return (2009)
- Author(s)
  S.Hotta, S.Oba, S.Ishii
- Organizer
  International Symposium on Artificial Life and Robotics
- Place of Presentation
  Beppu, Japan
- Year and Date
  2009-02-05
[Presentation] Machine learning approach to 9-DOF arm robot control (2009)
- Author(s)
  S.Nishioka, S.Maeda, S.Ishii
- Organizer
  International Symposium on Artificial Life and Robotics
- Place of Presentation
  Beppu, Japan
- Year and Date
  2009-02-04
[Book] 価値と学習 (Value and Learning), in よくわかる認知科学, contributed section (Section IV-7) (2010)
- Author(s)
  石井信
- Total Pages
  3
- Publisher
  ミネルヴァ書房 (Minerva Shobo)
