2006 Fiscal Year Annual Research Report

複雑な環境における脳の意思決定モデルとロボット制御への応用

Research Project

Project/Area Number	18300101
Research Institution	Nara Institute of Science and Technology
Principal Investigator	石井信奈良先端科学技術大学院大学, 情報科学研究科, 教授 (90294280)
Co-Investigator(Kenkyū-buntansha)	中村泰大阪大学, 大学院・工学研究科, 助手 (70403334) 柴田智広奈良先端科学技術大学院大学, 情報科学研究科, 助教授 (40359873)
Keywords	教科学習 / 部分観測 / 前頭前野 / 意思決定 / 計算論的認知心理学 / サンプリング / 多自由度ロボット
Research Abstract	線形確率システムとして表現できないような複雑な環境における最適意思決定過程を模擬する機械学習モデルを、強化学習に注目して構築し、工学応用、特に多自由度ロボットに対して変動する環境下での自律制御を行った。高度で階層的な推論を必要とするタスクを題材として、複雑な問題解決に関わる階層的な脳内モデルを構築し、非侵襲脳活動計測器を用いた認知科学実験により検証した。 1.強化学習のアルゴリズム開発方策勾配法ベースの方策オフ型強化学習法に注目しながら新しい強化学習アルゴリズムを開発した。サンプリングに基づく部分観測マルコフ決定過程の解法を開発し、4人で行うマルチエージェントゲームであるHeartsの効率よい自動学習に成功した(Fujita and Ishii, in press)。 2.階層的部分観測環境における神経基盤部分観測課題における最適意思決定過程の神経基盤を調べるために、fMRIを用いた認知科学実験を行い、前部前頭前野の関わりを明らかにした(Yoshida and Ishii,2006)。また、階層性ある意思決定課題においては、前頭前野の異なる領域が関わることが分かった。 3.多自由度ロボットの強化学習法による制御方策勾配方策オフ型強化学習法を、2足準受動歩行ロボット実機に実装し、通常の強化学習法よりも早く安定して学習が可能であることが分かった(Ueno et al.,2006)。中枢パターン生成器に対する強化学習法をヘビ型ロボットに適用し、アクチュエータの故障などシステムの動的変化に追随できることを示した(Makino et al.,2007)。

Research Products
(16 results)

All 2006

All Journal Article (15 results) Book (1 results)

[Journal Article] Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot2006
- Author(s)
  K.Hitomi
- Journal Title
  
  Robotics and Autonomous Systems 54(12)
  
  Pages: 982-988
[Journal Article] 適応的サンプリングによる階層モデル化された対象の効率的状態推定2006
- Author(s)
  坂東誉司
- Journal Title
  
  システム制御情報学会論文誌 50(10)
  
  Pages: 369-377
[Journal Article] The role of short time depression in sustained neural activity in the prefrontal cortex : A simulation study2006
- Author(s)
  Y.Igarashi
- Journal Title
  
  Neural Networks 19(8)
  
  Pages: 1137-1152
[Journal Article] Switching particle filters for efficient visual tracking2006
- Author(s)
  T.Bando
- Journal Title
  
  Journal of Robotics and Autonomous Systems 54
  
  Pages: 873-884
[Journal Article] Balancing plasticity and stability of on-line learning based on hierarchical Bayesian adaptation of forgetting factors2006
- Author(s)
  J.Hirayama
- Journal Title
  
  Neurocomputing 69
  
  Pages: 1954-1961
[Journal Article] Stochastic resonance with a differential code in a cortical model2006
- Author(s)
  Y.Sakumura
- Journal Title
  
  Neural Networks 19
  
  Pages: 469-476
[Journal Article] 重点サンプリング法に基づくNatural Actor-Critic法による効果的なサンプルの再利用2006
- Author(s)
  森健
- Journal Title
  
  電子情報通信学会論文誌 J89-D(5)
  
  Pages: 954-966
[Journal Article] 学習ダイナミクスの制御と脳の物質機構2006
- Author(s)
  銅谷賢治
- Journal Title
  
  トステム/制御/情報 50(8)
  
  Pages: 303-308
[Journal Article] Reinforcement learning : machine learning and natural learning2006
- Author(s)
  S.Ishii
- Journal Title
  
  New Generation Computing 24
  
  Pages: 325-350
[Journal Article] Adaptive control of a looper-like robot based on the CPG-actor-critic method.2006
- Author(s)
  K.Makino
- Journal Title
  
  International Symposium on Artificial Life and Robotics
  
  Pages: GS18-5
[Journal Article] On-line variational PCA for adaptive visual tracking2006
- Author(s)
  T.Date
- Journal Title
  
  International Symposium on Artificial Life and Robotics
  
  Pages: GS18-5
[Journal Article] A probabilistic modeling of MOSAIC learning2006
- Author(s)
  S.Osaga
- Journal Title
  
  International Symposium on Artificial Life and Robotics
  
  Pages: GS16-3
[Journal Article] Estimation of source-filter model via acoustical feature extraction by GA-like algorithm2006
- Author(s)
  M.Ihara
- Journal Title
  
  International Symposium on Artificial Life and Robotics
  
  Pages: GS12-4
[Journal Article] Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic2006
- Author(s)
  T.Ueno
- Journal Title
  
  IEEE/RSJ International Conference on Intelligent Robots and Systems
  
  Pages: 5226-5231
[Journal Article] Feature extraction for decision-theoretic planning in partially observable environments2006
- Author(s)
  H.Fujita
- Journal Title
  
  Artificial Neural Networks - ICANN 2006, Lecture Notes in Computation Science 4132
  
  Pages: I-820-829
[Book] Bayesian Brain (Chapter 1 : A probability primer)2006
- Author(s)
  K.Doya
- Total Pages
  323
- Publisher
  MIT Press

2006 Fiscal Year Annual Research Report

複雑な環境における脳の意思決定モデルとロボット制御への応用

Principal Investigator

石井 信 奈良先端科学技術大学院大学, 情報科学研究科, 教授 (90294280)

Research Products

[Journal Article] Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot2006

Author(s)

Journal Title

[Journal Article] 適応的サンプリングによる階層モデル化された対象の効率的状態推定2006

Author(s)

Journal Title

[Journal Article] The role of short time depression in sustained neural activity in the prefrontal cortex : A simulation study2006

Author(s)

Journal Title

[Journal Article] Switching particle filters for efficient visual tracking2006

Author(s)

Journal Title

[Journal Article] Balancing plasticity and stability of on-line learning based on hierarchical Bayesian adaptation of forgetting factors2006

Author(s)

Journal Title

[Journal Article] Stochastic resonance with a differential code in a cortical model2006

Author(s)

Journal Title

[Journal Article] 重点サンプリング法に基づくNatural Actor-Critic法による効果的なサンプルの再利用2006

Author(s)

Journal Title

[Journal Article] 学習ダイナミクスの制御と脳の物質機構2006

Author(s)

Journal Title

[Journal Article] Reinforcement learning : machine learning and natural learning2006

Author(s)

Journal Title

[Journal Article] Adaptive control of a looper-like robot based on the CPG-actor-critic method.2006

Author(s)

Journal Title

[Journal Article] On-line variational PCA for adaptive visual tracking2006

Author(s)

Journal Title

[Journal Article] A probabilistic modeling of MOSAIC learning2006

Author(s)

Journal Title

[Journal Article] Estimation of source-filter model via acoustical feature extraction by GA-like algorithm2006

Author(s)

Journal Title

[Journal Article] Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic2006

Author(s)

Journal Title

[Journal Article] Feature extraction for decision-theoretic planning in partially observable environments2006

Author(s)

Journal Title

[Book] Bayesian Brain (Chapter 1 : A probability primer)2006

Author(s)

Total Pages

Publisher

石井信奈良先端科学技術大学院大学, 情報科学研究科, 教授 (90294280)