学習システムを備えた生物規範に基づく2足歩行の実現

Research Project

Project/Area Number	05J02829
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	国内
Research Field	Intelligent mechanics/Mechanical systems
Research Institution	Nara Institute of Science and Technology
Principal Investigator	松原崇充 Nara Institute of Science and Technology, 情報科学研究科, 特別研究員(DC1)
Project Period (FY)	2005 – 2007
Project Status	Completed (Fiscal Year 2007)
Budget Amount *help	¥2,700,000 (Direct Cost: ¥2,700,000) Fiscal Year 2007: ¥900,000 (Direct Cost: ¥900,000) Fiscal Year 2006: ¥900,000 (Direct Cost: ¥900,000) Fiscal Year 2005: ¥900,000 (Direct Cost: ¥900,000)
Keywords	2足歩行 / 生物規範 / 強化学習 / ヒューマノイド / 全身協調動作 / 倒立振子モデル / 方策勾配法 / 方策こう配法 / 自然方策勾配法 / 変分ベイズ法 / CPG
Research Abstract	人間のダイナミクスは近似的に倒立振子モデルで説明されることが知られており,運動生理学やロボット工学の分野で様々な運動の説明や実現に広く使われている.我々は,このような汎用性の高いモデルに基づいて,ロボットの全身運動を効率的に学習・獲得させることを試みた.具体的には、倒立振子の状態・行動空間においてタスクを達成するための適切な重心制御則を学習し,獲得された重心の運動は全身の関節運動へと変換される.提案手法を,21自由度を有するヒューマノイドロボットにおけるインパクト動作の学習課題に適用した結果,数千回の試行回数により適切な全身運動が学習できることを数値実験により確認した.この成果に某づいて,国際学会で発表1件を行い,さらに論文誌1件への掲載が決定された.また,倒立振子モデル上での運動を全身関節の運動へと分配する際に問題となる不良設定性を,人の見本動データに基づいて解決する方法を提案し,シミュレーション実験によりその有効性を示した.この成果について,国内学会での発表1件を行った. さらに,より効率的な運動学習・環境適応性を実現するため,平均報酬の計量空間における強化学習法を提案した.提案手法は,強化学習の目的関数である平均報酬の計量空間上での方策勾配法であり,従来法で問題であったプラトー現象と呼ばれる学習の停滞を生じることなく,効率的な学習が実現される.また,極値付近では2次収束性も有している.この成果について,国内の研究会での発表1件を行った.今後は,2足歩行や全身運動のようなロボットの複雑な運動の学習に対する適用を目指す.

Report

(3 results)

Research Products

(7 results)

All 2008 2007 2006 Other

All Journal Article (3 results) (of which Peer Reviewed: 2 results) Presentation (4 results)

[Journal Article] Learning CPG-based Biped Locomotion with a Policy Gradient Method: Apply to a Humanoid Robot2008
- Author(s)
  Gen Endo
- Journal Title
  
  The International Journal of Robotics Research vol.27,No.2
  
  Pages: 213-228
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] Learning CPG-based Biped Locomotion with a Policy Gradient Method2006
- Author(s)
  Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya
- Journal Title
  
  Robotics and Autonomous Systems 54
  
  Pages: 911-920
- Related Report
  2006 Annual Research Report
[Journal Article] Learning to acquire whole-body humanoid CoM movements to achieve dynamic tasks
- Author(s)
  Takamitsu Matsubara
- Journal Title
  
  Advanced Robotics
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Presentation] 平均報酬の多様体に基づく方策勾配法2007
- Author(s)
  松原崇充
- Organizer
  電子情報通信学会NC研究会
- Place of Presentation
  名古屋大学
- Year and Date
  2007-12-22
- Related Report
  2007 Annual Research Report
[Presentation] Learning to acquire whole-body humanoid CoM movements to achieve dynamic tasks2007
- Author(s)
  Takamitsu Matsubara
- Organizer
  IEEE International Conference on Robotics and Automation (ICRA)
- Place of Presentation
  Rome, Italy
- Related Report
  2007 Annual Research Report
[Presentation] 逆運動学問題に置ける自然勾配法とヤコビアン擬似逆行列に基づく解法の等価性2007
- Author(s)
  松原崇充
- Organizer
  日本ロボット学会学術講演会
- Place of Presentation
  千葉工業大学
- Related Report
  2007 Annual Research Report
[Presentation] 人間動作より抽出される低次元特徴空間におけるヒューマノイドの全身運動制御2007
- Author(s)
  松原崇充
- Organizer
  日本ロボット学会学術講演会
- Place of Presentation
  千葉工業大学
- Related Report
  2007 Annual Research Report

学習システムを備えた生物規範に基づく2足歩行の実現

Principal Investigator

松原 崇充 Nara Institute of Science and Technology, 情報科学研究科, 特別研究員(DC1)

¥2,700,000 (Direct Cost: ¥2,700,000)

Report

Research Products

[Journal Article] Learning CPG-based Biped Locomotion with a Policy Gradient Method: Apply to a Humanoid Robot2008

Author(s)

Journal Title

Related Report

[Journal Article] Learning CPG-based Biped Locomotion with a Policy Gradient Method2006

Author(s)

Journal Title

Related Report

[Journal Article] Learning to acquire whole-body humanoid CoM movements to achieve dynamic tasks

Author(s)

Journal Title

Related Report

[Presentation] 平均報酬の多様体に基づく方策勾配法2007

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Learning to acquire whole-body humanoid CoM movements to achieve dynamic tasks2007

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 逆運動学問題に置ける自然勾配法とヤコビアン擬似逆行列に基づく解法の等価性2007

Author(s)

Organizer

Place of Presentation

Related Report

[Presentation] 人間動作より抽出される低次元特徴空間におけるヒューマノイドの全身運動制御2007

Author(s)

Organizer

Place of Presentation

Related Report

松原崇充 Nara Institute of Science and Technology, 情報科学研究科, 特別研究員(DC1)