
Reinforcement Learning of Action Strategies and Joint Stiffness of Tendon-driven Biped Robot

Research Project

Project/Area Number 21560275
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent mechanics/Mechanical systems
Research Institution Meiji University

Principal Investigator

KOBAYASHI Hiroaki  Meiji University, School of Science and Technology, Professor (60130811)

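Co-Investigator (Kenkyū-buntansha) TANAKA Sumio  Meiji University, School of Science and Technology, Lecturer (40287884)
Co-Investigator (Renkei-kenkyūsha) HYODO Kazuhito  Kanagawa Institute of Technology, Faculty of Engineering, Professor (10271371)
MIYAZAKI Kazuteru  National Institution for Academic Degrees and University Evaluation, Associate Professor (20282866)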
Project Period (FY) 2009 – 2011
Project Status Completed (Fiscal Year 2011)
Budget Amount
¥3,510,000 (Direct Cost: ¥2,700,000, Indirect Cost: ¥810,000)
Fiscal Year 2011: ¥650,000 (Direct Cost: ¥500,000, Indirect Cost: ¥150,000)
Fiscal Year 2010: ¥2,210,000 (Direct Cost: ¥1,700,000, Indirect Cost: ¥510,000)
Fiscal Year 2009: ¥650,000 (Direct Cost: ¥500,000, Indirect Cost: ¥150,000)
Keywords Machine intelligence / Intelligent robots / Control engineering / Machine learning / Robotics / Intelligent robotics
Research Abstract

This research developed a reinforcement learning method in which a robot learns appropriate actions from the rewards (profits) and penalties it receives from the environment, and applied it to action learning in a robot soccer game and to the walking motion of a biped robot. To apply the method to real robots and to improve learning efficiency, a method for deciding the criterion for penalties was studied, and states in which the robot had already learned sufficiently were treated as fixed-mode states, in which a deterministic action strategy is used. In addition, the mechanism and control of a biped robot driven by motors (muscles) and wires (tendons) were studied. Because tensile forces of up to 400 N (about 40 kgf) are expected during walking, tendon tension was controlled with a robust stiffness-adjustable device.
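
The learning scheme outlined in the abstract can be illustrated with a minimal Python sketch. It is not the project's algorithm: the class name `ProfitSharingAgent`, the geometric credit decay, the weight-gap test behind `fix_threshold`, and the rule of excluding only the last state-action pair after a penalty are all assumptions made for illustration. The sketch shows reward credit being shared backward along an episode, penalized rules being excluded from selection, and a state switching to a deterministic (fixed-mode) choice once one action clearly dominates.

```python
import random
from collections import defaultdict


class ProfitSharingAgent:
    """Toy profit-sharing learner with penalties and fixed-mode states (illustrative only)."""

    def __init__(self, actions, decay=0.5, fix_threshold=5.0, seed=0):
        self.actions = list(actions)
        self.decay = decay                  # assumed geometric credit decay
        self.fix_threshold = fix_threshold  # assumed weight gap that fixes a state
        self.weights = defaultdict(lambda: 1.0)  # (state, action) -> rule weight
        self.penalized = set()              # rules that led directly to a penalty
        self.episode = []                   # rules fired since the last reward/penalty
        self.rng = random.Random(seed)

    def _candidates(self, state):
        # Prefer actions that have not been penalized in this state.
        allowed = [a for a in self.actions if (state, a) not in self.penalized]
        return allowed or self.actions

    def is_fixed(self, state):
        # A state is treated as fixed-mode when its best action leads by a clear margin.
        ws = sorted((self.weights[(state, a)] for a in self._candidates(state)),
                    reverse=True)
        return len(ws) == 1 or ws[0] - ws[1] >= self.fix_threshold

    def act(self, state):
        cands = self._candidates(state)
        if self.is_fixed(state):
            # Fixed-mode state: deterministic action strategy.
            action = max(cands, key=lambda a: self.weights[(state, a)])
        else:
            # Otherwise select stochastically in proportion to rule weights.
            probs = [self.weights[(state, a)] for a in cands]
            action = self.rng.choices(cands, weights=probs)[0]
        self.episode.append((state, action))
        return action

    def reward(self, value):
        # Share the reward backward along the episode with decaying credit.
        credit = value
        for rule in reversed(self.episode):
            self.weights[rule] += credit
            credit *= self.decay
        self.episode.clear()

    def penalty(self):
        # Mark the rule that immediately preceded the penalty so it is avoided.
        if self.episode:
            self.penalized.add(self.episode[-1])
        self.episode.clear()
```

In use, a control loop would call `act(state)` at every step and then `reward(value)` or `penalty()` whenever the environment signals success or failure. The value of `fix_threshold` trades exploration against speed: a lower value fixes states sooner but risks locking in a poor action.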

Report

(4 results)
  • 2011 Annual Research Report   Final Research Report ( PDF )
  • 2010 Annual Research Report
  • 2009 Annual Research Report
  • Research Products

    (21 results)


Journal Article (8 results) (of which Peer Reviewed: 8 results) / Presentation (12 results) / Remarks (1 result)

  • [Journal Article] Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment (2012)

    • Author(s)
      Kazuteru Miyazaki, Masaki Itou, and Hiroaki Kobayashi
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 7196 Pages: 270-280

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
  • [Journal Article] Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot (2012)

    • Author(s)
      Seiya Kuroda, Kazuteru Miyazaki and Hiroaki Kobayashi
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 7188 Pages: 297-308

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
  • [Journal Article] Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot (2012)

    • Author(s)
      Seiya Kuroda, Kazuteru Miyazaki, Hiroaki Kobayashi
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 7188 Pages: 293-308

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment (2012)

    • Author(s)
      Kazuteru Miyazaki, Masaki Itou, Hiroaki Kobayashi
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 7196 Pages: 270-280

    • DOI

      10.1007/978-3-642-28487-8_28

    • ISBN
      9783642284861, 9783642284878
    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Threshold Learning in the Improved Penalty Avoiding Rational Policy Making Algorithm (2010)

    • Author(s)
      Kazuteru Miyazaki, Ryouhei Kobayashi, and Hiroaki Kobayashi
    • Journal Title

      Proc. of SICE Annual Conference 2010

      Pages: 3240-3245

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
  • [Journal Article] Threshold Learning in the Improved Penalty Avoiding Rational Policy Making Algorithm (2010)

    • Author(s)
      Kazuteru Miyazaki, Ryouhei Kobayashi, Hiroaki Kobayashi
    • Journal Title

      Proc. of SICE Annual Conference 2010

      Pages: 3240-3245

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A New Improved Penalty Avoiding Rational Policy Making Algorithm for Keepaway with Continuous State Spaces (2009)

    • Author(s)
      Takuji Watanabe, Kazuteru Miyazaki, and Hiroaki Kobayashi
    • Journal Title

      Journal of Advanced Computational Intelligence and Intelligent Informatics

      Volume: 13, No. 6 Pages: 675-683

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
  • [Journal Article] A New Improved Penalty Avoiding Rational Policy Making Algorithm for Keepaway with Continuous State Spaces (2009)

    • Author(s)
      Takuji Watanabe, Kazuteru Miyazaki, Hiroaki Kobayashi
    • Journal Title

      Journal of Advanced Computational Intelligence and Intelligent Informatics

      Volume: 13, No. 6 Pages: 675-683

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
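  • [Presentation] A Study on Improving the Efficiency of Reinforcement Learning of the Waist Trajectory of a Tendon-driven Biped Robot by Introducing Fixed States (2011)

    • Author(s)
      伊藤大貴、岡島勇也、田中純夫、小林博明、宮崎和光
    • Organizer
      The 54th Japan Joint Automatic Control Conference
    • Place of Presentation
      Toyohashi University of Technology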
    • Year and Date
      2011-11-20
    • Related Report
      2011 Annual Research Report 2011 Final Research Report
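  • [Presentation] A Study on Failure Probability Propagation in Reinforcement Learning Using Penalties and Rewards (2011)

    • Author(s)
      村岡宏紀、宮崎和光、小林博明
    • Organizer
      The 54th Japan Joint Automatic Control Conference
    • Place of Presentation
      Toyohashi University of Technology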
    • Year and Date
      2011-11-20
    • Related Report
      2011 Annual Research Report 2011 Final Research Report
  • [Presentation] Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot (2011)

    • Author(s)
      Seiya Kuroda, Kazuteru Miyazaki and Hiroaki Kobayashi
    • Organizer
      The 9th European Workshop on Reinforcement Learning (EWRL-9)
    • Place of Presentation
      Athens Royal Olympic Hotel
    • Year and Date
      2011-09-11
    • Related Report
      2011 Final Research Report
  • [Presentation] Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot (2011)

    • Author(s)
      Seiya Kuroda, Kazuteru Miyazaki, Hiroaki Kobayashi
    • Organizer
      The 9th European Workshop on Reinforcement Learning (EWRL-9)
    • Place of Presentation
      Athens Royal Olympic Hotel
    • Year and Date
      2011-09-11
    • Related Report
      2011 Annual Research Report
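  • [Presentation] Development of a Tendon-driven Biped Robot and Reinforcement Learning of Waist Trajectory and Tendon Tension (2010)

    • Author(s)
      黒田聖弥、日野雄太、岡島勇也、田中純夫、兵頭和人、小林博明
    • Organizer
      The 53rd Japan Joint Automatic Control Conference
    • Place of Presentation
      Kochi-jo Hall, Kochi City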
    • Year and Date
      2010-11-04
    • Related Report
      2011 Final Research Report
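  • [Presentation] Application of the Improved Penalty Avoiding Rational Policy Making Algorithm to Multi-agent Continuous Tasks and Evaluation by Experiments with Soccer Robots (2010)

    • Author(s)
      伊藤昌樹、宮崎和光、小林博明
    • Organizer
      The 53rd Japan Joint Automatic Control Conference
    • Place of Presentation
      Kochi-jo Hall, Kochi City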
    • Year and Date
      2010-11-04
    • Related Report
      2011 Final Research Report 2010 Annual Research Report
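  • [Presentation] Development of a Tendon-driven Biped Robot and Reinforcement Learning of Waist Trajectory and Tendon Tension, Part … (2010)

    • Author(s)
      伊藤昌樹、宮崎和光、小林博明
    • Organizer
      The 53rd Japan Joint Automatic Control Conference
    • Place of Presentation
      Kochi-jo Hall, Kochi City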
    • Year and Date
      2010-11-04
    • Related Report
      2010 Annual Research Report
  • [Presentation] Threshold Learning in the Improved Penalty Avoiding Rational Policy Making Algorithm (2010)

    • Author(s)
      Kazuteru Miyazaki, Ryouhei Kobayashi, and Hiroaki Kobayashi
    • Organizer
      SICE Annual Conference 2010
    • Place of Presentation
      Grand Hotel, Taipei, Taiwan
    • Year and Date
      2010-08-21
    • Related Report
      2011 Final Research Report
  • [Presentation] Threshold Learning in the Improved Penalty Avoiding Rational Policy Making Algorithm (2010)

    • Author(s)
      Kazuteru Miyazaki, Ryouhei Kobayashi, Hiroaki Kobayashi
    • Organizer
      SICE Annual Conference 2010
    • Place of Presentation
      Grand Hotel, Taipei, Taiwan
    • Year and Date
      2010-08-21
    • Related Report
      2010 Annual Research Report
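  • [Presentation] Development of a Tendon-driven Biped Robot and Reinforcement Learning of Waist Trajectory and Tendon Tension (2010)

    • Author(s)
      黒田聖也, 平野晃一郎, 小林博明, 田中純夫
    • Organizer
      The 16th General Meeting and Conference of the Japan Society of Mechanical Engineers, Kanto Branch
    • Place of Presentation
      Meiji University Academy Common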
    • Year and Date
      2010-03-10
    • Related Report
      2011 Final Research Report 2009 Annual Research Report
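  • [Presentation] Introduction and Evaluation of a Mechanism for Determining the Penalty Level in the Improved Penalty Avoiding Rational Policy Making Algorithm (2010)

    • Author(s)
      小林諒平, 宮崎和光, 小林博明
    • Organizer
      The 16th General Meeting and Conference of the Japan Society of Mechanical Engineers, Kanto Branch
    • Place of Presentation
      Meiji University Academy Common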
    • Year and Date
      2010-03-10
    • Related Report
      2011 Final Research Report 2009 Annual Research Report
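  • [Presentation] Proposal of an Improved Penalty Avoiding Rational Policy Making Algorithm with a Learning Function for the Penalty Level Threshold (2009)

    • Author(s)
      小林諒平, 宮崎和光, 小林博明
    • Organizer
      The 52nd Japan Joint Automatic Control Conference
    • Place of Presentation
      Graduate School of Engineering Science, Osaka University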
    • Year and Date
      2009-11-22
    • Related Report
      2011 Final Research Report 2009 Annual Research Report
  • [Remarks]

    • Related Report
      2011 Final Research Report

URL: 

Published: 2009-04-01   Modified: 2016-04-21  
