研究実績の概要 |
Previously, we have been using Model-based Reinforcement Learning (MBRL) to understand the robot's preserving energy and characterize its interaction (dynamics) with its energy-flowing property based on the law of energy conservation. This year, we have made progress this year by extending our achievements and applying the MBRL approach to teach the robot how to walk on even or uneven terrain at different speeds. To achieve a higher control frequency, we introduced actuators as energy sources into the energy-flowing system to reduce the state's dimensions and achieve higher control frequency. Furthermore, we have used the robot's energy-formulated state and the law of conservation of energy to design a walking trajectory in terms of energy formulation. This ensures both robot dynamics and tasks are formulated in the same representation and a general manner. For verifying our approach's effectiveness, we conducted experiments using a simulated spring-loaded biped robot in a physics simulator. Results show that our approach can generalize across skill conditions, including different terrains and walking speeds. The walking skills are acquired using a compact 9-dimensional energy-formulated state, on-site learning ability, and learning with only a few minutes of samples. Besides developing learning algorithms, we are also creating a hardware robot that meets the requirements for our application. Regarding its progress, we are refining the initial design and plan to build the first prototype soon. With this hardware robot, we can test our approach in real-world scenarios.
|