TY - JOUR
T1 - Curriculum-Based Reinforcement Learning for Quadrupedal Jumping
T2 - A Reference-Free Design
AU - Atanassov, Vassil
AU - Ding, Jiatao
AU - Kober, Jens
AU - Havoutis, Ioannis
AU - Santina, Cosimo Della
N1 - Publisher Copyright:
© 1994-2011 IEEE.
PY - 2024
Y1 - 2024
N2 - Deep reinforcement learning (DRL) has emerged as a promising solution to mastering explosive and versatile quadrupedal jumping skills. However, current DRL-based frameworks usually rely on pre-existing reference trajectories obtained by capturing animal motions or transferring experience from existing controllers. This work aims to prove that learning dynamic jumping is possible without relying on imitating a reference trajectory by leveraging a curriculum design. Starting from a vertical in-place jump, we generalize the learned policy to forward and diagonal jumps and, finally, we learn to jump across obstacles. Conditioned on the desired landing location, orientation, and obstacle dimensions, the proposed approach yields a wide range of omnidirectional jumping motions in real-world experiments. In particular, we achieve a 90 cm forward jump, exceeding all previous records for similar robots. Additionally, the robot can reliably execute continuous jumping on soft grassy grounds, which is especially remarkable as such conditions were not included in the training stage.
AB - Deep reinforcement learning (DRL) has emerged as a promising solution to mastering explosive and versatile quadrupedal jumping skills. However, current DRL-based frameworks usually rely on pre-existing reference trajectories obtained by capturing animal motions or transferring experience from existing controllers. This work aims to prove that learning dynamic jumping is possible without relying on imitating a reference trajectory by leveraging a curriculum design. Starting from a vertical in-place jump, we generalize the learned policy to forward and diagonal jumps and, finally, we learn to jump across obstacles. Conditioned on the desired landing location, orientation, and obstacle dimensions, the proposed approach yields a wide range of omnidirectional jumping motions in real-world experiments. In particular, we achieve a 90 cm forward jump, exceeding all previous records for similar robots. Additionally, the robot can reliably execute continuous jumping on soft grassy grounds, which is especially remarkable as such conditions were not included in the training stage.
UR - http://www.scopus.com/inward/record.url?scp=85209910328&partnerID=8YFLogxK
U2 - 10.1109/MRA.2024.3487325
DO - 10.1109/MRA.2024.3487325
M3 - Article
AN - SCOPUS:85209910328
SN - 1070-9932
SP - 2
EP - 15
JO - IEEE Robotics and Automation Magazine
JF - IEEE Robotics and Automation Magazine
ER -