Towards Low-Gravity Planetary Exploration using Reinforcement Learning for Walking, Jumping, and In-flight Attitude Control
Jørgen Anker Olsen, Kostas Alexis
2026
Abstract
This paper presents reinforcement learning (RL) policies for dynamic quadrupedal locomotion in planetary exploration scenarios. Building on a taskoptimized quadruped with a 5-bar leg design, we develop RL policies for walking, vertical jumping, forward jumping, and in-flight attitude control, explicitly tailored to the reduced gravity on Mars. These policies jointly enable such robots to overcome obstacles larger than themselves through coordinated jumping and precise in-flight reorientation for safe landings. We demonstrate Sim2Real transfer of the attitude control policy on the Olympus quadruped through single-axis reorientation tests, while all locomotion policies are validated in simulation. A complete Mars exploration mission scenario demonstrates coordinated policy deployment across challenging terrain. Experimental results show 90° attitude reorientation in 2.6 seconds, with simulations demonstrating 3.1 meter vertical jumps and 3.9 meter forward jumps under Martian gravity conditions. - Supplementary video: https://www.youtube.com/watch?v=qlSJ3P87A4A
Keywords
Related papers
PAEAR: Point Clouds Area Exploration and Active Recognition method driven by reinforcement learning for robotic welding
Yong Tao, Donghua Tan, Fan Ren +6 more
Robotics and Computer-Integrated Manufacturing · 2026
Multi-pass cutting parameters optimisation with causal reinforcement learning for deformation control of thin-walled parts
Fengyi Lu, Guanghui Zhou, Chao Zhang +2 more
Robotics and Computer-Integrated Manufacturing · 2026
Bioinspired underwater robotics: Advances across the materials, design, control, and applications
Dilip Muchhala, Pramod Kumar Maurya, Adarsh Raut +3 more
Robotics and Autonomous Systems · 2026
Trajectory tracking control for 6WID/4WIS UGV via nonlinear sliding mode-model predictive control with adaptive following steering and dynamic-static constraints
Shengyang Lu, Guanpeng Chen, Lijing Zhao +2 more
Robotics and Autonomous Systems · 2026