





















Abstract:This paper presents reinforcement learning (RL) policies for dynamic quadrupedal locomotion in planetary exploration scenarios. Building on a taskoptimized quadruped with a 5-bar leg design, we develop RL policies for walking, vertical jumping, forward jumping, and in-flight attitude control, explicitly tailored to the reduced gravity on Mars. These policies jointly enable such robots to overcome obstacles larger than themselves through coordinated jumping and precise in-flight reorientation for safe landings. We demonstrate Sim2Real transfer of the attitude control policy on the Olympus quadruped through single-axis reorientation tests, while all locomotion policies are validated in simulation. A complete Mars exploration mission scenario demonstrates coordinated policy deployment across challenging terrain. Experimental results show 90° attitude reorientation in 2.6 seconds, with simulations demonstrating 3.1 meter vertical jumps and 3.9 meter forward jumps under Martian gravity conditions. - Supplementary video: this https URL
| Comments: | 16 pages, 16 figures |
| Subjects: | Robotics (cs.RO); Systems and Control (eess.SY) |
| Cite as: | arXiv:2605.24643 [cs.RO] |
| (or arXiv:2605.24643v1 [cs.RO] for this version) | |
| https://doi.org/10.48550/arXiv.2605.24643 arXiv-issued DOI via DataCite (pending registration) |
From: Jørgen Anker Olsen [view email]
[v1]
Sat, 23 May 2026 16:20:23 UTC (3,194 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。