首页 /研究 /Exploring Constrained Reinforcement Learning Algorithms for Quadrupedal Locomotion
LOCOMOTION

Exploring Constrained Reinforcement Learning Algorithms for Quadrupedal Locomotion

Joonho Lee, Lukas Schroth, Victor Klemm, Marko Bjelonic, Alexander Reske, Marco Hutter

发表年份
2024
引用次数
10

摘要

Shifting from traditional control strategies to Deep Reinforcement Learning (RL) for legged robots poses inherent challenges, especially when addressing real-world physical constraints during training. While high-fidelity simulations provide significant benefits, they often bypass these essential physical limitations. In this paper, we experiment with the Constrained Markov Decision Process (CMDP) framework instead of the conventional unconstrained RL for robotic applications. We evaluated five constrained policy optimization algorithms for quadrupedal locomotion using three different robot models. Our aim is to evaluate their applicability in real-world scenarios. Our robot experiments demonstrate the critical role of incorporating physical constraints, yielding successful sim-to-real transfers, and reducing operational errors on physical systems. The CMDP formulation streamlines the training process by separately handling constraints from rewards. Our findings underscore the potential of constrained RL for the effective development and deployment of learned controllers in robotics.

关键词

QuadrupedalismReinforcement learningComputer scienceReinforcementArtificial intelligenceAlgorithmEngineeringStructural engineering

相关论文

查看 LOCOMOTION 分类全部论文