
Learning to Control a 6-Degree-of-Freedom Walking Robot

Paweł Wawrzyński

Year
2007
Citations
23

Abstract

We study the optimization of a control policy for a complex system through a simulated trial-and-error learning process, using Reinforcement Learning (RL). Stationary policies, which most RL methods apply, may be unsuitable in control applications: when the time discretization is sufficiently fine, they lose their exploration capabilities and yield policy gradient estimators of very large variance. As a remedy, we previously proposed the use of piecewise non-Markov policies. In the experimental study presented here, we apply this approach to a 6-degree-of-freedom walking robot and obtain an efficient policy for it.
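
The loss of exploration under fine time discretization can be illustrated with a toy example. The sketch below is not the paper's method or its robot model. It assumes a hypothetical one-dimensional system dx/dt = u driven purely by zero-mean Gaussian exploration noise, and compares noise redrawn at every time step (as a stationary stochastic policy would do) with noise held constant over a fixed interval, which is one simple way to make an action depend on the past. The function name `displacement_std` and all parameter values are illustrative assumptions.

```python
import math
import random


def displacement_std(dt, hold_time, horizon=1.0, sigma=1.0, trials=2000, seed=0):
    """Empirical std of x(horizon) for the toy system dx/dt = u.

    u is zero-mean Gaussian noise with std `sigma`, redrawn every
    `hold_time` seconds and held constant in between. Setting
    hold_time == dt redraws the noise at every step.
    """
    rng = random.Random(seed)
    steps = round(horizon / dt)
    hold_steps = max(1, round(hold_time / dt))
    finals = []
    for _ in range(trials):
        x = 0.0
        u = 0.0
        for k in range(steps):
            if k % hold_steps == 0:
                u = rng.gauss(0.0, sigma)  # draw a new exploration action
            x += u * dt  # Euler step of dx/dt = u
        finals.append(x)
    mean = sum(finals) / trials
    return math.sqrt(sum((v - mean) ** 2 for v in finals) / trials)


if __name__ == "__main__":
    for dt in (0.1, 0.01, 0.001):
        per_step = displacement_std(dt, hold_time=dt)
        held = displacement_std(dt, hold_time=0.1)
        print(f"dt={dt}: per-step noise std={per_step:.4f}, held noise std={held:.4f}")
```

In this toy setting, the spread of the final state under per-step noise scales as sigma * sqrt(horizon * dt) and therefore shrinks toward zero as dt decreases, while the spread under held noise depends only on the hold interval and stays roughly constant.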

Keywords

Computer science, Control (management), Robot, Robot control, Mobile robot, Degree (music), Artificial intelligence, Human–computer interaction, Control engineering, Control theory (sociology)
