Learning to Control a 6-Degree-of-Freedom Walking Robot

Paweł Wawrzyński

Year: 2007
Citations: 23

Abstract

We analyze the problem of optimizing a control policy for a complex system through a simulated trial-and-error learning process, using Reinforcement Learning (RL). Stationary policies, which most RL methods apply, may be ill-suited to control applications: when the time discretization is fine enough, they lose their exploration capability and yield policy gradient estimators of very large variance. As a remedy, we previously proposed the use of piecewise non-Markov policies. In the experimental study presented here, we apply this approach to a 6-degree-of-freedom walking robot and obtain an efficient policy for it.
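The exploration claim can be illustrated with a toy calculation. The sketch below is not the paper's method or its robot model: it assumes a one-dimensional integrator dx/dt = a driven purely by zero-mean Gaussian action noise, and the function name `displacement_std` and all parameters are hypothetical. It compares noise redrawn at every time step (as a stationary stochastic policy would do) with noise held constant over a fixed interval, a crude stand-in for a policy that is non-Markov within each piece.

```python
import random
import statistics


def displacement_std(dt, hold, horizon=1.0, sigma=1.0, trials=500, seed=0):
    """Std of x(horizon) for dx/dt = a, where the action a is zero-mean
    Gaussian noise redrawn every `hold` seconds (hold == dt: every step)."""
    rng = random.Random(seed)
    n_steps = int(round(horizon / dt))
    steps_per_draw = max(1, int(round(hold / dt)))
    finals = []
    for _ in range(trials):
        x, step = 0.0, 0
        while step < n_steps:
            a = rng.gauss(0.0, sigma)  # one exploratory action
            k = min(steps_per_draw, n_steps - step)
            x += a * k * dt  # Euler integration over k steps with the same action
            step += k
        finals.append(x)
    return statistics.pstdev(finals)


for dt in (0.1, 0.01, 0.001):
    per_step = displacement_std(dt, hold=dt)
    held = displacement_std(dt, hold=0.1)
    print(f"dt={dt}: per-step noise {per_step:.3f}, held noise {held:.3f}")
```

Under these assumptions, the spread of the final state with per-step noise scales like the square root of the step size and so vanishes as the discretization is refined, while the held noise keeps a spread that does not depend on the step size.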

Keywords

Computer science, Control (management), Robot, Robot control, Mobile robot, Degree (music), Artificial intelligence, Human–computer interaction, Control engineering, Control theory (sociology)
