Learning to Control a 6-Degree-of-Freedom Walking Robot

Paweł Wawrzyński

Publication year: 2007
Citations: 23

Abstract

We analyze the problem of optimizing a control policy for a complex system through a simulated trial-and-error learning process, using Reinforcement Learning (RL). Stationary policies, which most RL methods apply, may be unsuitable for control applications: when the time discretization is fine enough, they exhibit no exploration capability and define policy gradient estimators of very large variance. As a remedy for these difficulties, we previously proposed the use of piecewise non-Markov policies. In the experimental study presented here, we apply that approach to a 6-degree-of-freedom walking robot and obtain an efficient policy for it.
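The exploration claim above can be illustrated with a toy sketch. It is not the paper's robot model or learning algorithm: the single-integrator system x' = u, the function name `displacement_std`, and all parameter values are assumptions made for illustration. With fresh Gaussian noise at every control step, the variance of the final state over a horizon T is T·dt·σ², which vanishes as dt shrinks. Holding each noise sample for a fixed duration τ, as a crude stand-in for a policy that is not stationary within a piece, gives variance T·τ·σ², independent of dt.

```python
import math
import random


def displacement_std(dt, hold, horizon=1.0, sigma=1.0, trials=1000, seed=0):
    """Empirical std of the final state of x' = u driven by exploration noise only.

    dt   : control time step
    hold : how long one noise sample is kept (hold == dt means fresh,
           independent noise at every step)
    All names and defaults here are hypothetical, chosen for the illustration.
    """
    rng = random.Random(seed)
    steps = round(horizon / dt)
    steps_per_hold = max(1, round(hold / dt))
    finals = []
    for _ in range(trials):
        x = 0.0
        noise = 0.0
        for k in range(steps):
            # Redraw the exploration noise only at the start of each piece.
            if k % steps_per_hold == 0:
                noise = rng.gauss(0.0, sigma)
            x += noise * dt  # Euler step of x' = u with u = noise
        finals.append(x)
    mean = sum(finals) / trials
    return math.sqrt(sum((v - mean) ** 2 for v in finals) / trials)


if __name__ == "__main__":
    for dt in (0.1, 0.01, 0.002):
        per_step = displacement_std(dt, hold=dt)
        held = displacement_std(dt, hold=0.1)
        print(f"dt={dt}: per-step noise {per_step:.3f}, held noise {held:.3f}")
```

Under these assumptions the per-step column should shrink roughly as the square root of dt, while the held column stays near the square root of 0.1, which is the sense in which per-step noise stops exploring at fine discretization.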

Keywords

Computer science; Control (management); Robot; Robot control; Mobile robot; Degree (music); Artificial intelligence; Human–computer interaction; Control engineering; Control theory (sociology)
