Reinforcement learning approach to generate goal-directed locomotion of a snake-like robot with screw-drive units
Sromona Chatterjee, Timo Nachstedt, Florentin Wörgötter, Minijia Tamosiunaite, Poramate Manoonpong, Yoshihide Enomoto, Ryo Ariizumi, Fumitoshi Matsuno
- Year
- 2014
- Citations
- 9
Abstract
In this paper we apply a policy improvement algorithm called Policy Improvement with Path Integrals (PI <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> ) to generate goal-directed locomotion of a complex snake-like robot with screw-drive units. PI <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> is numerically simple and has an ability to deal with high dimensional systems. Here, this approach is used to find proper locomotion control parameters, like joint angles and screw-drive velocities, of the robot. The learning process was achieved using a simulated robot and the learned parameters were successfully transferred to the real one. As a result the robot can locomote toward a given goal.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002