Advance Motion Acquisition of an Actual Robot by Reinforcement Learning using Reward Change
Ryota YAMASHINA, Haruhisa Motoyama, Mariko URAKAWA, Jian Huang, Tetsuro Yabuta
- 发表年份
- 2006
- 引用次数
- 8
- 访问权限
- 开放获取
摘要
Generally, a reward is given as the fixed value for the Reinforcement Learning process. However, as for a human being's training process, the reward could be changed according to the improvement of the process. Therefore, it is very interesting to study the Q-Learning process under the reward change. In this paper, Q-Learning is applied to an actual robot in order for advance motion acquisition. Results show that the Q-Learning process changes according to the reward change. Moreover, this paper clarifies how the robot obtains its optimal motion form under the learning process.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002