Self-generating method of behavioral evaluation for reinforcement learning among multiple coordinated robots
K. Ohkawa, Takanori Shibata, K. Tanie
- 发表年份
- 2002
- 引用次数
- 2
摘要
In this paper, we present a novel self-generating algorithm for behavioral evaluation, which is used to evaluate self-selected behaviour in a reinforcement learning system. This behavioral evaluation is composed of rewards and self-evaluated standards. Rewards are given by the operator as one of the methods for understanding the purpose of tasks; and self-evaluated standards are obtained as the result of executions. Each robot can generate the evaluation depending on its situations by using the proposed method, and therefore the robots can create cooperative behaviours even if the number of robots or tasks is changed dynamically. We performed simulation experiments to study the effectiveness of the proposed method. The experimental results confirm that each robot can generate evaluations for creating cooperative behaviours without changing the algorithm during the simulation experiments.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002