Multi-robot Box-pushing: Single-Agent Q-Learning vs. Team Q-Learning

Ying Wang, Clarence W. de Silva

发表年份: 2006
引用次数: 76

摘要

In this paper, two types of multi-agent reinforcement learning algorithms are employed in a task of multi-robot box-pushing. The first one is a direct extension of the single-agent Q-learning, which does not have a solid theoretical foundation because it violates the static environment assumption of the Q-learning algorithm. The second one is the Team Q-learning algorithm, which is a multi-agent reinforcement learning algorithm, and is proved to converge to the optimal policy. The states, actions, and reward function of the algorithms are presented in the paper. Based on the two Q-learning algorithms, a fully distributed multi-robot system is developed. Computer simulations are carried out using the developed system. The simulation results show that the two algorithms are effective in a simple environment. It is shown, however, that the single-agent Q-learning algorithm does a better job than the Team Q-learning algorithm in a complicated and unknown environment with many obstacles.

关键词

Reinforcement learningQ-learningComputer scienceRobotRobot learningTask (project management)Extension (predicate logic)Artificial intelligenceLearning classifier systemSimple (philosophy)

Multi-robot Box-pushing: Single-Agent Q-Learning vs. Team Q-Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory