The necessity of average rewards in cooperative multirobot learning

Poj Tangamchit, John M. Dolan, P.K. Khosla

发表年份: 2003
引用次数: 30

摘要

Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular single-robot learning algorithms based on discounted rewards, such as Q learning, do not achieve cooperation (i.e., purposeful division of labor) when applied to task-level multirobot systems. A task-level system is defined as one performing a mission that is decomposed into subtasks shared among robots. We demonstrate the superiority of average-reward-based learning such as the Monte Carlo algorithm for task-level multirobot systems, and suggest an explanation for this superiority.

关键词

Task (project management)Computer scienceRobotArtificial intelligenceMonte Carlo methodTask analysisMachine learningRobot learningMobile robotEngineering

The necessity of average rewards in cooperative multirobot learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory