首页 /研究 /Dynamic correlation matrix based multi-Q learning for a multi-robot system

SWARM

Dynamic correlation matrix based multi-Q learning for a multi-robot system

Hongliang Guo, Yan Meng

发表年份: 2008
引用次数: 13

摘要

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selections, and difficulty in merging learned experiences from other robots. In this paper, we propose a dynamic correlation matrix based multi-Q learning (DCM-MultiQ) method for a distributed multi-robot system. A novel dynamic correlation matrix is proposed, which not only handles each agentpsilas Q value, but also deals with the correlation among agents. Furthermore, a theoretical proof of the convergence of the proposed DCM-MultiQ algorithm is also provided using a feedback matrix control theory. To evaluate the efficiency of the proposed DCM-MultiQ method, several case studies of a multi-robot system in forage tasks have been conducted. The simulation results show the efficiency and convergence of the proposed method.

关键词

Computer scienceRobotCorrelationMatrix (chemical analysis)Artificial intelligenceMathematicsMaterials science

Dynamic correlation matrix based multi-Q learning for a multi-robot system

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control