Hierarchical Reinforcement Learning for Swarm Confrontation with High Uncertainty
Qizhen Wu, Kexin Liu, Lei Chen, Jinhu Lü
- 发表年份
- 2024
- 访问权限
- 开放获取
摘要
In swarm robotics, confrontation including the pursuit-evasion game is a key scenario. High uncertainty caused by unknown opponents' strategies, dynamic obstacles, and insufficient training complicates the action space into a hybrid decision process. Although the deep reinforcement learning method is significant for swarm confrontation since it can handle various sizes, as an end-to-end implementation, it cannot deal with the hybrid process. Here, we propose a novel hierarchical reinforcement learning approach consisting of a target allocation layer, a path planning layer, and the underlying dynamic interaction mechanism between the two layers, which indicates the quantified uncertainty. It decouples the hybrid process into discrete allocation and continuous planning layers, with a probabilistic ensemble model to quantify the uncertainty and regulate the interaction frequency adaptively. Furthermore, to overcome the unstable training process introduced by the two layers, we design an integration training method including pre-training and cross-training, which enhances the training efficiency and stability. Experiment results in both comparison, ablation, and real-robot studies validate the effectiveness and generalization performance of our proposed approach. In our defined experiments with twenty to forty agents, the win rate of the proposed method reaches around ninety percent, outperforming other traditional methods.
关键词
相关论文
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002
Swarm Intelligence
Eric Bonabeau, Marco Dorigo, Guy Théraulaz
1999
Design and use paradigms for gazebo, an open-source multi-robot simulator
Nathan Koenig, A. Howard
2005
Swarm robotics: a review from the swarm engineering perspective
Manuele Brambilla, Eliseo Ferrante, Mauro Birattari 等 4 位作者
2013