Generative adversarial imitation learning for robot swarms: Learning from human demonstrations and trained policies
Mattes Kraus, Jonas Kuckling
- Publication year
- 2026
- Access
- Open access
Abstract
In imitation learning, robots are meant to learn from demonstrations of the desired behavior. Most work on imitation learning for swarm robotics provides the demonstrations as rollouts of an existing policy. In this work, we present a framework based on generative adversarial imitation learning that aims to learn collective behaviors from human demonstrations. We evaluate the framework on six different missions, learning both from manual demonstrations and from demonstrations derived from a PPO-trained policy. Results show that the imitation learning process learns qualitatively meaningful behaviors that perform comparably to the provided demonstrations. Additionally, we deploy the learned policies on a swarm of TurtleBot 4 robots in real-robot experiments. The exhibited behaviors preserve their visually recognizable character, and their performance is comparable to that achieved in simulation.