Integrating sporadic imitation in Reinforcement Learning robots

Willi Richert, Ulrich Scheller, Markus Koch, Bernd Kleinjohann, Claudius Stern

Year of publication: 2009
Citations: 2

Abstract

Although the combination of reinforcement learning and imitation has already been considered in recent research, that work has always revolved around fixed settings in which the demonstrator and imitator are fixed and the imitation process occupies a well-defined period of time. What is missing is an investigation of approaches that also work in scenarios where imitation is only sporadically possible. This means that, in a multi-robot scenario, a robot is not allowed to interrupt another robot by asking it to repeat certain actions, but can only observe and integrate bits of information delivered occasionally. In this paper we present how this can be done in a continuous and noisy environment within a semi-Markov decision process (SMDP) context.

Keywords

Interrupt; Imitation; Robot; Reinforcement learning; Computer science; Context (archaeology); Artificial intelligence; Human–computer interaction; Process (computing); Psychology
