Reinforcement Learning for a Human-Following Robot

Yan Wang, David Lee

发表年份: 2006
引用次数: 6

摘要

This paper discusses the use of a mobile robot following a person. It focuses on the less researched interaction with the human attitude through robot movements. The reward, which indicates the attitude of the human, is used to train the network so that the robot learns an appropriate position relative to the person. The algorithm presented in this study overcomes the difficulty that the feedback reward score given by the human has no gradient throughout large parts of the input space. This network works online and has the ability to adapt to unpredictable changes in the person's preference

关键词

Reinforcement learningRobotComputer scienceMobile robotArtificial intelligencePreferenceHuman–computer interactionHuman–robot interactionPosition (finance)Space (punctuation)

Reinforcement Learning for a Human-Following Robot

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory