首页 /研究 /Using policy gradient reinforcement learning on autonomous robot controllers

LEARNING

Using policy gradient reinforcement learning on autonomous robot controllers

Gregory Z. Grudić, Vishal Kumar, Lyle Ungar

发表年份: 2004
引用次数: 20

摘要

Robot programmers can often quickly program a robot to approximately execute a task under specific environment conditions. However, achieving robust performance under more general conditions is significantly more difficult. We propose a framework that starts with an existing control system and uses reinforcement feedback from the environment to autonomously improve the controller's performance. We use the policy gradient reinforcement learning (PGRL) framework, which estimates a gradient (in controller space) of improved reward, allowing the controller parameters to be incrementally updated to autonomously achieve locally optimal performance. Our approach is experimentally verified on a Cye robot executing a room entry and observation task, showing significant reduction in task execution time and robustness with respect to un-modelled changes in the environment.

关键词

Reinforcement learningRobustness (evolution)RobotComputer scienceTask (project management)Controller (irrigation)Robust controlControl engineeringControl theory (sociology)Artificial intelligence

Using policy gradient reinforcement learning on autonomous robot controllers

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory