LEARNING

Approximating the value function for continuous space reinforcement learning in robot control

Sebastian Buck, Michael Beetz, Thorsten Schmitt

Year of publication: 2003
Citations: 9

Abstract

Many robot learning tasks are very difficult to solve: their state spaces are high dimensional, variables and command parameters are continuously valued, and system states are only partly observable. In this paper, we propose to learn a continuous space value function for reinforcement learning using neural networks trained from data of exploration runs. The learned function is guaranteed to be a lower bound for, and reproduces the characteristic shape of, the accurate value function. We apply our approach to two robot navigation tasks, discuss how to deal with possible problems occurring in practice, and assess its performance.
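The abstract does not spell out how training data are derived from exploration runs, so the following is only a minimal sketch under stated assumptions, not the authors' procedure. It assumes each run is recorded as a list of states with the reward received at each step, and it uses the discounted return actually observed from each state as the regression target for a function approximator. In a deterministic task, the return achieved by any run cannot exceed the optimal value of its starting state, which is one way such targets can act as lower bounds. The names `returns_to_go` and `training_pairs` and the discount factor `gamma` are hypothetical.

```python
from typing import List, Sequence, Tuple

State = Tuple[float, ...]          # continuous state vector (assumed representation)
Run = Tuple[Sequence[State], Sequence[float]]  # (states, rewards) of one exploration run


def returns_to_go(rewards: Sequence[float], gamma: float = 0.9) -> List[float]:
    """Discounted return observed from each step to the end of one run."""
    out = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of its successor.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        out[t] = running
    return out


def training_pairs(runs: Sequence[Run], gamma: float = 0.9) -> List[Tuple[State, float]]:
    """Collect (state, observed return) pairs to fit a value approximator on."""
    pairs: List[Tuple[State, float]] = []
    for states, rewards in runs:
        for state, target in zip(states, returns_to_go(rewards, gamma)):
            pairs.append((state, target))
    return pairs


if __name__ == "__main__":
    # One hypothetical three-step run that reaches the goal on the last step.
    run = ([(0.0, 0.0), (0.5, 0.0), (1.0, 0.0)], [0.0, 0.0, 1.0])
    for state, target in training_pairs([run], gamma=0.5):
        print(state, target)
```

The resulting pairs could then be passed to any regression model, such as a small neural network. With stochastic dynamics or rewards, a single observed return is only a sample, so the lower-bound argument above no longer holds run by run.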

Keywords

Reinforcement learning, Robot, Bellman equation, Function (biology), Computer science, State space, Artificial intelligence, Artificial neural network, Value (mathematics), Space (punctuation)
