Practical Reinforcement Learning in Continuous Spaces

William D. Smart, Leslie Pack Kaelbling

Year: 2000
Citations: 211

Abstract

Dynamic control tasks are good candidates for the application of reinforcement learning techniques. However, many of these tasks inherently have continuous state or action variables. This can cause problems for traditional reinforcement learning algorithms which assume discrete states and actions. In this paper, we introduce an algorithm that safely approximates the value function for continuous state control tasks, and that learns quickly from a small amount of data. We give experimental results using this algorithm to learn policies for both a simulated task and also for a real robot, operating in an unaltered environment. The algorithm works well in a traditional learning setting, and demonstrates extremely good learning when bootstrapped with a small amount of human-provided data. 1.

Keywords

Reinforcement learningComputer scienceTask (project management)Artificial intelligenceBellman equationLearning classifier systemRobotState (computer science)Action (physics)Control (management)

Practical Reinforcement Learning in Continuous Spaces

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory