On-line robot learning using the interval estimation algorithm

Tijn van der Zant, Wiering, Jan van Eijck

Year: 2005
Citations: 3

Abstract

Many reinforcement learning algorithms learn over the full state space. In the RoboCup mid-size league this is infeasible during real games because the state space is immense. This paper suggests a way to reduce the state space significantly by selecting among behaviors that are triggered by only a few states. In fact, to make the robot keeper learn very quickly which behavior best defends the goal, we used only a single state in our experiments. Several implementations are made of a behavior with a given goal. From this behavior set, the interval estimation algorithm chooses the behavior that has the highest probability of actually achieving the highest possible performance. The result is fast learning, although the reduced state space also means that some solutions cannot be found.
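The selection step described in the abstract can be illustrated with a minimal sketch of interval estimation over a single state. The abstract does not give the paper's interval formula or reward definition, so everything specific here is an assumption: each trial is treated as a binary success or failure (for example, a shot saved or not), the upper bound is a Wilson score bound with `z = 1.96`, and the names `IntervalEstimationSelector` and `simulate` are hypothetical.

```python
import math
import random


class IntervalEstimationSelector:
    """Single-state behavior selection by highest upper confidence bound.

    Assumption: binary outcomes and a Wilson score upper bound; the
    paper's own interval formula is not stated in the abstract.
    """

    def __init__(self, n_behaviors, z=1.96):
        self.z = z  # assumed confidence level (about 95%)
        self.successes = [0] * n_behaviors
        self.trials = [0] * n_behaviors

    def upper_bound(self, i):
        n = self.trials[i]
        if n == 0:
            return 1.0  # untried behaviors get the most optimistic bound
        p = self.successes[i] / n
        z2 = self.z ** 2
        centre = p + z2 / (2 * n)
        spread = self.z * math.sqrt(p * (1 - p) / n + z2 / (4 * n * n))
        return (centre + spread) / (1 + z2 / n)

    def select(self):
        # Pick the behavior with the highest upper bound; ties go to
        # the lowest index.
        bounds = [self.upper_bound(i) for i in range(len(self.trials))]
        return max(range(len(bounds)), key=bounds.__getitem__)

    def update(self, i, success):
        self.trials[i] += 1
        self.successes[i] += int(success)


def simulate(success_probs, n_trials=500, seed=0):
    """Run the selector against hypothetical fixed success probabilities."""
    rng = random.Random(seed)
    selector = IntervalEstimationSelector(len(success_probs))
    for _ in range(n_trials):
        b = selector.select()
        selector.update(b, rng.random() < success_probs[b])
    return selector
```

Calling `simulate([0.2, 0.5, 0.8])` returns a selector whose `trials` list shows how often each hypothetical behavior was tried. Because a behavior's bound narrows only when that behavior is executed, rarely tried behaviors keep a wide interval and are still selected occasionally, which is how the method balances exploration against exploitation.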

Keywords

Reinforcement learning; Robot; State space; Computer science; State (computer science); Artificial intelligence; Interval (graph theory); Space (punctuation); Line (geometry); Set (abstract data type)
