Enhanced continuous valued Q-learning for real autonomous robots

Masanori Takeda, Takayuki Nakamura, Masakazu Imai, Tsukasa Ogasawara, Minoru Asada

Year of publication: 2000
Citations: 11

Abstract

Q-learning, one of the most widely used reinforcement learning methods, normally needs well-defined quantized state and action spaces to obtain an optimal policy for accomplishing a given task. This makes it difficult to apply to real robot tasks, because poorly quantized continuous state and action spaces lead to poor performance of the learned behavior. To deal with this problem, we proposed continuous valued Q-learning (Takahashi et al., 1999) (hereafter called CVQ-learning) for real robot applications. This method uses a function approximation method to represent the action value function. In this paper, we point out that this type of learning method potentially suffers from a discontinuity problem: the optimal action for a given state can change discontinuously. To resolve this problem, this paper proposes a method for estimating where the discontinuity of the optimal action takes place and for refining the state space for CVQ-learning. To show the validity of our method, we apply it to a vision-guided mobile robot whose task is to chase a ball. Although the task is simple, the performance is quite impressive.
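To make the contrast in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It compares a quantized lookup of action values with a continuous representation. The abstract does not say which function approximator CVQ-learning uses, so linear interpolation between representative states is an assumption here, and the names `REP_STATES`, `Q_TABLE`, `q_quantized`, `q_interpolated`, and `greedy_action`, as well as all numeric values, are hypothetical.

```python
from bisect import bisect_right

# Hypothetical 1-D state space with three representative states (assumed values).
REP_STATES = [0.0, 0.5, 1.0]
# Assumed Q(s_i, a_j) for two discrete actions at each representative state.
Q_TABLE = [[0.0, 1.0], [0.5, 0.5], [1.0, 0.0]]


def q_quantized(state):
    """Return the action values of the nearest representative state."""
    idx = min(range(len(REP_STATES)), key=lambda i: abs(REP_STATES[i] - state))
    return list(Q_TABLE[idx])


def q_interpolated(state):
    """Return action values linearly interpolated between neighbouring
    representative states (an assumed stand-in for function approximation)."""
    s = min(max(state, REP_STATES[0]), REP_STATES[-1])  # clamp to the grid
    hi = min(bisect_right(REP_STATES, s), len(REP_STATES) - 1)
    lo = hi - 1
    w = (s - REP_STATES[lo]) / (REP_STATES[hi] - REP_STATES[lo])
    return [(1 - w) * ql + w * qh for ql, qh in zip(Q_TABLE[lo], Q_TABLE[hi])]


def greedy_action(q_values):
    """Return the index of the action with the largest value."""
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

In this toy table the interpolated values vary smoothly with the state, yet `greedy_action(q_interpolated(s))` still switches from action 1 to action 0 as `s` crosses 0.5, which loosely illustrates how the best action can change abruptly even when the value function is continuous.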

Keywords

Q-learning; Reinforcement learning; Robot; Image (mathematics); Artificial intelligence; Computer science; Computer vision; Mathematics
