首页 /研究 /Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot

LEARNING

Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot

Xin Xu, Hangen He

发表年份: 2003
引用次数: 8

摘要

Based on the idea of dynamic programming, reinforcement learning (RL) has become an important model-free method to solve difficult optimal control problems. In this paper, a novel neural RL method is proposed to solve the time-optimal control problem of a class of under-actuated robots, which is called the acrobot. The RL method uses a modified residual gradient reinforcement learning algorithm called RGNP (residual gradient with nonstationary policy). The RGNP algorithm not only has guaranteed convergence under certain conditions but also can ensure the performance of the approximated optimal policy, which is superior to the previous residual gradient algorithms. Simulation results of the learning control of the acrobot illustrate the effectiveness of the proposed method.

关键词

Reinforcement learningResidualConvergence (economics)Computer scienceControl theory (sociology)Gradient methodOptimal controlMathematical optimizationRobotArtificial neural network

Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory