首页 /研究 /A Bio-inspired Reinforcement Learning Rule to Optimise Dynamical Neural Networks for Robot Control

LOCOMOTION

A Bio-inspired Reinforcement Learning Rule to Optimise Dynamical Neural Networks for Robot Control

Tianqi Wei, Barbara Webb

发表年份: 2018
引用次数: 7

摘要

Most approaches for optimisation of neural networks are based on variants of back-propagation. This requires the network to be time invariant and differentiable; neural networks with dynamics are thus generally outside the scope of these methods. Biological neural circuits are highly dynamic yet clearly able to support learning. We propose a reinforcement learning approach inspired by the mechanisms and dynamics of biological synapses. The network weights undergo spontaneous fluctuations, and a reward signal modulates the centre and amplitude of fluctuations to converge to a desired network behaviour. We test the new learning rule on a 2D bipedal walking simulation, using a control system that combines a recurrent neural network, a bio-inspired central pattern generator layer and proportional-integral control, and demonstrate the first successful solution to this benchmark task.

关键词

Artificial neural networkReinforcement learningComputer scienceArtificial intelligenceBenchmark (surveying)Learning ruleDynamical systems theoryRecurrent neural networkMachine learning

A Bio-inspired Reinforcement Learning Rule to Optimise Dynamical Neural Networks for Robot Control

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory