首页 /研究 /A Bio-inspired Reinforcement Learning Rule to Optimise Dynamical Neural Networks for Robot Control
LOCOMOTION

A Bio-inspired Reinforcement Learning Rule to Optimise Dynamical Neural Networks for Robot Control

Tianqi Wei, Barbara Webb

发表年份
2018
引用次数
7

摘要

Most approaches for optimisation of neural networks are based on variants of back-propagation. This requires the network to be time invariant and differentiable; neural networks with dynamics are thus generally outside the scope of these methods. Biological neural circuits are highly dynamic yet clearly able to support learning. We propose a reinforcement learning approach inspired by the mechanisms and dynamics of biological synapses. The network weights undergo spontaneous fluctuations, and a reward signal modulates the centre and amplitude of fluctuations to converge to a desired network behaviour. We test the new learning rule on a 2D bipedal walking simulation, using a control system that combines a recurrent neural network, a bio-inspired central pattern generator layer and proportional-integral control, and demonstrate the first successful solution to this benchmark task.

关键词

Artificial neural networkReinforcement learningComputer scienceArtificial intelligenceBenchmark (surveying)Learning ruleDynamical systems theoryRecurrent neural networkMachine learning

相关论文

查看 LOCOMOTION 分类全部论文