Temporal difference learning

Related papers: 20

Most cited papers

Natural Actor-Critic

Jan Peters, Stefan Schaal

Citations: 751 • 2008

Survey of Model-Based Reinforcement Learning: Applications on Robotics

Athanasios Polydoros, Lazaros Nalpantidis

Citations: 538 • 2017

Average reward reinforcement learning: Foundations, algorithms, and empirical results

Sridhar Mahadevan

Citations: 401 • 1996

Continuous Deep Q-Learning with Model-based Acceleration

Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine

Citations: 337 • 2016

Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction

Richard S. Sutton, Joseph Modayil, Michael Delp, Thomas Degris, Patrick M. Pilarski, Adam White, Doina Precup

Citations: 305 • 2011

Reinforcement learning of motor skills in high dimensions: A path integral approach

Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal

Citations: 257 • 2010

Temporal abstraction in reinforcement learning

Doina Precup, Richard S. Sutton

Citations: 247 • 2000

Efficient reinforcement learning: computational theories, neuroscience and robotics

Mitsuo Kawato, Kazuyuki Samejima

Citations: 91 • 2007

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

Matthew E. Taylor, Shimon Whiteson, Peter Stone

Citations: 88 • 2006

Multi-Robot Flocking Control Based on Deep Reinforcement Learning

Pengming Zhu, Wei Dai, Weijia Yao, Junchong Ma, Zhiwen Zeng, Huimin Lu

Citations: 76 • 2020

Isotropic Sequence Order Learning

Bernd Porr, Florentin Wörgötter

Citations: 74 • 2003

Learning from Limited Demonstrations

Beomjoon Kim, Amir massoud Farahmand, Joëlle Pineau, Doina Precup

Citations: 72 • 2013

Vision-based reinforcement learning for purposive behavior acquisition

Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, Koh Hosoda

Citations: 71 • 2002

Reinforcement learning of dynamic motor sequence: learning to stand up

Jun Morimoto, Kenji Doya

Citations: 68 • 2002

Multi-timescale nexting in a reinforcement learning robot

Joseph Modayil, Adam White, Richard S. Sutton

Citations: 68 • 2014

Recent Advances in Reinforcement Learning

Leslie Pack Kaelbling

Citations: 62 • 1996

A Novel Hierarchical Soft Actor-Critic Algorithm for Multi-Logistics Robots Task Allocation

Hengliang Tang, Anqi Wang, Fei Xue, Jiaxin Yang, Yang Cao

Citations: 58 • 2021

Learning to Control an Octopus Arm with Gaussian Process Temporal Difference Methods

Yaakov Engel, Peter Szabo, Dmitry Volkinshtein

Citations: 57 • 2005

Control delay in Reinforcement Learning for real-time dynamic systems: A memoryless approach

E. Schuitema, Lucian Buşoniu, Robert Babuška, Pieter Jonker

Citations: 56 • 2010

Adaptive state space partitioning for reinforcement learning

Ivan S.K. Lee, Henry Y. K. Lau

Citations: 43 • 2004