Temporal difference learning
相关论文数: 20
顶级研究者
最高引用论文
Natural Actor-Critic
Jan Peters, Stefan Schaal
引用数: 751 • 2008
Survey of Model-Based Reinforcement Learning: Applications on Robotics
Athanasios Polydoros, Lazaros Nalpantidis
引用数: 538 • 2017
Average reward reinforcement learning: Foundations, algorithms, and empirical results
Sridhar Mahadevan
引用数: 401 • 1996
Continuous Deep Q-Learning with Model-based Acceleration
Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine
引用数: 337 • 2016
Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
Richard S. Sutton, Joseph Modayil, Michael Delp, Thomas Degris, Patrick M. Pilarski, Adam White, Doina Precup
引用数: 305 • 2011
Reinforcement learning of motor skills in high dimensions: A path integral approach
Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal
引用数: 257 • 2010
Temporal abstraction in reinforcement learning
Doina Precup, Richard S. Sutton
引用数: 247 • 2000
Efficient reinforcement learning: computational theories, neuroscience and robotics
Mitsuo Kawato, Kazuyuki Samejima
引用数: 91 • 2007
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
Matthew E. Taylor, Shimon Whiteson, Peter Stone
引用数: 88 • 2006
Multi-Robot Flocking Control Based on Deep Reinforcement Learning
Pengming Zhu, Wei Dai, Weijia Yao, Junchong Ma, Zhiwen Zeng, Huimin Lu
引用数: 76 • 2020
Isotropic Sequence Order Learning
Bernd Porr, Florentin Wörgötter
引用数: 74 • 2003
Learning from Limited Demonstrations
Beomjoon Kim, Amir massoud Farahmand, Joëlle Pineau, Doina Precup
引用数: 72 • 2013
Vision-based reinforcement learning for purposive behavior acquisition
Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, Koh Hosoda
引用数: 71 • 2002
Reinforcement learning of dynamic motor sequence: learning to stand up
Jun Morimoto, Kenji Doya
引用数: 68 • 2002
Multi-timescale nexting in a reinforcement learning robot
Joseph Modayil, Adam White, Richard S. Sutton
引用数: 68 • 2014
Recent Advances in Reinforcement Learning
Leslie Pack Kaelbling
引用数: 62 • 1996
A Novel Hierarchical Soft Actor-Critic Algorithm for Multi-Logistics Robots Task Allocation
Hengliang Tang, Anqi Wang, Fei Xue, Jiaxin Yang, Yang Cao
引用数: 58 • 2021
Learning to Control an Octopus Arm with Gaussian Process Temporal Difference Methods
Yaakov Engel, Peter Szabo, Dmitry Volkinshtein
引用数: 57 • 2005
Control delay in Reinforcement Learning for real-time dynamic systems: A memoryless approach
E. Schuitema, Lucian Buşoniu, Robert Babuška, Pieter Jonker
引用数: 56 • 2010
Adaptive state space partitioning for reinforcement learning
Ivan S.K. Lee, Henry Y. K. Lau
引用数: 43 • 2004