Bellman equation

相关论文数: 20

最高引用论文

High-Dimensional Continuous Control Using Generalized Advantage Estimation

John Schulman, Philipp Moritz, Sergey Levine, Michael I. Jordan, Pieter Abbeel

引用数: 1750 • 2015

Natural Actor-Critic

Jan Peters, Stefan Schaal

引用数: 751 • 2008

A Generalized Path Integral Control Approach to Reinforcement Learning

Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal

引用数: 449 • 2010

Anytime Point-Based Approximations for Large POMDPs

Joëlle Pineau, Geoff Gordon, Sebastian Thrun

引用数: 373 • 2006

Natural Actor-Critic

Jan Peters, Sethu Vijayakumar, Stefan Schaal

引用数: 312 • 2005

Motion planning under uncertainty using iterative local optimization in belief space

Jur van den Berg, Sachin Patil, Ron Alterovitz

引用数: 305 • 2012

Finding Approximate POMDP solutions Through Belief Compression

Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun

引用数: 253 • 2005

Point-Based Value Iteration for Continuous POMDPs

Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart

引用数: 246 • 2006

Model-Free reinforcement learning with continuous action in practice

Thomas Degris, Patrick M. Pilarski, Richard S. Sutton

引用数: 234 • 2012

Planning in the Presence of Cost Functions Controlled by an Adversary

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

引用数: 228 • 2018

Practical Reinforcement Learning in Continuous Spaces

William D. Smart, Leslie Pack Kaelbling

引用数: 211 • 2000

An efficiently solvable quadratic program for stabilizing dynamic locomotion

引用数: 183 • 2014

Inverse Optimal Control with Linearly-Solvable MDPs

Krishnamurthy Dvijotham, Emanuel Todorov

引用数: 119 • 2010

Bisimulation Metrics for Continuous Markov Decision Processes

Norm Ferns, Prakash Panangaden, Doina Precup

引用数: 100 • 2011

Learning to Drive a Real Car in 20 Minutes

Martin Riedmiller, Mike Montemerlo, Hendrik Dahlkamp

引用数: 92 • 2007

Parametric POMDPs for planning in continuous state spaces

Alex Brooks, Alexei Makarenko, Stefan B. Williams, Hugh Durrant‐Whyte

引用数: 90 • 2006

Safety Augmented Value Estimation From Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks

Brijen Thananjeyan, Ashwin Balakrishna, Ugo Rosolia, Felix Li, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine, Francesco Borrelli

引用数: 90 • 2020

Bridging Hamilton-Jacobi Safety Analysis and Reinforcement Learning

Jaime F. Fisac, Neil F. Lugovoy, Vicenç Rubies-Royo, Shromona Ghosh, Claire J. Tomlin

引用数: 85 • 2019

Jumping over obstacles with MIT Cheetah 2

Hae-Won Park, Patrick M. Wensing, Sangbae Kim

引用数: 77 • 2020

A Deep Q-network (DQN) Based Path Planning Method for Mobile Robots

Siyu Zhou, Xin Liu, Yingfu Xu, Jifeng Guo

引用数: 75 • 2018