Trust region
Related papers: 20
Most cited papers
Trust Region Policy Optimization
John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel
Citations: 3141 • 2015
Guided Policy Search via Approximate Mirror Descent
William Montgomery, Sergey Levine
Citations: 83 • 2016
An incremental trust-region method for robust online sparse least-squares estimation
David M. Rosen, Michael Kaess, John J. Leonard
Citations: 73 • 2012
A hybrid conjugate gradient based approach for solving unconstrained optimization and motion control problems
Auwal Bala Abubakar, Poom Kumam, Maulana Malik, Abdulkarim Hassan Ibrahim
Citations: 51 • 2021
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Kaiqing Zhang, Alec Koppel, Hao Zhu, Tamer Başar
Citations: 44 • 2019
A new trust region–sequential quadratic programming approach for nonlinear systems based on nonlinear model predictive control
Zhongbo Sun, Yifang Sun, Y. Li, K.P. Liu
Citations: 41 • 2018
Optimizing Expectations: From Deep Reinforcement Learning to Stochastic Computation Graphs
John Schulman
Citations: 37 • 2016
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim, Songhwai Oh
Citations: 21 • 2022
Reinforcement Learning for UAV Attitude Control
William R. Koch, Renato Mancuso, Richard West, Azer Bestavros
Citations: 19 • 2019
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk
Dohyeong Kim, Songhwai Oh
Citations: 17 • 2022
A Differentiable Augmented Lagrangian Method for Bilevel Nonlinear Optimization
Benoit Landry, Zachary Manchester, Marco Pavone
Citations: 16 • 2019
Guided Policy Search as Approximate Mirror Descent
William Montgomery, Sergey Levine
Citations: 15 • 2016
Stochastic Variance Reduction for Policy Gradient Estimation
Tian-Bing Xu, Qiang Liu, Jian Peng
Citations: 10 • 2017
Trust dampening and trust promoting: A dual-pathway of trust calibration in human-robot interaction
Xinyu Huang, Ye Li
Citations: 7 • 2024
Two steps natural actor critic learning for underwater cable tracking
Andrés El-Fakdi, Marc Carreras, Enric Galceran
Citations: 7 • 2010
Bayesian Optimization Based Trust Model for Human Multi-robot Collaborative Motion Tasks in Offroad Environments
Huanfei Zheng, Jonathon M. Smereka, Dariusz Mikulski, Yue Wang
Citations: 6 • 2023
Smoothing Policies and Safe Policy Gradients
Matteo Papini, Matteo Pirotta, Marcello Restelli
Citations: 6 • 2019
Hindsight Trust Region Policy Optimization
Hanbo Zhang, Xuguang Lan, David Hsu, Nanning Zheng
Citations: 5 • 2021
A Fast and Robust Algorithm for General Inequality/Equality Constrained Minimum-Time Problems
B.J. Driessen, Nader Sadegh, Gordon G. Parker, G. Richard Eisler
Citations: 5 • 1999
Deep Black-Box Reinforcement Learning with Movement Primitives
Fabian Otto, Onur Çelik, Hongyi Zhou, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann
Citations: 5 • 2022