Policy learning
Related papers: 20
Top Researchers
Top Cited Papers
A Survey on Policy Search for Robotics
Marc Peter Deisenroth
Citations: 684 • 2011
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine
Citations: 265 • 2018
Transfer via inter-task mappings in policy search reinforcement learning
Matthew E. Taylor, Shimon Whiteson, Peter Stone
Citations: 133 • 2007
Multi-task policy search for robotics
Marc Peter Deisenroth, Péter Englert, Jan Peters, Dieter Fox
Citations: 121 • 2014
Effect of human guidance and state space size on Interactive Reinforcement Learning
Halit Bener Suay, Sonia Chernova
Citations: 118 • 2011
Interactive Learning from Policy-Dependent Human Feedback
James MacGlashan, Mark K. Ho, Robert Loftin, Bei Peng, Guan Wang, David L. Roberts, Matthew E. Taylor, Michael L. Littman
Citations: 108 • 2017
Preference-Based Policy Learning
Riad Akrour, Marc Schoenauer, Michèle Sébag
Citations: 83 • 2011
Explanation-Based Reward Coaching to Improve Human Performance via Reinforcement Learning
Aaquib Tabrez, Shivendra Agrawal, Bradley Hayes
Citations: 62 • 2019
Stochastic Abstract Policies: Generalizing Knowledge to Improve Reinforcement Learning
Marcelo Li Koga, Valdinei Freire, Anna Helena Reali Costa
Citations: 45 • 2014
A residual reinforcement learning method for robotic assembly using visual and force information
Zhuangzhuang Zhang, Yizhao Wang, Zhinan Zhang, Lihui Wang, Huang Huang, Qixin Cao
Citations: 40 • 2023
Any-point Trajectory Modeling for Policy Learning
Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel
Citations: 40 • 2024
Reinforcement Learning for Pivoting Task
Rika Antonova, Silvia Cruciani, Christian Smith, Danica Kragić
Citations: 36 • 2017
Affordance Learning from Play for Sample-Efficient Policy Learning
Jessica Borja-Diaz, Oier Mees, Gabriel Kalweit, Lukás Hermann, Joschka Boedecker, Wolfram Burgard
Citations: 29 • 2022
Velocity adaptation for self-improvement of skills learned from user demonstrations
Bojan Nemec, Andrej Gams, Aleš Ude
Citations: 28 • 2013
GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment
Xin Ye, Zhe Lin, Joon‐Young Lee, Jianming Zhang, Shibin Zheng, Yezhou Yang
Citations: 26 • 2019
Transfer Learning for Policy Search Methods
Shimon Whiteson
Citations: 25 • 2006
Sample and time efficient policy learning with CMA-ES and Bayesian Optimisation
Léni K. Le Goff, Edgar Buchanan, Emma Hart, A. E. Eiben, Wei Li, Matteo De Carlo, Matthew F. Hale, Mike Angus, Robert Woolley, Jon Timmis, Alan Winfield, Andrew M. Tyrrell
Citations: 20 • 2020
Learning policies for attentional control
Luiz Marcos Garcia Gonçalves, Gilson A. Giraldi, Antonio A. F. Oliveira, Roderic A. Grupen
Citations: 15 • 2003
Interaction-Aware Multi-Agent Reinforcement Learning for Mobile Agents with Individual Goals
Anahita Mohseni-Kabir, David Isele, Kikuo Fujimura
Citations: 14 • 2019
Learning Environmental Calibration Actions for Policy Self-Evolution
Chao Zhang, Yang Yu, Zhi‐Hua Zhou
Citations: 13 • 2018