Max Robotics — Global Service Robot Directory + US Certification

About

Pieter Abbeel is one of the most influential researchers at the intersection of robotics, deep learning, and reinforcement learning (RL). His work has shaped how machines learn to perceive, plan, and act in complex real-world environments. His most cited contribution is Trust Region Policy Optimization (TRPO), a reinforcement learning algorithm whose trust-region constraint on each policy update is motivated by a monotonic-improvement guarantee; it has accumulated over 3,100 citations. With collaborators, he pioneered end-to-end deep visuomotor policy learning, in which robots acquire control skills directly from raw visual input without hand-engineered perception pipelines. His work on domain randomization (2,700+ citations) helped bridge the simulation-to-reality gap, and Soft Actor-Critic addressed sample efficiency and stability in continuous control.

Beyond algorithms, Abbeel has contributed practical tools for the robotics community, including the widely adopted YCB object and model set benchmark and motion planning based on sequential convex optimization. DeepMimic applied reinforcement learning to physically realistic character animation. Several of his papers exceed 1,000 citations each, and his work across robotic manipulation, deep RL, and sim-to-real transfer has accelerated progress toward capable, autonomous robotic systems.

Research Focus

Computer science: 253 papers · 34,390 citations
Artificial intelligence: 245 papers · 31,601 citations
Robot: 166 papers · 23,374 citations
Reinforcement learning: 97 papers · 15,515 citations
Machine learning: 104 papers · 11,729 citations
Mathematics: 71 papers · 11,507 citations
Engineering: 91 papers · 11,418 citations
Artificial neural network: 26 papers · 10,483 citations
Computer vision: 72 papers · 8,661 citations
Mathematical optimization: 24 papers · 8,342 citations
Robotics: 46 papers · 7,986 citations
Deep learning: 21 papers · 7,979 citations

Key Achievements

H-Index: 78
Papers: 256
Total Citations: 34,399
Avg Citations/Paper: 134
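
The average citations per paper follows directly from the two totals on this page (34,399 / 256 ≈ 134). The H-index is the largest h such that h papers have at least h citations each. The following is a minimal Python sketch of both calculations, using only figures shown on this page. The function name `h_index` is illustrative, and the ten-paper list is only the Top Papers sample from this page, so it cannot reproduce the full H-index of 78.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


# Totals as listed on this page.
total_citations = 34_399
paper_count = 256
avg_citations = total_citations / paper_count  # rounds to the listed 134

# Citation counts of the ten papers listed under Top Papers (a sample only).
top_ten = [3141, 2736, 1952, 1750, 1715, 1399, 840, 826, 807, 802]
sample_h = h_index(top_ten)  # bounded by the sample size of ten
```

Because every paper in the ten-paper sample has far more than ten citations, the sample alone can only show that the H-index is at least 10; the full publication list is needed to arrive at 78.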

🏆 Most Cited Paper: Trust Region Policy Optimization (3,141 citations · 2015)

📈 Most Prolific Year: 2015 (33 Papers)

🤝 Key Collaborators: 492

🏛 Institutions: University of California, Berkeley, International Computer Science Institute, University of California System, Technische Universität Darmstadt, Machine Intelligence Research Institute, OpenAI (United States)

Top Papers

1. Trust Region Policy Optimization · 3,141 citations · 2015
2. Domain randomization for transferring deep neural networks from simulation to the real world · 2,736 citations · 2017
3. Soft Actor-Critic Algorithms and Applications · 1,952 citations · 2018
4. High-Dimensional Continuous Control Using Generalized Advantage Estimation · 1,750 citations · 2015
5. End-to-end training of deep visuomotor policies · 1,715 citations · 2016
6. End-to-End Training of Deep Visuomotor Policies · 1,399 citations · 2015
7. Motion planning with sequential convex optimization and convex collision checking · 840 citations · 2014
8. A Survey of Research on Cloud Robotics and Automation · 826 citations · 2015
9. The YCB Object and Model Set: Towards common benchmarks for manipulation research · 807 citations · 2015
10. DeepMimic · 802 citations · 2018

Contact & Links

Available for collaboration