Markov process

Related papers: 20

Most cited papers

Point-based value iteration: an anytime algorithm for POMDPs

Joëlle Pineau, Geoff Gordon, Sebastian Thrun

Citations: 934 • 2003

Reinforcement learning for robots using neural networks

Long-Ji Lin

Citations: 887 • 1992

SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces

Hanna Kurniawati, David Hsu, Wee Sun Lee

Citations: 782 • 2008

Learning to Track: Online Multi-object Tracking by Decision Making

Yu Xiang, Alexandre Alahi, Silvio Savarese

Citations: 716 • 2015

Learning policies for partially observable environments: Scaling up

Michael L. Littman, Anthony R. Cassandra, Leslie Pack Kaelbling

Citations: 662 • 1995

Continuous-Time Markov Jump Linear Systems

O.L.V. Costa, Marcelo D. Fragoso, Marcos G. Todorov

Citations: 496 • 2012

Probabilistic robot navigation in partially observable environments

Reid Simmons, Sven Koenig

Citations: 488 • 1995

Acting under uncertainty: discrete Bayesian models for mobile-robot navigation

Anthony R. Cassandra, Leslie Pack Kaelbling, James Kurien

Citations: 468 • 2002

Anytime Point-Based Approximations for Large POMDPs

Joëlle Pineau, Geoff Gordon, Sebastian Thrun

Citations: 373 • 2006

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Shai Shalev‐Shwartz, Shaked Shammah, Amnon Shashua

Citations: 367 • 2016

Intention-aware online POMDP planning for autonomous driving in a crowd

Haoyu Bai, Shaojun Cai, Nan Ye, David Hsu, Wee Sun Lee

Citations: 331 • 2015

Motion planning under uncertainty using iterative local optimization in belief space

Jur van den Berg, Sachin Patil, Ron Alterovitz

Citations: 305 • 2012

Finite-Time Sliding-Mode Control of Markovian Jump Cyber-Physical Systems Against Randomly Occurring Injection Attacks

Zhiru Cao, Yugang Niu, Jun Song

Citations: 279 • 2019

Autonomous helicopter control using reinforcement learning policy search methods

J. Andrew Bagnell, Jeff Schneider

Citations: 278 • 2002

Finding Approximate POMDP solutions Through Belief Compression

Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun

Citations: 253 • 2005

Temporal abstraction in reinforcement learning

Doina Precup, Richard S. Sutton

Citations: 247 • 2000

Point-Based Value Iteration for Continuous POMDPs

Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart

Citations: 246 • 2006

Parameter-exploring policy gradients

Frank Sehnke, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters, Jürgen Schmidhuber

Citations: 245 • 2009

A Gentle Introduction to Reinforcement Learning and its Application in Different Fields

Muddasar Naeem, Syed Tahir Hussain Rizvi, Antonio Coronato

Citations: 241 • 2020

Planning under Uncertainty for Robotic Tasks with Mixed Observability

Sylvie C. W. Ong, Shao Wei Png, David Hsu, Wee Sun Lee

Citations: 238 • 2010