学习 分类论文(27,419)
清除筛选 ✕ELVIS: Ensemble-Calibrated Latent Imagination for Long-Horizon Visual MPC
Yurui Du, Pinhao Song, Yutong Hu 等 4 位作者
2026
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving
Huimin Wang, Yue Wang, Bihao Cui 等 10 位作者
2026
Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination
Jonathan Spieler, Sven Behnke
2026
Counter-Dyna: Data-Efficient RL-Based HVAC Control using Counterfactual Building Models
Jan Marco Ruiz de Vargas, Fabian Raisch, Zoltan Nagy 等 5 位作者
2026
CRAFT: Counterfactual-to-Interactive Reinforcement Fine-Tuning for Driving Policies
Keyu Chen, Nanfei Ye, Yida Wang 等 7 位作者
2026
Queue-Aware and Resilient Routing in LEO Satellite Networks Using Multi-Agent Reinforcement Learning
Mudassar Liaq, Mahyar Tajeri, Peng Hu
2026
基于动态解耦球面径向挤压的约束增强强化学习
Qijun Liao, Zhaoxin Yu, Jue Yang
2026
SOAR: Real-Time Joint Optimization of Order Allocation and Robot Scheduling in Robotic Mobile Fulfillment Systems
Yibang Tang, Yifan Yang, Jingyuan Wang 等 5 位作者
2026
Will the Carbon Border Adjustment Mechanism Impact European Electricity Prices? A GNN-Based Network Analysis
Jiachen Shen, Jian Shi, Dan Wang 等 4 位作者
2026
On Surprising Effects of Risk-Aware Domain Randomization for Contact-Rich Sampling-based Predictive Control
Sergio A. Esteban, Junheng Li, Vince Kurtz 等 4 位作者
2026
Enhancing RL Generalizability in Robotics through SHAP Analysis of Algorithms and Hyperparameters
Lingxiao Kong, Cong Yang, Oya Deniz Beyan 等 4 位作者
2026
Per-Platform GPIO Overhead in Hardware-Validated Edge ML Inference Timing
Akul Swami, Nikhil Chougule
2026
Beyond Specialization: Robust Reinforcement Learning Navigation via Procedural Map Generators
Christian Jestel, Nicolas Bach, Marvin Wiedemann 等 5 位作者
2026
Set-Based Training of Neural Barrier Certificates for Safety Verification of Dynamical Systems
Miriam Kranzlmüller, Lukas Koller, Tobias Ladner 等 4 位作者
2026
EdgeLPR: On the Deep Neural Network trade-off between Precision and Performance in LiDAR Place Recognition
Pierpaolo Serio, Hetian Wang, Zixiang Wei 等 7 位作者
2026
Do We Really Need Immediate Resets? Rethinking Collision Handling for Efficient Robot Navigation
Shanze Wang, Xinming Zhang, Siwei Cheng 等 6 位作者
2026
Training Non-Differentiable Networks via Optimal Transport
An T. Le
2026
Joint Energy Management and Coordinated AIGC Workload Scheduling for Distributed Data Centers: A Diffusion-Aided Reward Shaping Approach
Yang Fu, Peng Qin, Liming Chen 等 6 位作者
2026
Zero-Shot, Safe and Time-Efficient UAV Navigation via Potential-Based Reward Shaping, Control Lyapunov and Barrier Functions
Ashik Abrar Naeem, Mohammad Ariful Haque
2026
Analytic Bridge Diffusions for Controlled Path Generation
Michael Chertkov
2026
A Universal Optimal Control Strategy for a Tailsitter UAV
Animesh Kumar Shastry, Mangal Kothari
2026
An Efficient Metric for Data Quality Measurement in Imitation Learning
Noushad Sojib, Momotaz Begum
2026
Good in Bad (GiB): Sifting Through End-user Demonstrations for Learning a Better Policy
Noushad Sojib, Ola Ghattas, Momotaz Begum
2026
Dynamics Distillation for Efficient and Transferable Control Learning
Xunjiang Gu, Kashyap Chitta, Mahsa Golchoubian 等 5 位作者
2026