学习 分类论文(27,419)
清除筛选 ✕TOPPO: Rethinking PPO for Multi-Task Reinforcement Learning with Critic Balancing
Yuanpeng Li, Gefei Lin, Annie Qu 等 4 位作者
2026
Beyond Prediction: Interval Neural Networks for Uncertainty-Aware System Identification
Mehmet Ali Ferah, Tufan Kumbasar
2026
Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance
Adam Haroon, Erick J. Rodríguez-Seda, Cody Fleming 等 4 位作者
2026
RankQ: Offline-to-Online Reinforcement Learning via Self-Supervised Action Ranking
Andrew Choi, Wei Xu
2026
Newton's Lantern: A Reinforcement Learning Framework for Finetuning AC Power Flow Warm Start Models
Shourya Bose, Helgi Hilmarsson, Dhruv Suri
2026
Variational Inference for Lévy Process-Driven SDEs via Neural Tilting
Yaman Kindap, Manfred Opper, Benjamin Dupuis 等 5 位作者
2026
xApp Empowered Resource Management for Non-Terrestrial Users in 5G O-RAN Networks
Mohammed M. H. Qazzaz, Syed Ali Zaidi, Aubida A. Al-Hameed 等 5 位作者
2026
Demystifying Deep Reinforcement Learning: A Neuro-Symbolic Framework for Interpretable Open RAN Automation
Jie Lu, Peihao Yan, Pang-Ning Tan 等 5 位作者
2026
Hierarchical End-to-End Taylor Bounds for Complete Neural Network Verification
Taha Entesari, Mahyar Fazlyab
2026
Priority-Driven Control and Communication in Decentralized Multi-Agent Systems via Reinforcement Learning
Qingyun Guo, Junyi Shi, Tomasz Piotr Kucner 等 4 位作者
2026
Data-Asymmetric Latent Imagination and Reranking for 3D Robotic Imitation Learning
Lianghao Luo, Xizhou Bu, Ruyan Liu 等 8 位作者
2026
Plan in Sandbox, Navigate in Open Worlds: Learning Physics-Grounded Abstracted Experience for Embodied Navigation
Zhixuan Shen, Jiawei Du, Ziyu Guo 等 8 位作者
2026
Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving
Aron Distelzweig, Faris Janjoš, Andreas Look 等 10 位作者
2026
Harnessing Floating Car Data, Traffic Camera Observations, and Network Flow Analysis for Traffic Volume Estimation
Antonina Kosikova, Mehmet Kerem Turkcan, Ahmed Darrat 等 4 位作者
2026
Geometric Pareto Control: Riemannian Gradient Flow of Energy Function via Lie Group Homotopy
Tong Wu
2026
Dynamic Scheduling of a Parallel-Server Queueing System: A Computational Method for High-Dimensional Problems
Baris Ata, Ebru Kasikaralar
2026
ASACK : Adaptive Safe Active Continual Koopman Learning for Uncertain Systems with Contractive Guarantees
Chandan Kumar Sah, Rajpal Singh, Jishnu Keshavan
2026
Trust Region Inverse Reinforcement Learning: Explicit Dual Ascent using Local Policy Updates
Anish Diwan, Davide Tateo, Christopher E. Mower 等 6 位作者
2026
PolarNet: Single-Minima Neural Network for Modeling Lyapunov Functions
Yuan Zhong, Jiaxin Cheng, Hefu Ye 等 4 位作者
2026
SHIELD: Scalable Optimal Control with Certification using Duality and Convexity
Hansung Kim, Siddharth H. Nair, Francesco Borrelli
2026
Data-Driven Inverse Reinforcement Learning of Linear Systems with Model Uncertainty: A Convex Optimization View
Duc Cuong Nguyen, Phuong Nam Dao
2026
Beyond Self-Play: Hierarchical Reasoning for Continuous Motion in Closed-Loop Traffic Simulation
Weifan Zhang, Xiaofeng Zhao, Adel Bazzi 等 6 位作者
2026
FactoryNet:面向工业时间序列基础模型的大规模数据集
Karim Othman, Jonas Petersen, Matei Ignuta-Ciuncanu 等 8 位作者
2026
面向场景图生成的依赖感知离散扩散模型
Rajalaxmi Rajagopalan, Romit Roy Choudhury
2026