学习 分类论文(27,419)
清除筛选 ✕A Machine-to-Machine Knowledge-Guided LLM Agent for Generalizable Radiotherapy Treatment Planning
Md Mainul Abrar, Xun Jia, Yujie Chi
2026
Too Much of a Good Thing: When sim2real Efforts Impede Policy Learning (And What to Do About It)
Kyle Morgenstein, Bharath Masetty, Stephen Welch 等 4 位作者
2026
Infeasible optimization problems and the hierarchical augmented Lagrangian method in imitation learning
Roland Andrews, Justin Carpentier, Ajay Sathya
2026
Shape Your Body: Value Gradients for Multi-Embodiment Robot Design
Nico Bohlinger, Jan Peters
2026
DriveAnchor: Progressive Anchor-based Flow Learning for Autonomous Driving Planning
Limin Yan, Haoyun Tang, Yutao Qiu 等 5 位作者
2026
DRL-Based Pose Control for Double-Ackermann Robots Under Actuation Uncertainties
Oussama Zaim, Mélodie Daniel, Aly Magassouba 等 5 位作者
2026
Enhancing Human-Likeness in Reinforcement Learning Agents via Hierarchical Macro Action Quantization
Usman Nizamani, M. Shaheer Luqman, Fawad Javed Fateh 等 7 位作者
2026
FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance
Sungha Kim, Gawon Lee, Jusuk Lee 等 6 位作者
2026
When are LLMs Sufficient Policy Optimizers for Sequential RL Tasks?
Stephane Hatgis-Kessell, Emma Brunskill
2026
ZAPS-DA: Zero-Phase Action Policy Smoothing with Decoupled Actor for Continuous Control in Reinforcement Learning
Faiq Shamass
2026
World Models: A Comprehensive Survey of Architectures, Methodologies, Reasoning Paradigms, and Applications
Arif Hassan Zidan, Yi Pan, Hanqi Jiang 等 20 位作者
2026
超越GPU主导范式的机器人强化学习异构架构
Yufei Jia, Zhanxiang Cao, Mingrui Yu 等 20 位作者
2026
基于评论引导的样本高效扩散强化学习
Shutong Ding, Zejia Zhong, Zhongyi Wang 等 7 位作者
2026
BuilDyn:激励驱动数据生成用于建筑热动力学建模与控制
Felix Koch, Thomas Krug, Fabian Raisch 等 5 位作者
2026
基于在线增量学习的定制腕带关节角度估计
Shuo Wang, Xiaobin Chen, Xiaoming Tao
2026
基于动量的低排放交通信号控制奖励设计
Chinmay Mundane, Amith Manoharan, Arun Singh
2026
访问集至关重要:为可扩展的权重空间模型合并预算专家读取
Yuanyi Wang, Yanggan Gu, Su Lu 等 8 位作者
2026
MiraBench:评估机器人世界模型中动作条件可靠性
Tianzhuo Yang, Zihan Shen, Zirui Mi 等 10 位作者
2026
主体感对学习结构化规律的影响:一项人工语法学习研究
Haxhi R, Woźniak M, Wykowska A
Psychological research · 2026
CA-AC-MPC:基于CUDA加速的演员-评论家模型预测控制
Antoonio Buo, Vittorio Cammarota, Michele Avagnale 等 6 位作者
2026
线弧增材制造焊道几何控制中的学习与自适应
Chen-Lung Lu, John Wen
2026
基于局部可观性和移动时域估计的前馈神经网络训练
Yi Yang, Victor G. Lopez, Matthias A. Müller
2026
Gamma-World: 超越双玩家的生成式多智能体世界建模
Fangfu Liu, Kai He, Tianchang Shen 等 10 位作者
2026
SARAD:基于大语言模型的安全感知混合强化学习与碰撞预测自动驾驶方法
Kangyu Wu, Peng Cui, Guoxi Chen 等 4 位作者
2026