LEARNING

Training-Free Imitation Learning with Closed-Form Diffusion Policies

Raghav Mishra, Ian R. Manchester

Publication year
2026
Access
Open access

Abstract

While diffusion-based policies offer impressive performance and expressivity, their long offline training slows down the data collection and policy deployment loop. We introduce Closed-Form Diffusion Policies (CFDP), a class of training-free diffusion-based policies for imitation learning that use the closed-form score derived from the demonstration dataset. In hardware experiments, we deploy CFDP with real-time inference on a mobile CPU, showing that it can successfully perform imitation directly from the dataset in milliseconds and with faster inference than neural diffusion policies. In experiments on imitation learning benchmarks, we show that CFDP is competitive with neural baselines that require hours of training, providing a favorable tradeoff between training time and performance. Finally, we show how closed-form diffusion policies act as a composable primitive that enables data-driven inference-time editing of pre-trained neural diffusion policies, including policy guidance and novel demonstration augmentation.
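
To illustrate what a "closed-form score derived from the demonstration dataset" can look like, the sketch below computes the standard score of a Gaussian-smoothed empirical distribution: if each data point x_i is noised as alpha * x_i + sigma * eps, the noised marginal is a Gaussian mixture, and its score is a softmax-weighted average over the data. This is a generic textbook construction, not necessarily the exact CFDP formulation; the function name `closed_form_score` and the parameters `alpha` and `sigma` are assumptions introduced for illustration, and conditioning on observations is omitted.

```python
import numpy as np

def closed_form_score(x, data, alpha, sigma):
    """Score (gradient of log-density) at x of the mixture
    p(x) = (1/N) * sum_i Normal(x; alpha * data[i], sigma^2 * I).

    Hypothetical helper for illustration only; no training is involved,
    the dataset itself defines the score.
    """
    x = np.asarray(x, dtype=float)        # query point, shape (d,)
    data = np.asarray(data, dtype=float)  # demonstrations, shape (N, d)

    # Vector from the query to each scaled data point, shape (N, d).
    diffs = alpha * data - x

    # Log of each component's unnormalized responsibility.
    logits = -np.sum(diffs ** 2, axis=1) / (2.0 * sigma ** 2)
    logits -= logits.max()                # subtract max for numerical stability

    # Softmax weights over data points.
    weights = np.exp(logits)
    weights /= weights.sum()

    # Weighted average of per-component Gaussian scores.
    return weights @ diffs / sigma ** 2
```

With a single data point the mixture reduces to one Gaussian, so the score is simply `(alpha * x_0 - x) / sigma**2`; with many points and small `sigma`, the weights concentrate on the nearest demonstrations.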

Keywords

cs.RO, cs.LG
