PERCEPTION
通过场景自我探索进行视图规划
Kangrui Wang, Linjie Li, Zhengyuan Yang, Shiqi Chen, Zihan Wang, Li Fei-Fei, Jiajun Wu, Leonidas Guibas, Lijuan Wang, Manling Li
- 发表年份
- 2026
- 访问权限
- 开放获取
摘要
本文提出ViewSuite环境,评估VLM在多步视图规划中的能力,发现其存在规划组合鸿沟。通过迭代自我探索与视图图蒸馏框架,显著提升了模型在交互式视图规划中的性能。
关键词
view planningself-explorationVLM3D scene understandingmulti-turn planning
相关论文
PERCEPTION
📊 22,245 引用
Artificial intelligence: a modern approach
1995
PERCEPTION
📊 14,348 引用
Are we ready for autonomous driving? The KITTI vision benchmark suite
Andreas Geiger, P Lenz, R. Urtasun
2012
PERCEPTION
开放获取📊 9,777 引用
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martı́n Abadi, Ashish Agarwal, Paul Barham 等 20 位作者
2016
PERCEPTION
📊 9,681 引用
Vision meets robotics: The KITTI dataset
Andreas Geiger, Philip Lenz, Christoph Stiller 等 4 位作者
2013