TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection
Lei Cheng, Siyang Cao
- 发表年份
- 2025
- 引用次数
- 12
摘要
Despite significant advancements in environment perception capabilities for autonomous driving and intelligent robotics, cameras and LiDARs remain notoriously unreliable in low-light conditions and adverse weather, which limits their effectiveness. Radar serves as a reliable and low-cost sensor that can effectively complement these limitations. However, radar-based object detection has been underexplored due to the inherent weaknesses of radar data, such as low resolution, high noise, and lack of visual information. In this article, we present TransRAD, a novel 3-D radar object detection model designed to address these challenges by leveraging the retentive vision transformer (RMT) to more effectively learn features from information-dense radar range-Azimuth–Doppler (RAD) data. Our approach leverages the retentive Manhattan self-attention (MaSA) mechanism provided by RMT to incorporate explicit spatial priors, thereby enabling more accurate alignment with the spatial saliency characteristics of radar targets in RAD data and achieving precise 3-D radar detection across RAD dimensions. Furthermore, we propose location-aware nonmaximum suppression (LA-NMS) to effectively mitigate the common issue of duplicate bounding boxes in deep radar object detection. The experimental results demonstrate that TransRAD outperforms state-of-the-art (SOTA) methods in both 2-D and 3-D radar detection tasks, achieving higher accuracy, faster inference speed, and reduced computational complexity. Code is available at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/radar-lab/TransRAD</uri>.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002