Home /Research /Real-Time 2D LiDAR Object Detection Using Three-Frame RGB Scan Encoding

PERCEPTION

Real-Time 2D LiDAR Object Detection Using Three-Frame RGB Scan Encoding

Soheil Behnam Roudsari, Alexandre S. Brandão, Felipe N. Martins

Year: 2026
Access: Open access

Abstract

Indoor service robots need perception that is robust, more privacy-friendly than RGB video, and feasible on embedded hardware. We present a camera-free 2D LiDAR object detection pipeline that encodes short-term temporal context by stacking three consecutive scans as RGB channels, yielding a compact YOLOv8n input without occupancy-grid construction while preserving angular structure and motion cues. Evaluated in Webots across 160 randomized indoor scenarios with strict scenario-level holdout, the method achieves 98.4% [email protected] (0.778 [email protected]:0.95) with 94.9% precision and 94.7% recall on four object classes. On a Raspberry Pi 5, it runs in real time with a mean post-warm-up end-to-end latency of 47.8ms per frame, including scan encoding and postprocessing. Relative to a closely related occupancy-grid LiDAR-YOLO pipeline reported on the same platform, the proposed representation is associated with substantially lower reported end-to-end latency. Although results are simulation-based, they suggest that lightweight temporal encoding can enable accurate and real-time LiDAR-only detection for embedded indoor robotics without capturing RGB appearance.

Keywords

eess.SPcs.CVcs.LGcs.RO

Real-Time 2D LiDAR Object Detection Using Three-Frame RGB Scan Encoding

Abstract

Keywords

Related papers

Artificial intelligence: a modern approach

Are we ready for autonomous driving? The KITTI vision benchmark suite

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Vision meets robotics: The KITTI dataset