






















Abstract:Autonomous vehicles equipped with robust onboard perception, localization, and planning still face limitations in occlusion and non-line-of-sight (NLOS) scenarios, where delayed reactions can increase collision risk. We propose CooperDrive, a cooperative perception framework that augments situational awareness and enables earlier, safer driving decisions. CooperDrive offers two key advantages: (i) each vehicle retains its native perception, localization, and planning stack, and (ii) a lightweight object-level sharing and fusion strategy bridges perception and planning. Specifically, CooperDrive reuses detector Bird's-Eye View (BEV) features to estimate accurate vehicle poses without additional heavy encoders, thereby reconstructing BEV representations and feeding the planner with low latency. On the planning side, CooperDrive leverages the expanded object set to anticipate potential conflicts earlier and adjust speed and trajectory proactively, thereby transforming reactive behaviors into predictive and safer driving decisions. Real-world closed-loop tests at occlusion-heavy NLOS intersections demonstrate that CooperDrive increases reaction lead time, minimum time-to-collision (TTC), and stopping margin, while requiring only 90 kbps bandwidth and maintaining an average end-to-end latency of 89 ms.
| Comments: | Accepted at ICRA 2026 |
| Subjects: | Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV) |
| Cite as: | arXiv:2604.14454 [cs.RO] |
| (or arXiv:2604.14454v1 [cs.RO] for this version) | |
| https://doi.org/10.48550/arXiv.2604.14454 arXiv-issued DOI via DataCite (pending registration) |
From: Deyuan Qu [view email]
[v1]
Wed, 15 Apr 2026 22:15:50 UTC (6,466 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。