CoDRMA: Collaborative Depth Refinement via Dual-Mask and Dual-Attention for Bird’s Eye View Collaborative 3D Object Detection
文献类型:会议论文
作者 | Yang,Kang3; Wang, Yongcai3; Han, Yunjun1![]() |
出版日期 | 2024 |
会议日期 | 2024年8月28 |
会议地点 | Bari,Italy |
英文摘要 | In collaborative perception, camera-based approach is more informative and economical than Lidar-based approach. However, currently, camera-based methods still have a significant performance gap compared to the Lidar-based approach due to the difficulty and uncertainty involved in depth estimation. This paper introduces a strategy that refines depth estimation using foreground and background information, which empowers accurate Bird’s Eye View (BEV) collaborative 3D detection by multi-agents. Our strategy encompasses two stages: Initially, we introduce the Dual-Mask to enhance depth estimation and employ Bird’s Eye View (BEV) paradigms for integrating multi-viewpoint data, facilitating a comprehensive scene analysis. In the second stage, we generate pseudo-images by fusing depth and masks as auxiliary messages. A Dual- Attention scheme is proposed, which leverages multi-agent communication to augment auxiliary insights and further refine depth estimations. By refining the depth information twice, our method effectively improves BEV-based collaboration 3D object detection accuracy especially the occlused and long distance objects. Experiments on the OPV2V dataset show that our method achieves state-of-the-art performance in 3D object detection task among known camera-based methods, narrowing the gap with Lidar-based methods. Codes will be made available. |
源URL | [http://ir.ia.ac.cn/handle/173211/57370] ![]() |
专题 | 自动化研究所_复杂系统管理与控制国家重点实验室_先进控制与自动化团队 |
通讯作者 | Han, Yunjun |
作者单位 | 1.Institute of Automation, Chinese Academy of Science 2.Department of Automation, Tsinghua University 3.School of Information, Renmin University of China |
推荐引用方式 GB/T 7714 | Yang,Kang,Wang, Yongcai,Han, Yunjun,et al. CoDRMA: Collaborative Depth Refinement via Dual-Mask and Dual-Attention for Bird’s Eye View Collaborative 3D Object Detection[C]. 见:. Bari,Italy. 2024年8月28. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。