• Laser & Optoelectronics Progress
  • Vol. 62, Issue 2, 0212001 (2025)
Yuhan Zhang1、2、3, Miaohua Huang1、2、3、*, Gengyao Chen1、2、3, Yanzhou Li1、2、3, and Yiming Wu1、2、3
Author Affiliations
  • 1Hubei Key Laboratory of Advanced Technology for Automotive Components, Wuhan University of Technology, Wuhan 430070, Hubei , China
  • 2Hubei Collaborative Innovation Center for Automotive Components Technology, Wuhan University of Technology, Wuhan 430070, Hubei , China
  • 3Hubei Research Center for New Energy & Intelligent Connected Vehicle, Wuhan University of Technology, Wuhan 430070, Hubei , China
  • show less
    DOI: 10.3788/LOP240912 Cite this Article Set citation alerts
    Yuhan Zhang, Miaohua Huang, Gengyao Chen, Yanzhou Li, Yiming Wu. Multiview 3D Object Detection Based on Improved DETR3D[J]. Laser & Optoelectronics Progress, 2025, 62(2): 0212001 Copy Citation Text show less

    Abstract

    To overcome the limitations of current multicamera 3D object detection methods, which often struggle to balance precision and computational speed, we propose an enhanced version of DETR3D. The algorithm framework is based on the encoder-decoder architecture of DETR3D. We incorporate a 3D position encoder alongside the image feature extraction branch to enhance image features. Object queries are initialized with two components, representing the object's bounding box and instance features. In the decoder stage, we introduce a multiscale adaptive attention mechanism based on Euclidean distance, allowing the algorithm to effectively capture multiscale information in 3D space, which significantly improves detection performance for complex and diverse objects in autonomous driving scenarios. During feature sampling, we integrate temporal information to align features across consecutive frames, improving detection accuracy. Additionally, multipoint sampling is employed to strengthen the robustness of the sampling process. Experiments conducted on the nuScenes dataset indicate that compared to the baseline algorithm, our proposed approach achieves a 17.1% improvement in detection accuracy and a 4.22-fold increase in computational speed. Moreover, it proves effective in detecting objects even in occluded environments.
    Yuhan Zhang, Miaohua Huang, Gengyao Chen, Yanzhou Li, Yiming Wu. Multiview 3D Object Detection Based on Improved DETR3D[J]. Laser & Optoelectronics Progress, 2025, 62(2): 0212001
    Download Citation