• Laser & Optoelectronics Progress
  • Vol. 60, Issue 16, 1612002 (2023)
Fengsui Wang1,2,3,*, Lei Xiong1,2,3, and Yaping Qian1,2,3
Author Affiliations
  • 1School of Electrical Engineering, Anhui Polytechnic University, Wuhu 241000, Anhui, China
  • 2Anhui Key Laboratory of Detection Technology and Energy Saving Devices, Wuhu 241000, Anhui, China
  • 3Key Laboratory of Advanced Perception and Intelligent Control of High-End Equipment, Ministry of Education, Wuhu 241000, Anhui, China
  • show less
    DOI: 10.3788/LOP222627 Cite this Article Set citation alerts
    Fengsui Wang, Lei Xiong, Yaping Qian. Multiscale Monocular Three-Dimensional Object Detection Algorithm Incorporating Instance Depth[J]. Laser & Optoelectronics Progress, 2023, 60(16): 1612002 Copy Citation Text show less

    Abstract

    To solve the problems of lack of depth information and poor detection accuracy in conventional monocular three-dimensional (3D) target detection algorithms, an algorithm for multiscale monocular 3D target detection incorporating instance depth is proposed. First, to enhance the processing ability of the model for targets with different scales, a multiscale sensing module based on hole convolution is designed. Then, the depth features containing multiscale information are refined from both spatial and channel directions to remove the inconsistencies among different scale feature maps. Further, the instance depth information is used as an auxiliary learning task to enhance the spatial depth characteristics of 3D objects, and the sparse instance depth is used to monitor the auxiliary task, thereby improving the model's 3D perception. Finally, the proposed algorithm is tested and validated on the KITTI dataset. The experimental results show that the average accuracy of the proposed algorithm in the vehicle category is 5.27% higher than that of the baseline algorithm, indicating that the proposed algorithm effectively improves the detection performance compared with the conventional monocular 3D target detection algorithms.
    Fengsui Wang, Lei Xiong, Yaping Qian. Multiscale Monocular Three-Dimensional Object Detection Algorithm Incorporating Instance Depth[J]. Laser & Optoelectronics Progress, 2023, 60(16): 1612002
    Download Citation