• Laser & Optoelectronics Progress
  • Vol. 62, Issue 8, 0815010 (2025)
Lei Xiao, Peng Hu*, and Junjie Ma
Author Affiliations
  • College of Artificial Intelligence, Anhui University of Science & Technology, Huainan 232001, Anhui , China
  • show less
    DOI: 10.3788/LOP241870 Cite this Article Set citation alerts
    Lei Xiao, Peng Hu, Junjie Ma. Self-Supervised Monocular Depth Estimation Model Based on Global Information Correlation Under Influence of Local Attention[J]. Laser & Optoelectronics Progress, 2025, 62(8): 0815010 Copy Citation Text show less
    Overall framework of proposed network
    Fig. 1. Overall framework of proposed network
    Internal structure of the DepthNet
    Fig. 2. Internal structure of the DepthNet
    Schematic diagram of the internal structure for the joint DSConv & Transformer block
    Fig. 3. Schematic diagram of the internal structure for the joint DSConv & Transformer block
    Qualitative comparison between the proposed method and other methods on the KITTI dataset
    Fig. 4. Qualitative comparison between the proposed method and other methods on the KITTI dataset
    Qualitative comparison of the proposed method and other methods on the Cityscapes dataset
    Fig. 5. Qualitative comparison of the proposed method and other methods on the Cityscapes dataset
    Model complexity and speed evaluation. (a) Params; (b) FLOPs
    Fig. 6. Model complexity and speed evaluation. (a) Params; (b) FLOPs
    MethodDataAbs RelSq RelRMSERMSE logδ1δ2δ3
    Monodepth222M0.1150.9034.8630.1930.8770.9590.981
    SGDepth32M+Se0.1130.8354.6930.1910.8790.9610.981
    SAFENet33M+Se0.1120.7884.5820.1870.8780.9630.983
    R-MSFM624M0.1120.8064.7040.1910.8780.9600.981
    PackNet34M0.1080.7274.4260.1840.8850.9630.983
    HR-Depth9M0.1090.7924.6320.1850.8840.9620.983
    Ref. [35M0.1060.8614.6990.1850.8890.9620.982
    CADepth23M0.1050.7694.5350.1810.8920.9640.983
    DIFFNet36M0.1020.7494.4450.1790.8970.9650.983
    ProposedM0.1020.7344.4350.1770.8950.9660.984
    Table 1. Quantitative comparison between the proposed method and other monocular depth estimation methods on the KITTI dataset
    MethodDataAbs RelSq RelRMSERMSE logδ1δ2δ3
    Monodepth222M0.1551.9006.7960.2090.8130.9430.979
    DIFFNet36M0.1401.5716.2980.1920.8370.9500.983
    ProposedM0.1251.3005.7900.1770.8610.9580.986
    Table 2. Quantitative comparison between the proposed method and other monocular depth estimation methods on the Cityscapes dataset
    DSConvSConvShuffleParams /106Inference time /msAbs RelSq RelRMSERMSE logδ1δ2δ3
    3.364.70.1240.9014.7980.2050.8490.9410.962
    4.206.00.1050.7494.5010.1800.8910.9600.981
    3.364.80.1200.8504.6020.1950.8600.9530.972
    7.208.50.1000.7204.4000.1750.9000.9700.986
    4.205.90.1020.7344.4350.1770.8950.9660.984
    Table 3. Ablation experimental results
    Lei Xiao, Peng Hu, Junjie Ma. Self-Supervised Monocular Depth Estimation Model Based on Global Information Correlation Under Influence of Local Attention[J]. Laser & Optoelectronics Progress, 2025, 62(8): 0815010
    Download Citation