• Spacecraft Recovery & Remote Sensing
  • Vol. 45, Issue 2, 153 (2024)
Yuhao WEI1、2, Song HUANG1、*, and Yani HUANG3
Author Affiliations
  • 1Collage of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210000, China
  • 231678 Corps of PLA, Luzhou 646000, China
  • 331305 Corps of PLA, Chengdu 610000, China
  • show less
    DOI: 10.3969/j.issn.1009-8518.2024.02.015 Cite this Article
    Yuhao WEI, Song HUANG, Yani HUANG. Multi-Scale Object Detection in Satellite Images Based on Improved YOLOv7[J]. Spacecraft Recovery & Remote Sensing, 2024, 45(2): 153 Copy Citation Text show less
    References

    [1] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich Feature Hierarchies f Accurate Object Detection Semantic Segmentation[C]2024 IEEE Conference on Computer Vision Pattern Recognition, June 2328, 2014, Columbus, OH, USA. IEEE, 2014: 580587. DOI: 10.1109cvpr.2014.81.

    [2] GIRSHICK R. Fast RCNN[C]2015 IEEE International Conference on Computer Vision (ICVV), December 0713, 2015, Santiago, Chile. IEEE, 2015: 14401448. DOI: 10.1109iccv.2015.169.

    [3] S REN, K HE, R GIRSHICK et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 39, 1137-1149(2017).

    [4] REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, RealTime Object Detection[C]2016 IEEE Conference on Computer Vision Pattern Recognition (CVPR), June 2730, 2016, Las Vegas, NV, USA. IEEE, 2016: 779788. DOI: 10.1109CVPR.2016.91.

    [5] LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single Shot Multibox Detect[C]2016 European Conference on Computer Vision (ECCV), October 816, 2016, Amsterdam, herls. Springer, 2016: 2137. DOI: 10.10079783319464480_2.

    [6] YU F, CHEN H, WANG X, et al. Bdd100k: A Diverse Driving Dataset f Jeterogeneous Multitask Learning[C]2020 IEEECVF Conference on Computer Vision Pattern Recognition, 1319 June, 2020, Seattle, WA, USA. IEEE, 2020: 26362645. DOI: 10.1109CVPR42600.2020.00271.

    [7] X HUANG, P WANG, X CHENG et al. The Apolloscape Open Dataset for Autonomous Driving and Its Application. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 2702-2719(2019).

    [8] N KUMAR, R VERMA, S SHARMA et al. A Dataset and a Technique for Generalized Nuclear Segmentation for Computational Pathology. IEEE Transactions on Medical Imaging, 36, 1550-1560(2017).

    [9] WANG J, YANG W, GUO H, et al. Tiny Object Detection in Aerial Images[C]2020 25th International Conference on Pattern Recognition (ICPR), January 1015, 2021, Milan, Italy. IEEE, 2021: 37913798. DOI: 10.1109ICPR48806.2021.9413340.

    [10] Y ZHANG, Y YUAN, Y FENG et al. Hierarchical and Robust Convolutional Neural Network for Very High-resolution Remote Sensing Object Detection. IEEE Transactions on Geoscience and Remote Sensing, 57, 5535-5548(2019).

    [12] WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional Block Attention Module[EBOL]. [20230106]. https:doi.g10.48550arXiv.1807.06521.

    [14] HOWARD A, SLER M, CHU G, et al. Searching f Mobilev3[C]2019 IEEECVF International Conference on Computer Vision (ICCV), October 27November 02, 2019, Seoul, Kea (South). IEEE, 2019: 13141324. DOI: 10.1109ICCV.2019.00140.

    [15] ETTEN V A. You Only Look Twice: Rapid MultiScale Object Detection in Satellite Imagery[EBOL]. [20230106]. https:arXiv preprint arXiv:1805.09512, 2018. DOI: 10.48550arXiv.1805.09512.

    [16] WANG C Y, BOCHKOVSKIY A, LIAO H . YOLOv7: Trainable Bagoffreebies Sets New Stateoftheart f Realtime Object Detects[EBOL]. [20230106].https:arXiv preprint arXiv:2207.02696, 2022. DOI: 10.48550arXiv.2207.02696.

    [17] LIU Z, MAO H, WU C, et al. A Conv f the 2020s[C]2022 IEEECVF Conference on Computer vision Pattern Recognition, 1824 June, 2022, New leans, LA, USA. IEEE, 2022: 1197611986. DOI: 10.1109CVPR52688.2022.01167.

    [18] LIU S, QI L, QIN H, et al. Path Aggregation wk f Instance Segmentation[C]2018 IEEECVF Conference on Computer Vision Pattern Recognition, June 1823, 2018, Salt Lake City, UT, USA. IEEE, 2018: 87598768. DOI: 10.1109CVPR.2018.00913.

    [19] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An Image is Wth 16x16 Wds: Transfmers f Image Recognition at Scale[EBOL]. [20230106]. https:arXiv preprint arXiv:2010.11929, 2020. DOI: 10.48550arXiv.2010.11929.

    [20] LIU Z, LIN Y, CAO Y, et al. Swin Transfmer: Hierarchical Vision Transfmer Using Shifted Windows[C]2021 IEEECVF International Conference on Computer Vision, October 1017, 2021, Montreal, QC, Canada. IEEE, 2021: 1001210022. DOI: 10.1109ICCV48922.2021.00986.

    [21] NEUBECK A, GOOL L V. Efficient NonMaximum Suppression[C]18th International Conference on Pattern Recognition, August 2024, 2006, Hongkong, China. IEEE, 2006. DOI: 10.1109ICPR.2006.479.

    [22] X LU, Y ZHANG, Y YUAN et al. Gated and Axis-concentrated Localization Network for Remote Sensing Object Detection. IEEE Transactions on Geoscience and Remote Sensing, 58, 179-192(2019).

    Yuhao WEI, Song HUANG, Yani HUANG. Multi-Scale Object Detection in Satellite Images Based on Improved YOLOv7[J]. Spacecraft Recovery & Remote Sensing, 2024, 45(2): 153
    Download Citation