• Infrared and Laser Engineering
  • Vol. 53, Issue 11, 20240256 (2024)
Jiageng SANG1,2, Zhijia ZHANG1, Chuanmin XIAO3, Haibo LUO2, and Junyao ZHANG4
Author Affiliations
  • 1College of Artificial Intelligence, Shenyang University of Technology, Shenyang 110870, China
  • 2Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110169, China
  • 3The Third Militray Representative Office of the Air Force Equipment Department, Shenyang 110144, China
  • 4China Academy of Machinery Shenyang Research Institute of Foundry Co., Ltd., Shenyang 110022, China
  • show less
    DOI: 10.3788/IRLA20240256 Cite this Article
    Jiageng SANG, Zhijia ZHANG, Chuanmin XIAO, Haibo LUO, Junyao ZHANG. An improved YOLOv8s method and its application in road traffic target detection[J]. Infrared and Laser Engineering, 2024, 53(11): 20240256 Copy Citation Text show less
    References

    [1] Xuening WANG, Junhui LI, Yuan ZHAI. Analysis on the development environment of intelligent automobile industry in China. Auto Industry Research, 8-10(2023).

    [2] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies f accurate object detection semantic segmentation [C]2014 IEEE Conference on Computer Vision Pattern Recognition (CVPR), 2014: 580587.

    [3] GIRSHICK R. Fast RCNN [C]Proceedings of the IEEE International Conference on Computer Vision, IEEE, 2015: 14401448.

    [4] REN S, HE K, GIRSHICK R, et al. Faster RCNN: Towards realtime object detection with region proposal wks [J]. Advances in Neural Infmation Processing Systems , 2017, 39(6): 11371149.

    [5] HE K, GKIOXARI G, DOLLÁR P, et al. Mask RCNN [C]Proceedings of the IEEE International Conference on Computer Vision, 2017: 29612969.

    [6] LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot multibox detect [C]Computer VisionECCV 2016, 2016, 9905: 2137.

    [7] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, realtime object detection [C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2016: 779788.

    [8] REDMON J, FARHADI A. YOLO9000: Better, faster, stronger [C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2017: 72637271.

    [9] REDMON J, FARHADI A. YOLOv3: An incremental improvement [DBOL]. (20180408) [20240914]. https:arxiv.gabs1804.02767.

    [10] BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOv4: Optimal speed accuracy of object detection [DBOL]. (20200423) [20240914]. https:arxiv.gabs2004.10934.

    [11] ZHU X, LYU S, WANG X, et al. TPHYOLOv5: Improved YOLOv5 based on transfmer prediction head f object detection on dronecaptured scenarios [C]Proceedings of the IEEECVF International Conference on Computer Vision, 2021: 27782788.

    [12] LI C, LI L, JIANG H, et al. YOLOv6: A singlestage object detection framewk f industrial applications [DBOL]. (20180408) [20240914]. https:arxiv.gabs1804.02767.

    [13] WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: Trainable bagoffreebies sets new stateoftheart f realtime object detects [C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2023: 74647475.

    [14] S LI, Y LI, Y LI, Al ET. YOLO-Firi: Improved YOLOv5 for infrared image object detection. IEEE Access, 9, 141861-141875(2021).

    [15] Y CHEN, H SHIN. Pedestrian detection at night in infrared images using an attention-guided encoder-decoder convolutional neural network. Applied Sciences, 10, 809(2020).

    [16] L ZHOU, S GAO, S WANG et al. IPD-net: infrared pedestrian detection network via adaptive feature extraction and coordinate information fusion. Sensors, 22, 8966(2022).

    [17] X ZHAO, Y XIA, W ZHANG et al. YOLO-ViT-based method for unmanned aerial vehicle infrared vehicle target detection. Remote Sensing, 15, 3778(2023).

    [18] LIU S, QI L, QIN H, et al. Path aggregation wk f instance segmentation [C]Proceedings of the 2018 IEEECVF Conference on Computer Vision Pattern Recognition, 2018: 87598768.

    [19] SUNKARA R, LUO T. No me strided convolutions pooling: A new CNN building block f lowresolution images small objects [C]Joint European Conference on Machine Learning Knowledge Discovery in Databases. Cham: Springer Nature Switzerl, 2022: 443459.

    [20] JIE HU, LI SHEN, GANG SUN. Squeezeexcitation wks [C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2018: 71327141.

    [21] SZEGEDY C, VANHOUCKE V, IOFFE S, et al. Rethinking the inception architecture f computer vision [C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2016: 28182826.

    [22] Y F ZHANG, W REN, Z ZHANG et al. Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing, 506, 146-157(2022).

    [23] WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional block attention module [C]Proceedings of the European Conference on Computer Vision (ECCV), 2018: 319.

    [24] HOU Q, ZHOU D, FENG J. Codinate attention f efficient mobile wk design [C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2021: 1371313722.

    [25] OUYANG D, HE S, ZHANG G, et al. Efficient multiscale attention module with crossspatial learning [C]ICASSP 20232023 IEEE International Conference on Acoustics, Speech Signal Processing (ICASSP), IEEE, 2023: 15.

    Jiageng SANG, Zhijia ZHANG, Chuanmin XIAO, Haibo LUO, Junyao ZHANG. An improved YOLOv8s method and its application in road traffic target detection[J]. Infrared and Laser Engineering, 2024, 53(11): 20240256
    Download Citation