• Optoelectronics Letters
  • Vol. 20, Issue 7, 424 (2024)
Minming YU1, Sixian CHAN1,2,*, Xiaolong ZHOU3, and Zhounian LAI4
Author Affiliations
  • 1College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
  • 2Hangzhou Xsuan Technology Co., Ltd., Hangzhou 310051, China
  • 3Quzhou University, Quzhou 324000, China
  • 4Huzhou Institute of Zhejiang University, Huzhou 313002, China
    DOI: 10.1007/s11801-024-3181-7
    YU Minming, CHAN Sixian, ZHOU Xiaolong, and LAI Zhounian. Small object detection on highways via balance feature fusion and task-specific encoding network[J]. Optoelectronics Letters, 2024, 20(7): 424
    References

    [1] WANG C Y, BOCHKOVSKIY A, LIAO H M. Yolov7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 17-24, 2023, Vancouver, BC, Canada. New York: IEEE, 2023: 7464-7475.

    [2] CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]//Proceedings of the European Conference on Computer Vision (ECCV), August 23-28, 2020, Glasgow, UK. Berlin, Heidelberg: Springer, 2020: 213-229.

    [3] CHAN S X, LIU P, ZHANG Z. Webox: locating small objects from weak edges[J]. Optoelectronics Letters, 2021, 17(6): 349-353.

    [4] KIM S, KOOK H, SUN J, et al. Parallel feature pyramid network for object detection[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 8-14, 2018, Munich, Germany. Berlin, Heidelberg: Springer, 2018: 239-256.

    [5] LIU S, QI L, QIN H, et al. Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, June 18-22, 2018, Salt Lake City, UT, USA. New York: IEEE, 2018: 8759-8768.

    [6] WU Y, CHEN Y P, YUAN L, et al. Rethinking classification and localization for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 13-19, 2020, Seattle, WA, USA. New York: IEEE, 2020: 10183-10192.

    [7] GE Z, LIU S T, WANG F, et al. YOLOX: exceeding YOLO series in 2021[EB/OL]. (2021-07-18) [2023-06-24]. https://arxiv.org/abs/2107.08430.

    [8] BOCHKOVSKIY A, WANG C, LIAO H M. Yolov4: optimal speed and accuracy of object detection[EB/OL]. (2020-04-23) [2023-06-24]. https://arxiv.org/abs/2004.10934.

    [9] ZHANG Y F, REN W Q, ZHANG Z, et al. Focal and efficient IOU loss for accurate bounding box regression[J]. Neurocomputing, 2022, 506: 146-157.

    [10] WANG J W, YANG W, GUO H, et al. Tiny object detection in aerial images[C]//2020 25th International Conference on Pattern Recognition (ICPR), January 10-15, 2021, Milan, Italy. New York: IEEE, 2021: 3791-3798.

    [11] LIN T, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context[C]//Proceedings of the European Conference on Computer Vision (ECCV), September 6-12, 2014, Zurich, Switzerland. Berlin, Heidelberg: Springer, 2014: 740-755.

    [12] GIRSHICK R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, December 7-13, 2015, Santiago, Chile. New York: IEEE, 2015: 1440-1448.

    [13] CAI Z W, VASCONCELOS N. Cascade R-CNN: delving into high quality object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, June 18-22, 2018, Salt Lake City, UT, USA. New York: IEEE, 2018: 6154-6162.

    [14] ZHOU X Y, WANG D Q, KRÄHENBÜHL P. Objects as points[EB/OL]. (2019-04-25) [2023-06-24]. https://arxiv.org/abs/1904.07850v1.

    [15] LU X, LI B Y, YUE Y X, et al. Grid R-CNN[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 16-20, 2019, Long Beach, CA, USA. New York: IEEE, 2019: 7363-7372.

    [16] ZHANG S F, CHI C, YAO Y Q, et al. Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 13-19, 2020, Seattle, WA, USA. New York: IEEE, 2020: 9759-9768.

    [17] DAI X Y, CHEN Y P, XIAO B, et al. Dynamic head: unifying object detection heads with attentions[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 19-25, 2021, virtual. New York: IEEE, 2021: 7373-7382.

    [18] FENG C J, ZHONG Y J, GAO Y, et al. TOOD: task-aligned one-stage object detection[C]//2021 IEEE/CVF International Conference on Computer Vision, October 10-17, 2021, Montreal, QC, Canada. New York: IEEE, 2021: 3490-3499.

    [19] ZHANG H, LI F, LIU S L, et al. DINO: DETR with improved denoising anchor boxes for end-to-end object detection[C]//Proceedings of the IEEE/CVF International Conference on Learning Representations, May 1-5, 2023, Kigali, Rwanda. New York: IEEE, 2023.

    [20] XU C, WANG J W, YANG W, et al. Dot distance for tiny object detection in aerial images[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 19-25, 2021, virtual. New York: IEEE, 2021: 1192-1201.

    [21] WANG J W, XU C, YANG W, et al. A normalized Gaussian Wasserstein distance for tiny object detection[EB/OL]. (2021-10-26) [2023-06-24]. https://arxiv.org/abs/2110.13389.

    [22] REDMON J, FARHADI A. Yolov3: an incremental improvement[EB/OL]. (2018-04-08) [2023-06-24]. https://arxiv.org/abs/1804.02767.

    [23] KIM K, LEE H S. Probabilistic anchor assignment with iou prediction for object detection[C]//Proceedings of the European Conference on Computer Vision (ECCV), August 23-28, 2020, Glasgow, UK. Berlin, Heidelberg: Springer, 2020: 355-371.

    [24] QIAO S Y, CHEN L, YUILLE A. DetectoRS: detecting objects with recursive feature pyramid and switchable atrous convolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 19-25, 2021, virtual. New York: IEEE, 2021: 10213-10224.
