[1] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Columbus: IEEE, 2014: 580-587.
[2] GIRSHICK R. Fast R-CNN[C]//IEEE International Conference on Computer Vision (ICCV). Santiago: IEEE, 2015: 1440-1448.
[3] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
[4] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector[C]//Proceedings of the European Conference on Computer Vision (ECCV). Cham: Springer, 2016: 21-37.
[5] REDMON J, FARHADI A. YOLOv3: an incremental improvement[R]. Los Alamos: arXiv Preprint, 2018: arXiv: 1804.02767.
[6] HE K M, GKIOXARI G, DOLL
[7] LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection[C]//IEEE International Conference on Computer Vision (ICCV). Venice: IEEE, 2017: 2999-3007.
[10] LAW H, DENG J. CornerNet: detecting objects as paired keypoints[C]//Proceedings of the European Conference on Computer Vision (ECCV). Cham: Springer, 2018: 765-781.
[11] YU J H, JIANG Y N, WANG Z Y, et al. UnitBox: an advanced object detection network[C]//Proceedings of the 24th ACM International Conference on Multimedia. New York: Association for Computing Machinery, 2016: 516-520.
[12] DUAN K W, BAI S, XIE L X, et al. CenterNet: keypoint triplets for object detection[C]//IEEE/CVF International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019: 6568-6577.
[13] TIAN Z, SHEN C H, CHEN H, et al. FCOS: fully convolutional one-stage object detection[C]//IEEE/CVF International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019: 9626-9635.
[14] YU C Q, WANG J B, PENG C, et al. Bisenet: bilateral segmentation network for real-time semantic segmentation[C]//Proceedings of the European Conference on Computer Vision (ECCV). Cham: Springer, 2018: 334-349.
[15] YU C Q, WANG J B, PENG C, et al. Learning a discriminative feature network for semantic segmentation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City: IEEE, 2018: 1857-1866.
[16] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4): 834-848.
[17] DAI J F, QI H Z, XIONG Y W, et al. Deformable convolutional networks[C]//IEEE International Conference on Computer Vision (ICCV). Venice: IEEE, 2017: 764-773.
[18] HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City: IEEE, 2018: 7132-7141.
[20] DONG L L, MA D D, QIN G, et al. Infrared target detection in backlighting maritime environment based on visual attention model[J]. Infrared Physics & Technology, 2019, 99: 193-200.
[23] WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision (ECCV). Cham: Springer, 2018: 3-19.
[24] PARK J, WOO S, LEE J Y, et al. BAM: bottleneck attention module[R]. Los Alamos: arXiv Preprint, 2018: arXiv: 1807.06514.