Text-Semantics-Driven Feature Extraction from Remote Sensing Imagery

Sijun DONG; Xiaoliang MENG

doi:10.3969/j.issn.1009-8518.2024.03.009
