• Laser & Optoelectronics Progress
  • Vol. 61, Issue 4, 0428004 (2024)
Xingtao Ming and Dehong Yang*
Author Affiliations
  • Faculty of Land and Resources Engineering, Kunming University of Science and Technology, Kunming 650093, Yunnan, China
  • show less
    DOI: 10.3788/LOP231148 Cite this Article Set citation alerts
    Xingtao Ming, Dehong Yang. Building Extraction from Remote Sensing Image Based on Multi-Module[J]. Laser & Optoelectronics Progress, 2024, 61(4): 0428004 Copy Citation Text show less

    Abstract

    Building extraction from high-resolution remote sensing imagery is an important research direction for the interpretation of remote sensing imagery. To address the issues of small buildings easily lost and large buildings with blurred boundaries by traditional extraction methods, this paper proposes a multi-module building extraction U-shaped network (MM-Unet) based on Unet. First, Multi-scale feature combination module (MFCM) is introduced in the encoder and decoder sections of the network to obtain and supplement more spatial information. Then, multi-scale feature enhancement module (MFEF) is incorporated at the end of the decoder to enhance the extraction of multi-scale features. After the skip connections, the dual attention module (DAM) is introduced to adaptively learn the feature importance of channel and spatial positions, thereby reducing the differences among features at different depths. In order to validate the effectiveness of the network, experiments are conducted on Massachusetts, WHU, and Vaihingen building datasets with spatial resolutions of 1 m, 0.3 m, and 0.09 m respectively. and the intersection and union ratio of MM-Unet reach 73.42%, 90.11%, and 85.21%, compared to Unet, increased by 2.21 percentage points, 1.25 percentage points, and 1.55 percentage points. These results demonstrate that MM-Unet shows high extraction accuracy and strong generalization ability on buildings of various scales.
    y=F(w,x)+x
    Mc=σMLPAvgPool(F)+MLPMaxPool(F)
    C=FMc
    Ms=σConv7×7ConcatAvgPool(C),MaxPool(C),
    S=CMs
    R=S+F
    ROA=NTP+NTNNTP+NTN+NFP+NFN
    Rprecision=NTPNTP+NFP
    Rrecall=NTPNTP+NFN
    sF1=2×Rprecision×RrecallRprecision+Rrecall
    RIoU=NTPNTP+NFP+NFN