• Optics and Precision Engineering
  • Vol. 32, Issue 7, 1087 (2024)
Haibin WU1, Shiyu DAI1, Aili WANG1,*, Iwahori YUJI2, and Xiaoyu YU3
Author Affiliations
  • 1Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, College of Measurement and Control Technology and Communication Engineering, Harbin University of Science and Technology, Harbin50080, China
  • 2Department of Computer Science, Chubu University, Aichi487-8501, Japan
  • 3College of Electron and Information, University of Electronic Science and Technology of China,Zhongshan Institute, Zhongshan528400, China
  • show less
    DOI: 10.37188/OPE.20243207.1087 Cite this Article
    Haibin WU, Shiyu DAI, Aili WANG, Iwahori YUJI, Xiaoyu YU. Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer[J]. Optics and Precision Engineering, 2024, 32(7): 1087 Copy Citation Text show less

    Abstract

    To tackle the challenges in multimodal classification tasks involving hyperspectral images (HSI) and LiDAR data, such as cross-modal information expression and feature alignment, this paper introduces a contrastive learning-based multi-branch CNN-Transformer network (CLCT-Net) for the joint classification of hyperspectral and LiDAR data. Initially, CLCT-Net employs a feature extraction module with a ConvNeXt V2 Block to capture shared features across different modalities, addressing the semantic alignment issue between data from heterogeneous sensors. It then develops a dual-branch HSI encoder with spatial channel and spectral context branches, alongside a LiDAR encoder enhanced by a frequency domain self-attention mechanism, to secure more comprehensive feature representations. Lastly, it leverages ensemble contrastive learning for classification to further refine the accuracy of multimodal collaborative classification. Experimental evaluations on the Houston 2013 and Trento datasets demonstrate that the proposed model excels in extracting and integrating cross-modal data features, achieving superior ground object classification accuracies of 92.01% and 98.90%, respectively, when compared to existing models for classifying hyperspectral images and LiDAR data.
    Haibin WU, Shiyu DAI, Aili WANG, Iwahori YUJI, Xiaoyu YU. Collaborative classification of hyperspectral and LiDAR data based on CNN-transformer[J]. Optics and Precision Engineering, 2024, 32(7): 1087
    Download Citation