• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (7): 1262-1273.

• Graphics and Images • Previous Articles     Next Articles

A pyramid feature decoupling extraction fusion network for pansharpening

LIN Yi1,2,3,SONG Huihui1,2,3   

  1. (1.School of Computer Science,School of Cyber Science and Engineering,Nanjing University of Information Science
     & Technology,Nanjing 210044;
    2.Jiangsu Key Laboratory of Big Data Analysis Technology,Nanjing 210044;
    3.Collaborative Innovation Center of Atmospheric Environment and Equipment Technology,Nanjing 210044,China)
  • Received:2023-12-19 Revised:2024-06-02 Online:2025-07-25 Published:2025-08-25

Abstract: The objective of pansharpening is to fuse low-resolution multispectral images (LRMS) and their corresponding high-resolution panchromatic images (PAN) acquired by the same remote sensing satellite to generate high-resolution multispectral images (HRMS).Existing networks overly rely on the feature extraction and fusion capabilities of deep learning,failing to focus on the advantageous features of each modality and neglecting the distinct representations inherent in multimodal data,which leads to excessive redundant features in the final output.To extract features that independently express desired representations,reduce redundant information,and better integrate the complementary information from both modalities,this paper proposes a novel pyramid feature decoupling extraction and fusion network for pansharpening,effectively enhancing the clear representation of spectral and texture details in images.First,inspired by the divide-and-conquer concept,the network decouples and separately extracts spectral and texture information,employing different attention mechanisms to capture the unique details of each modality.Then,a cross-modal feature fusion module strengthens the interaction between features of different modalities,enabling the network to acquire complementary information while eliminating redundancy.Finally,based on a pyramid structure,the network performs feature extraction and fusion operations at multiple spatial scales,achieving outstanding results.Extensive experiments conducted on the GaoFen-2 and WorldView-3 satellite datasets demonstrate that the proposed network significantly outperforms state-of-the-art approaches,providing sub-stantial improvements for the pansharpening task.

Key words: remote sensing, pansharpening, pyramid, feature decoupling, attention mechanism