Introduction

High-resolution multispectral images are crucial in the field of remote sensing, contributing significantly to areas such as national defense security1, target detection2, environmental protection3, and disaster prevention4. However, simultaneously achieving high spatial resolution and high spectral resolution remains a significant challenge due to the limitations of multispectral imaging devices and sensor technologies. Pan-sharpening provides an effective solution to this issue by fusing multispectral (MS) images with panchromatic (PAN) images of the same region, thereby generating high-resolution multispectral images5. While numerous spectral fusion algorithms are currently available, all pan-sharpening algorithms can be classified into four primary categories6: Component Substitution (CS), Multiresolution Analysis (MRA), Variational Optimization-based (VO), and Deep Learning (DL).

The Component Substitution (CS) method, also referred to as the spectral method, leverages the high spatial resolution of the panchromatic (PAN) image to replace the low spatial resolution components of the multispectral (MS) image, resulting in a fused MS image with enhanced spatial resolution7. Owing to its straightforward implementation, many pioneering pan-sharpening algorithms belong to the CS category8. Notable examples include those utilizing Intensity-Hue-Saturation (IHS)9, Principal Component Analysis (PCA)10, and Gram-Schmidt (GS)11. Recent contributions in the literature have focused on refining the injection rules, leading to the development of the Adaptive GS (GSA)12 and GSA with Clustering (C-GSA)13 methods. The Multiresolution Analysis (MRA) approach extracts spatial details from the PAN image through multiscale transformations and injects these details into the resampled MS image, yielding a fused multispectral image with high spatial resolution14. These methods can be traced back to fusion techniques based on generalized decomposition, which can be executed using simple low-pass filtering or more complex iterative methods, such as Laplacian pyramids, wavelets15, and Smoothing Filter-based Intensity Modulation (SFIM)16.

Variational Optimization-based (VO) techniques are recognized as sharpening methods that depend on the resolution of optimization problems. All techniques within this family are based on a model that defines an objective function, which is typically addressed through variational optimization methods to identify appropriate solutions17. A pioneering VO pan-sharpening approach introduced a regularization model based on Total Variation (TV)18. It is undeniable that the rise of Deep Learning (DL) has significantly impacted the field of pan-sharpening19,20. Masi et al. designed and trained the first fully convolutional network specifically developed for pan-sharpening, the Pansharpening Neural Network (PNN)21, which achieved promising results. An enhanced version, the Adaptive PNN (APNN)22, has also been proposed. Deng et al.23 developed the FusionNet architecture by integrating convolutional neural networks (CNNs) with traditional techniques, achieving commendable performance at both reduced and full scale. Jian et al.24 constructed a multi-scale multi-flow fusion network (MMFN) that extracts diverse features from multispectral (MS) and panchromatic (PAN) images using a multi-scale strategy and multi-flow fusion blocks, preserving both spatial and spectral features. Fang et al.25 introduced a lightweight single-branch CNN architecture with scaling, termed SDRCNN, which incorporates convolutional blocks and dense residual connections to retain spatial and spectral information. Lu et al.26 proposed a self-guided spatial channel adaptive convolution method (SSCAConv) that combines spatial and channel attention, accounting for both spatial and channel differences in features while improving sharpening outcomes. Que et al.27 integrated spectral and spatial networks in a bilateral pyramid structure, employing collaborative attention to strengthen the correlation between spectral and spatial features and thereby improve the sharpening results. Although deep learning methods can enhance the quality of fused images from various perspectives, they often involve high computational complexity and require substantial training data, which somewhat limits the advancement of these approaches28,29.

To effectively address the spatial and spectral distortions introduced by the methods mentioned above, this paper proposes a pan-sharpening (PS) approach based on a Component Substitution (CS) model and Side Window Filtering (SWF), referred to as the Improved Adaptive GS Method Based on SWF (SWGSA). It significantly enhances the quality of full-resolution fused images while reducing errors. When generating the intensity component, SWF is applied to the PAN image, achieving both denoising and improved edge preservation. Furthermore, the gain function is modified to incorporate a reference to the PAN image, thereby enhancing detail. The recently proposed RQNR metric is employed for performance evaluation. Experimental results indicate that the algorithm effectively enhances both spatial and spectral information.

Related works

Component substitution methods are based on projecting the multispectral (MS) image into a new space where spatial structures and spectral information are effectively separated. In this new space, the component containing spatial structures is replaced with the panchromatic (PAN) image, and the transformed MS image is subsequently sharpened. Finally, the sharpening process is completed by applying an inverse transformation to restore the MS image to its original space. This process can generally be described by the following equation:

$$\widehat{{{\bf{MS}}}}_{k} = \widetilde{{{\bf{MS}}}}_{k} + {\bf{G}}_{k} \cdot \left( {{\bf{P}} - {\bf{I}}_{L} } \right)$$
(1)

In Eq. (1), the subscript k indicates the spectral image of the k-th channel; \(\widehat{{{\bf{MS}}}}_{k}\) denotes the fused multispectral image; \(\widetilde{{{\bf{MS}}}}_{k}\) represents the multispectral image interpolated to the resolution of the PAN image; Gk refers to the injected gain matrix; P is the panchromatic image; and IL is the intensity component, which is defined as follows:

$${\bf{I}}_{L} = \sum\limits_{i = 1}^{K} {w_{i} \widetilde{{{\bf{MS}}}}_{i} }$$
(2)

In Eq. (2), K denotes the number of spectral channels, and wi represents the weight coefficient for the i-th channel. In the traditional Gram-Schmidt (GS) method, the intensity component is calculated by direct averaging, i.e., wi = 1/K. The injected gain matrix of the traditional GS method is computed as follows:

$${\bf{G}}_{k} = \frac{{\sigma ({\bf{I}}_{L} )}}{{\sigma ({\bf{P}})}}{\bf{1}}$$
(3)

In Eq. (3), \(\sigma\) represents the standard deviation and 1 indicates a matrix of ones. It is evident that the traditional GS method solely considers the relationship between the intensity component and the panchromatic (PAN) image.
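For concreteness, a minimal Python/NumPy sketch of the traditional GS fusion step defined by Eqs. (1)-(3) is given below; the band-first array layout and the function name are illustrative choices, not part of the original formulation.

```python
import numpy as np

def gs_fuse(ms_up, pan):
    """Traditional GS fusion following Eqs. (1)-(3).

    ms_up : (K, H, W) MS image interpolated to the PAN grid.
    pan   : (H, W) panchromatic image.
    """
    # Eq. (2) with uniform weights w_i = 1/K: plain band average.
    intensity = ms_up.mean(axis=0)
    # Eq. (3): a single scalar gain, the ratio of standard deviations.
    g = intensity.std() / pan.std()
    # Eq. (1): inject the PAN-minus-intensity detail into every band.
    return ms_up + g * (pan - intensity)
```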

Multiple improved versions have been developed by modifying how the intensity component is generated and how the injected gain matrix is calculated. The Gram-Schmidt Adaptive (GSA) method estimates the weight coefficient of each channel by minimizing the mean squared error (MSE) between a weighted average of the multispectral (MS) bands and a low-pass filtered version of the panchromatic (PAN) image. Furthermore, its injected gain matrix considers the relationship between the MS image and the intensity component, as represented in the following equation:

$${\bf{G}}_{k} = g_{k} {\bf{1}} = \frac{{{\text{cov}} \left( {\widetilde{{{\bf{MS}}}}_{k} ,{\bf{I}}_{L} } \right)}}{{{\text{var}} \left( {{\bf{I}}_{L} } \right)}}{\bf{1}}$$
(4)

In Eq. (4), cov(X, Y) denotes the covariance between two images X and Y, and var(X) represents the variance of image X. The formulas for covariance and variance are provided below:

$${\text{cov}} (X,Y) = \frac{1}{MN}\sum\limits_{i = 1}^{M} {\sum\limits_{j = 1}^{N} {(X(i,j) - \overline{X})(Y(i,j) - \overline{Y})} }$$
(5)
$${\text{var}} (X) = \frac{1}{MN}\sum\limits_{i = 1}^{M} {\sum\limits_{j = 1}^{N} {(X(i,j) - \overline{X})^{2} } }$$
(6)

In Eqs. (5) and (6), M and N denote the number of image rows and columns, while \(\overline{X}\) and \(\overline{Y}\) represent the average grayscale values over all pixels of the respective images.
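The GSA computation described above can be sketched as follows in Python/NumPy. The least-squares fit against the low-pass PAN image is one straightforward way to realize the MSE minimization; a bias term, often included in practice, is omitted for brevity.

```python
import numpy as np

def gsa_weights_and_gains(ms_up, pan_lp):
    """GSA: band weights fitted to the low-pass PAN image, then the
    per-band gains of Eq. (4) computed via Eqs. (5)-(6)."""
    K = ms_up.shape[0]
    A = ms_up.reshape(K, -1).T                     # (M*N, K) design matrix
    # Minimize the MSE between the weighted band average and pan_lp.
    w, *_ = np.linalg.lstsq(A, pan_lp.ravel(), rcond=None)
    intensity = np.tensordot(w, ms_up, axes=1)     # Eq. (2)

    i0 = intensity - intensity.mean()
    var_i = np.mean(i0 ** 2)                       # Eq. (6)
    # Eq. (4): g_k = cov(MS_k, I_L) / var(I_L), with cov as in Eq. (5).
    gains = np.array([np.mean((b - b.mean()) * i0) for b in ms_up]) / var_i
    return w, gains, intensity
```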

The GSA with Clustering (C-GSA) method employs the same injected gain matrix as the GSA method, but the intensity component is generated differently. C-GSA segments the MS image using a binary-tree segmentation algorithm, and k-means clustering is applied to estimate the weight coefficients of each channel from the segmented images, thereby adapting the weights to local image content.
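A rough sketch of this clustering idea is shown below; it substitutes plain k-means (scikit-learn) for the binary-tree segmentation, and the cluster count is an arbitrary illustrative value, so it should be read as a schematic rather than the authors' exact procedure.

```python
import numpy as np
from sklearn.cluster import KMeans

def cgsa_intensity(ms_up, pan_lp, n_clusters=3):
    """C-GSA-style intensity: per-cluster band weights instead of
    a single global set (schematic)."""
    K, H, W = ms_up.shape
    X = ms_up.reshape(K, -1).T                    # one (K,) vector per pixel
    labels = KMeans(n_clusters, n_init=10).fit_predict(X)
    intensity = np.zeros(H * W)
    for c in range(n_clusters):
        idx = labels == c
        # Fit the weights of Eq. (2) on this cluster's pixels only.
        w, *_ = np.linalg.lstsq(X[idx], pan_lp.ravel()[idx], rcond=None)
        intensity[idx] = X[idx] @ w
    return intensity.reshape(H, W)
```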

The mathematical foundation of the Gram-Schmidt Adaptive (GSA) method and its improved version, the C-GSA method, which is based on the classical Gram-Schmidt orthogonalization process, is clearly established. The calculations involved are relatively straightforward and easy to implement. Both GSA and its improved methods exhibit high computational efficiency and speed, particularly when handling large-scale image data. Due to their lower algorithmic complexity, these methods can complete panchromatic sharpening tasks within a shorter time frame. Furthermore, they effectively mitigate the impact of noise interference on sharpening results and demonstrate a degree of robustness to variations in input data, allowing for adaptability to different types of multispectral images and providing high-quality outputs.

In contrast, Multiresolution Analysis (MRA) methods typically involve complex wavelet transforms and multiscale analyses. In comparison, GSA and its improved methods are simpler to implement, requiring less frequent scale transformations, which reduces the overall complexity of implementation. These methods directly address the features of the input images, thereby avoiding information loss at various resolution levels and preserving both image detail and contrast.

Variational Optimization (VO) methods generally necessitate iterative calculations to resolve complex optimization problems. In contrast, GSA and its improved methods can produce results in a shorter time, making them suitable for applications requiring rapid responses. While variational optimization methods require an in-depth mathematical background and advanced optimization techniques, the implementation of GSA and its improved methods is relatively intuitive, promoting their widespread adoption and application.

Deep Learning (DL) methods typically demand a substantial amount of training data for model training. Conversely, GSA and its improved methods have relatively lower data requirements and can achieve good performance even on smaller datasets. The training process for deep learning models may consume significant time and computational resources, whereas GSA and its improved methods can rapidly yield results, making them ideal for quick deployment and application.

Proposed method

Figure 1 illustrates the flowchart of the algorithm proposed in this paper, with each component to be discussed in detail in the subsequent sections.

Fig. 1 Flowchart of the proposed approach.

SWGSA fusion model

The Gram-Schmidt (GS) method applies low-pass filtering to the panchromatic (PAN) image; however, low-pass filtering often diminishes the edge details of the image. Reference30 introduces the Side Window Filtering (SWF) model, which significantly enhances the ability to preserve image edges. Inspired by this model, applying it in the pan-sharpening (PS) process allows effective and relevant detail features to be better extracted from the image, laying a foundation for improving the spatial resolution of the multispectral (MS) fused image while maintaining spectral information after detail injection. SWF is introduced to better preserve meaningful detail in the image: by building the SWF-filtered image into the weight function, the detail features of the intensity component can be enhanced more thoroughly. Furthermore, since the fused image must account for both the MS and PAN images, the relationship with the PAN image is crucial. The new injection gain strategy therefore modifies the injected gain to exploit the covariance relationship between the PAN image and the intensity component, which further raises the image's detail level. Together, these two strategies positively influence the detail level of the fusion. The following sections describe the specific operational steps of the new intensity component and injection gain strategies.

Side window filtering

The original weight coefficients were established through linear regression between the low-pass filtered PAN image and the MS bands, with the goal of minimizing the mean squared error (MSE) between the low-pass filtered PAN image and the intensity component. The new intensity component strategy replaces the low-pass filtered PAN image with one filtered by Side Window Filtering (SWF), which preserves more detail. In this study, the weight function is defined by applying weighted averages to the MS bands, aiming to minimize the MSE between the upsampled MS image and the SWF-filtered PAN image. SWF not only retains detail information but also incurs a lower computational cost. This approach is expected to effectively mitigate the detail loss that low-pass filtering can introduce.

Compared to traditional filters, side window filtering removes image noise while preserving image edges and details more effectively. The parameters of the side window filter, including the window size and the number of iterations, can be flexibly adjusted for images with varying noise levels. Specifically, the choice of window size and iteration count must balance noise suppression against the retention of detail and edge information. Smaller filter windows are more effective at preserving details and edges but may be inadequate for denoising at higher noise levels. In contrast, larger filter windows smooth the image and remove noise more effectively but may also blur details and edges, degrading image clarity. Regarding the number of iterations, fewer iterations allow rapid noise removal but may not smooth the image sufficiently, leaving residual noise. Conversely, more iterations enhance the denoising effect, but excessive iterations can destroy image details and cause over-smoothing artifacts such as the "block effect."

Given that remote sensing images contain rich detail and edge information, care must be taken during filtering to avoid detail loss through over-smoothing. Additionally, considering the modulation transfer function (MTF) response of optical sensors, the collected image data may contain some high-frequency noise. To retain as much detail and edge information as possible while denoising, a 3 × 3 window was selected for the side window filter in this study. Balancing processing efficiency against the denoising effect, the number of iterations was set to 1, achieving a good trade-off between denoising effectiveness and detail retention.
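The following sketch implements the box (mean) variant of side window filtering with the 3 × 3 window and single iteration used here. The eight side windows follow the construction in reference30; boundary handling via reflection is an implementation choice.

```python
import numpy as np
from scipy.ndimage import correlate

def side_window_box_filter(img, iterations=1):
    """Box side window filtering with a 3x3 window.

    For each pixel, the mean over eight 'side' windows (left/right/
    up/down halves and the four corner quadrants, each containing the
    pixel itself) is computed, and the mean closest to the original
    value is kept, which preserves edges that a centred box filter
    would blur.
    """
    masks = []
    for rows, cols in [((0, 3), (0, 2)), ((0, 3), (1, 3)),   # left, right
                       ((0, 2), (0, 3)), ((1, 3), (0, 3)),   # up, down
                       ((0, 2), (0, 2)), ((0, 2), (1, 3)),   # NW, NE
                       ((1, 3), (0, 2)), ((1, 3), (1, 3))]:  # SW, SE
        m = np.zeros((3, 3))
        m[rows[0]:rows[1], cols[0]:cols[1]] = 1.0
        masks.append(m / m.sum())                 # averaging kernel

    out = img.astype(float)
    for _ in range(iterations):
        means = np.stack([correlate(out, m, mode='reflect') for m in masks])
        # Keep, per pixel, the side-window mean closest to the input.
        best = np.abs(means - out).argmin(axis=0)
        out = np.take_along_axis(means, best[None], axis=0)[0]
    return out
```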

The SWF method effectively captures the edges of the target. We employed data from the Gaofen satellite31 to illustrate the comparison of filtering effects, as depicted in Fig. 2. Figure 2(a) presents the original PAN image, Fig. 2(b) shows the low-pass (LP) filtered version of Fig. 2(a), and Fig. 2(c) displays the SWF filtered version of Fig. 2(a).

Fig. 2 Comparison of filtering results of different algorithms. (a) PAN; (b) LP; (c) SWF.

The analysis of the figure reveals that the edges in the original PAN image are distinctly pronounced; however, the low-pass filtered image in Fig. 2(b) suffers from considerable blurring, resulting in a lower similarity to the original PAN image and a reduction in visible edge details. In contrast, the SWF filtered image in Fig. 2(c) exhibits a strong similarity to the original PAN image, particularly regarding global edge representation. Notably, even in areas where edges are relatively subtle, the SWF method effectively highlights these edges and maintains detail. The comparison between Figs. 2(b) and 2(c) clearly demonstrates that the SWF filtered image in Fig. 2(c) shows a marked enhancement in edge information extraction compared to the low-pass filtered image in Fig. 2(b).

New injection gain matrices

As discussed above, the injection gain matrix Gk of the adaptive GS (GSA) method addresses only the relationship between the upsampled multispectral image and the intensity component, neglecting the relationship between the PAN image and the intensity component. In this study, we enhance detail information and reduce information distortion by incorporating the relationship between the PAN image and the intensity component.

We introduce a novel injection gain strategy that considers a scheme to augment the spatial information of the PAN image. The calculation method is as follows:

$${\bf{G}}_{k} = g_{k} {\bf{1}} = \frac{{{\text{cov}} \left( {\widetilde{{{\bf{MS}}}}_{k} ,{\bf{I}}_{L} } \right)}}{{{\text{cov}} \left( {{\bf{P}},{\bf{I}}_{L} } \right)}}{\bf{1}}$$
(7)

In Eq. (7), 1 refers to a matrix of ones, and cov(X, Y) represents the covariance between two images X and Y.

In the traditional Gram-Schmidt (GS) method, the injected gain matrix solely considers the relationship between the intensity component and the panchromatic (P) image. In contrast, the Gram-Schmidt Adaptive (GSA) method emphasizes the relationship between multispectral (MS) images and the intensity component. This paper presents an improved approach that incorporates the covariance between the panchromatic (PAN) image and the intensity component. This enhancement not only considers the relationship between the MS images and the intensity component but also expands the analysis to include the relationship between the PAN image and the intensity component. This improvement significantly enhances the representation of image details, thereby elevating the quality of the fused image.
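Combining the SWF-based intensity with the Eq. (7) gain, the core SWGSA fusion step can be sketched as follows (Python/NumPy, reusing the side_window_box_filter sketch from the previous section; the least-squares weight fit mirrors the intensity construction described earlier and is an implementation choice):

```python
import numpy as np

def swgsa_fuse(ms_up, pan):
    """SWGSA fusion step: SWF-filtered intensity plus Eq. (7) gains.

    ms_up : (K, H, W) MS image interpolated to the PAN grid.
    pan   : (H, W) PAN image.
    """
    K = ms_up.shape[0]
    # New intensity: fit band weights against the SWF-filtered PAN
    # instead of a low-pass version (3x3 window, one iteration).
    pan_swf = side_window_box_filter(pan, iterations=1)
    A = ms_up.reshape(K, -1).T
    w, *_ = np.linalg.lstsq(A, pan_swf.ravel(), rcond=None)
    intensity = np.tensordot(w, ms_up, axes=1)     # Eq. (2)

    def cov(x, y):                                 # Eq. (5)
        return np.mean((x - x.mean()) * (y - y.mean()))

    fused = np.empty_like(ms_up, dtype=float)
    for k in range(K):
        g_k = cov(ms_up[k], intensity) / cov(pan, intensity)  # Eq. (7)
        fused[k] = ms_up[k] + g_k * (pan - intensity)         # Eq. (1)
    return fused
```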

Experimental results and discussion

Experimental setup and dataset

A systematic performance evaluation was conducted. All experiments were performed under the following configuration: an Intel(R) Core(TM) i5-8300H CPU @ 2.30 GHz with 16 GB of RAM; the software environment was Windows 10 (64-bit), and the experimental platform was MATLAB R2021a.

This study employed three image datasets, performing two types of experiments, downscaled and full-scale comparisons, on the IKONOS32, GeoEye-133, and WorldView-334 datasets. ENVI 5.035 software was used for image registration and segmentation. In all experiments, the resolution of the multispectral (MS) images prior to fusion was 256 × 256, while the resolution of the panchromatic (PAN) images was 1024 × 1024, giving a resolution ratio R of 4. In the downscaling experiment, the original PAN and MS images were filtered according to the Wald protocol36 and downsampled by the resolution ratio of 4.
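A minimal sketch of this reduced-resolution setup is given below; a Gaussian low-pass is used here as a stand-in for the sensor-matched filters that the Wald protocol normally prescribes, and the sigma value is illustrative.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def wald_degrade(ms, pan, ratio=4, sigma=1.6):
    """Produce reduced-resolution inputs per the Wald protocol (sketch).

    Low-pass filter, then decimate by `ratio`; the original MS image
    then serves as the ground truth for evaluating the fused result.
    """
    ms_lr = np.stack([gaussian_filter(b, sigma)[::ratio, ::ratio] for b in ms])
    pan_lr = gaussian_filter(pan, sigma)[::ratio, ::ratio]
    return ms_lr, pan_lr
```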

This paper compares nine representative fusion methods with the proposed approach: the interpolated MS image with a 23-tap interpolation kernel (EXP)37, Principal Component Analysis (PCA)10, Smoothing Filter-based Intensity Modulation (SFIM)16, PCA/wavelet model-based fusion (PWMBF)15, the Pansharpening Neural Network (PNN)21, the Advanced Pansharpening Neural Network (A-PNN)22, FusionNet23, Gram-Schmidt adaptive (GSA)12, and GSA with Clustering (C-GSA)13. All comparison methods utilized in this study are based on open-source code provided by the respective authors. Seven evaluation metrics38,39 were employed to assess the fused images produced by the different methods, with detailed descriptions of these metrics provided in Table 1. To ensure the reproducibility and transparency of the research, the code used in this study will be made publicly available after the article's publication, allowing other researchers to reference and utilize it.

Table 1 The quantitative evaluation index.
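As one concrete example of the reduced-resolution metrics in Table 1, the ERGAS index can be computed as follows (a sketch of the standard definition; lower values indicate better fidelity):

```python
import numpy as np

def ergas(fused, reference, ratio=4):
    """ERGAS between fused and ground-truth images, both (K, H, W).

    Standard definition: 100 * (1/ratio) * sqrt of the mean over
    bands of (RMSE_k / mean_k)^2.
    """
    terms = [np.mean((f - r) ** 2) / r.mean() ** 2
             for f, r in zip(fused, reference)]
    return 100.0 / ratio * np.sqrt(np.mean(terms))
```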

Experimental results

To evaluate the fused images more effectively, we selected a small region (highlighted by the small green box) from each fused image for 4× magnification. The magnified area was then placed in the lower right corner of the fused image (indicated by the large green box).

The fusion effects of different algorithms tested on the IKONOS dataset are illustrated in Fig. 3. Figure 3(a) displays the ground truth image, while Fig. 3(b) shows the panchromatic image. Figures 3(c) to 3(l) present the corresponding fusion results obtained from various fusion methods. From the fusion results of this set of images, it is evident that the traditional EXP algorithm performs the worst, producing a blurry image. The fusion results from the PCA, SFIM, and PWMBF algorithms also exhibit localized areas of ghosting and loss of detail in certain regions. The PNN algorithm suffers from spectral distortion issues, leading to color deviations and poor spatial quality. The APNN algorithm produces blurry images. The FusionNet algorithm yields relatively satisfactory results. In contrast, the GSA and C-GSA algorithms produce clear image outputs. The algorithm proposed in this paper effectively preserves image details, resulting in high-quality images.

Fig. 3 Fusion results of the IKONOS dataset. (a) Ground truth; (b) PAN image; (c) EXP; (d) PCA; (e) SFIM; (f) PWMBF; (g) PNN; (h) A-PNN; (i) FusionNet; (j) GSA; (k) C-GSA; (l) Proposed method.

Table 2 presents a comparison of the objective evaluation metrics obtained by the various algorithms on the IKONOS dataset. The optimal results for each metric are highlighted in bold, while the best results among the GSA-related methods are italicized. From Table 2, it can be seen that for the Dλ index, the proposed algorithm is second only to the SFIM algorithm, a significant improvement over the GSA and C-GSA algorithms. For the Ds and ERGAS indices, the proposed algorithm achieves the smallest values, showing a clear advantage. In terms of the RQNR, Q4, and SSIM metrics, the proposed algorithm produces the highest values, a notable improvement over the second-best results. For the PIQE index, the proposed algorithm again ranks second only to the SFIM algorithm. The C-GSA and GSA algorithms differ only minimally across the various metrics, whereas the proposed algorithm achieves the best results among the GSA-related methods on every metric, improving on both GSA and C-GSA.

Table 2 Quantitative evaluation results for IKONOS.

The fusion effects of different algorithms tested on the GeoEye-1 dataset are illustrated in Fig. 4. Figure 4(a) displays the ground truth image, while Fig. 4(b) shows the panchromatic image. Figures 4(c) to 4(l) present the corresponding fusion results obtained from various fusion methods. From the fusion results of this set of images, it is clear that the traditional EXP algorithm still performs the worst, exhibiting significant spatial distortion and overall blurriness. The PCA algorithm suffers from color distortion. The SFIM and PWMBF algorithms demonstrate oversharpening, resulting in unclear edges. The results from both the PNN and APNN algorithms show blurriness in the enlarged areas, indicating subpar quality. The FusionNet algorithm presents slightly blurred results in the enlarged regions. The GSA and C-GSA algorithms exhibit overexposure in the grayscale of the white roof areas. In contrast, the results from the proposed algorithm demonstrate good clarity and detail in the images.

Fig. 4 Fusion results of the GeoEye-1 dataset. (a) Ground truth; (b) PAN image; (c) EXP; (d) PCA; (e) SFIM; (f) PWMBF; (g) PNN; (h) A-PNN; (i) FusionNet; (j) GSA; (k) C-GSA; (l) Proposed method.

Table 3 presents a comparison of the objective evaluation metrics obtained by the various algorithms on the GeoEye-1 dataset. The optimal results for each metric are highlighted in bold, while the best results among the GSA-related methods are italicized. It is clear from the table that the proposed algorithm achieves the best results in the Ds, RQNR, SSIM, and PIQE metrics. For the Dλ index, the proposed algorithm is surpassed only by the SFIM and PNN algorithms. In terms of the Q4 index, the PNN algorithm achieves the best result, while the PWMBF algorithm excels in the ERGAS index. Among the GSA-related methods, the C-GSA algorithm shows only marginal improvements over the GSA algorithm, with minimal differences in their performance. In contrast, the proposed algorithm improves on both the GSA and C-GSA algorithms in the Ds, Dλ, RQNR, SSIM, PIQE, and ERGAS metrics, although it performs relatively poorly in the Q4 index.

Table 3 Quantitative evaluation results for GeoEye-1.

The fusion effects of different algorithms tested on the WorldView-3 dataset are illustrated in Fig. 5. Figure 5(a) displays the ground truth image, while Fig. 5(b) shows the panchromatic image. Figures 5(c) to 5(l) present the corresponding fusion results obtained from various fusion methods. From the fusion results of this set of images, it is clear that the EXP algorithm performs poorly, resulting in a blurred image. The SFIM algorithm shows blurriness in the enlarged regions of the fusion results. The PCA algorithm produces blurred enlarged images with poor spatial quality. The PWMBF algorithm suffers from oversharpening and color distortion. The results from both the PNN and APNN algorithms are generally subpar, exhibiting color deviations and blurriness, leading to poor quality. The FusionNet algorithm presents enlarged regions that are not very clear. The GSA and C-GSA algorithms yield images with decent brightness but suffer from excessive smoothing. In contrast, the results from the proposed algorithm do not show overexposure and maintain good clarity.

Fig. 5 Fusion results of the WorldView-3 dataset. (a) Ground truth; (b) PAN image; (c) EXP; (d) PCA; (e) SFIM; (f) PWMBF; (g) PNN; (h) A-PNN; (i) FusionNet; (j) GSA; (k) C-GSA; (l) Proposed method.

Table 4 presents a comparison of the objective evaluation metrics obtained by the various algorithms on the WorldView-3 dataset. The optimal results for each metric are highlighted in bold, while the best results among the GSA-related methods are italicized. It is clear from the table that the proposed algorithm achieves the best results in the Ds, RQNR, SSIM, and PIQE metrics. For the Dλ index, the SFIM algorithm performs best, followed by the APNN algorithm. In terms of the Q8 index, the SFIM algorithm again achieves the best result, with the proposed algorithm ranking just behind the SFIM and PWMBF algorithms. Regarding the ERGAS index, the PWMBF algorithm obtains the best result, with the proposed algorithm placing just below the SFIM, PWMBF, and FusionNet algorithms. Among the GSA-related methods, the C-GSA algorithm actually performs worse than the GSA algorithm on several metrics, showing limited improvement. In contrast, the proposed algorithm improves on both the GSA and C-GSA algorithms across all metrics, achieving the best results within this family.

Table 4 Quantitative evaluation results for WorldView-3.

A key aspect of this study is the analysis and comparison of the computational complexity of the algorithms. By measuring the runtime of the different methods on the IKONOS, GeoEye-1, and WorldView-3 datasets, we evaluate the computational cost of the pan-sharpening algorithms and analyze their effectiveness in depth. As shown in Table 5, the optimal results are highlighted in bold, while the best results among the GSA-related methods are italicized. The component substitution and multiresolution analysis methods generate the final products in a shorter time, while the variational optimization and deep learning methods are characterized by longer execution times, which is inherent to those algorithms. The runtime of the proposed algorithm is only slightly higher than that of the PCA and SFIM algorithms, but it is significantly lower than that of the GSA and C-GSA algorithms. The average runtime of the proposed algorithm across the three datasets is only two-thirds of that of the GSA algorithm and just 6.8% of that of the C-GSA algorithm.

Table 5 Comparison of running times of the different methods (in seconds).

Discussion

Based on the image fusion results from the IKONOS, GeoEye-1, and WorldView-3 datasets, the proposed algorithm demonstrates superior performance across four evaluation metrics (Ds, RQNR, SSIM, and PIQE). However, its performance on the Dλ and Q4 metrics is relatively weak, for several reasons. First, to enhance the spatial details of the panchromatic image, SWF filtering is employed during the sharpening process, which can introduce some distortion of the spectral information and ultimately degrade the Dλ metric. Second, while SWF filtering is effective at preserving edges and details, it may in certain cases over-enhance image edges, producing unnatural transitions in some areas that penalize the Q4 metric. Additionally, the images in the three datasets contain complex textures and rich details. Although the proposed method jointly considers the relationships of both the multispectral images and the panchromatic (PAN) image with the intensity component, this can lead to some duplication of image details, further affecting the Dλ and Q4 scores.

Although the proposed algorithm performs relatively poorly on the Dλ and Q4 metrics, it improves on the GSA and C-GSA algorithms across all performance indicators. This improvement stems primarily from the different filtering technique and injected gain matrix used in this study. Since the injected gain matrices of the GSA and C-GSA algorithms are identical, the differences in their performance across the various metrics are relatively minor, resulting in insignificant variations in the processing outcomes. The side window filter employed here not only reduces computational load but also preserves the edges and details of the image while suppressing noise. Furthermore, the new injected gain matrix accounts not only for the relationship between the multispectral (MS) images and the intensity component but also for the relationship between the panchromatic (PAN) image and the intensity component. This significantly improves the representation of image details and boosts the overall quality of the fused image.

In summary, based on the image fusion results from the three datasets, the proposed algorithm outperforms both the GSA and C-GSA algorithms across all metrics and achieves the best results in several key indicators, including Ds, RQNR, SSIM, and PIQE. The proposed method exhibits lower computational complexity and can deliver excellent image quality and optimal performance across multiple evaluation metrics within a shorter runtime. Further comprehensive subjective and objective evaluations, along with performance analyses, indicate that the method proposed in this study has clear advantages and can effectively enhance the quality of multispectral image fusion.

Conclusions

This paper presents a multispectral image fusion method based on side window filtering and a novel injected gain, termed the improved adaptive GS method based on SWF (SWGSA). The side window filtering technique facilitates denoising while enhancing edge preservation in the images. By modifying the injected gain and incorporating the panchromatic (PAN) image as a reference, further enhancement of image details is achieved. The proposed method’s performance is evaluated on three image datasets using nine comparison methods and seven quantitative metrics. The results confirm that the proposed method effectively enhances spatial details and spectral information, leading to improved quality in the fused images.