Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer

Abstract To enhance high-frequency perceptual information and texture details in remote sensing images and address the challenges of super-resolution reconstruction algorithms during training, particularly the issue of missing details, this paper proposes an improved remote sensing image super-resol...

Full description

Saved in:
Bibliographic Details
Main Authors: Denghui Liu, Lin Zhong, Haiyang Wu, Songyang Li, Yida Li
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-86446-5
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594755491463168
author Denghui Liu
Lin Zhong
Haiyang Wu
Songyang Li
Yida Li
author_facet Denghui Liu
Lin Zhong
Haiyang Wu
Songyang Li
Yida Li
author_sort Denghui Liu
collection DOAJ
description Abstract To enhance high-frequency perceptual information and texture details in remote sensing images and address the challenges of super-resolution reconstruction algorithms during training, particularly the issue of missing details, this paper proposes an improved remote sensing image super-resolution reconstruction model. The generator network of the model employs multi-scale convolutional kernels to extract image features and utilizes a multi-head self-attention mechanism to dynamically fuse these features, significantly improving the ability to capture both fine details and global information in remote sensing images. Additionally, the model introduces a multi-stage Hybrid Transformer structure, which processes features at different resolutions progressively, from low resolution to high resolution, substantially enhancing reconstruction quality and detail recovery. The discriminator combines multi-scale convolution, global Transformer, and hierarchical feature discriminators, providing a comprehensive and refined evaluation of image quality. Finally, the model incorporates a Charbonnier loss function and total variation (TV) loss function, which significantly improve training stability and accelerate convergence. Experimental results demonstrate that the proposed method, compared to the SRGAN algorithm, achieves average improvements of approximately 3.61 dB in Peak Signal-to-Noise Ratio (PSNR), 0.070 (8.2%) in Structural Similarity Index (SSIM), and 0.030 (3.1%) in Feature Similarity Index (FSIM) across multiple datasets, showing significant performance gains.
format Article
id doaj-art-45f377b851b54b59a569cc2e4bdcc2e7
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-45f377b851b54b59a569cc2e4bdcc2e72025-01-19T12:21:11ZengNature PortfolioScientific Reports2045-23222025-01-0115111410.1038/s41598-025-86446-5Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformerDenghui Liu0Lin Zhong1Haiyang Wu2Songyang Li3Yida Li4School of Electronics and Information, Xijing UniversitySchool of Electronics and Information, Xijing UniversitySchool of Electronics and Information, Xijing UniversitySchool of Electronics and Information, Xijing UniversitySchool of Electronics and Information, Xijing UniversityAbstract To enhance high-frequency perceptual information and texture details in remote sensing images and address the challenges of super-resolution reconstruction algorithms during training, particularly the issue of missing details, this paper proposes an improved remote sensing image super-resolution reconstruction model. The generator network of the model employs multi-scale convolutional kernels to extract image features and utilizes a multi-head self-attention mechanism to dynamically fuse these features, significantly improving the ability to capture both fine details and global information in remote sensing images. Additionally, the model introduces a multi-stage Hybrid Transformer structure, which processes features at different resolutions progressively, from low resolution to high resolution, substantially enhancing reconstruction quality and detail recovery. The discriminator combines multi-scale convolution, global Transformer, and hierarchical feature discriminators, providing a comprehensive and refined evaluation of image quality. Finally, the model incorporates a Charbonnier loss function and total variation (TV) loss function, which significantly improve training stability and accelerate convergence. Experimental results demonstrate that the proposed method, compared to the SRGAN algorithm, achieves average improvements of approximately 3.61 dB in Peak Signal-to-Noise Ratio (PSNR), 0.070 (8.2%) in Structural Similarity Index (SSIM), and 0.030 (3.1%) in Feature Similarity Index (FSIM) across multiple datasets, showing significant performance gains.https://doi.org/10.1038/s41598-025-86446-5Remote sensing imageImage Super-resolutionGANAttention mechanismHybrid transformerMulti-scale feature extraction
spellingShingle Denghui Liu
Lin Zhong
Haiyang Wu
Songyang Li
Yida Li
Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
Scientific Reports
Remote sensing image
Image Super-resolution
GAN
Attention mechanism
Hybrid transformer
Multi-scale feature extraction
title Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
title_full Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
title_fullStr Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
title_full_unstemmed Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
title_short Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
title_sort remote sensing image super resolution reconstruction by fusing multi scale receptive fields and hybrid transformer
topic Remote sensing image
Image Super-resolution
GAN
Attention mechanism
Hybrid transformer
Multi-scale feature extraction
url https://doi.org/10.1038/s41598-025-86446-5
work_keys_str_mv AT denghuiliu remotesensingimagesuperresolutionreconstructionbyfusingmultiscalereceptivefieldsandhybridtransformer
AT linzhong remotesensingimagesuperresolutionreconstructionbyfusingmultiscalereceptivefieldsandhybridtransformer
AT haiyangwu remotesensingimagesuperresolutionreconstructionbyfusingmultiscalereceptivefieldsandhybridtransformer
AT songyangli remotesensingimagesuperresolutionreconstructionbyfusingmultiscalereceptivefieldsandhybridtransformer
AT yidali remotesensingimagesuperresolutionreconstructionbyfusingmultiscalereceptivefieldsandhybridtransformer