Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution

Conventional neural network-based approaches for single remote sensing image super-resolution (SRSISR) have made remarkable progress. However, the super-resolution outputs produced by these methods often fall short in terms of visual quality. Recent advances in diffusion models for image generation...

Full description

Saved in:
Bibliographic Details
Main Authors: Chao Zhu, Yong Liu, Shan Huang, Fei Wang
Format: Article
Language:English
Published: MDPI AG 2025-04-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/8/1348
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850144080421453824
author Chao Zhu
Yong Liu
Shan Huang
Fei Wang
author_facet Chao Zhu
Yong Liu
Shan Huang
Fei Wang
author_sort Chao Zhu
collection DOAJ
description Conventional neural network-based approaches for single remote sensing image super-resolution (SRSISR) have made remarkable progress. However, the super-resolution outputs produced by these methods often fall short in terms of visual quality. Recent advances in diffusion models for image generation have demonstrated remarkable potential for enhancing the visual content of super-resolved images. Despite this promise, existing large diffusion models are predominantly trained on natural images, which have huge differences in data distribution, making them hard to apply in remote sensing images (RSIs). This disparity poses challenges for directly applying these models to RSIs. Moreover, while diffusion models possess powerful generative capabilities, their output must be carefully controlled to generate accurate details as the objects in RSIs are small and blurry. In this paper, we introduce RSDiffSR, a novel SRSISR method based on a conditional diffusion model. This framework ensures the high-quality super-resolution of RSIs through three key contributions. First, it leverages a large diffusion model as a generative prior, which substantially enhances the visual quality of super-resolved RSIs. Second, it incorporates low-rank adaptation into the diffusion UNet and multi-stage training process to address the domain gap caused by differences in data distributions. Third, an enhanced control mechanism is designed to process the content and edge information of RSIs, providing effective guidance during the diffusion process. Experimental results demonstrate that the proposed RSDiffSR achieves state-of-the-art performance in both quantitative and qualitative evaluations across multiple benchmarks.
format Article
id doaj-art-bc6ff37fbb9646efa7bf5cc60c299cc3
institution OA Journals
issn 2072-4292
language English
publishDate 2025-04-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj-art-bc6ff37fbb9646efa7bf5cc60c299cc32025-08-20T02:28:28ZengMDPI AGRemote Sensing2072-42922025-04-01178134810.3390/rs17081348Taming a Diffusion Model to Revitalize Remote Sensing Image Super-ResolutionChao Zhu0Yong Liu1Shan Huang2Fei Wang3National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center of Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an 710049, ChinaNational Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center of Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an 710049, ChinaThe Beijing Institute of Remote Sensing Information, Beijing 100192, ChinaNational Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center of Visual Information and Applications, Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an 710049, ChinaConventional neural network-based approaches for single remote sensing image super-resolution (SRSISR) have made remarkable progress. However, the super-resolution outputs produced by these methods often fall short in terms of visual quality. Recent advances in diffusion models for image generation have demonstrated remarkable potential for enhancing the visual content of super-resolved images. Despite this promise, existing large diffusion models are predominantly trained on natural images, which have huge differences in data distribution, making them hard to apply in remote sensing images (RSIs). This disparity poses challenges for directly applying these models to RSIs. Moreover, while diffusion models possess powerful generative capabilities, their output must be carefully controlled to generate accurate details as the objects in RSIs are small and blurry. In this paper, we introduce RSDiffSR, a novel SRSISR method based on a conditional diffusion model. This framework ensures the high-quality super-resolution of RSIs through three key contributions. First, it leverages a large diffusion model as a generative prior, which substantially enhances the visual quality of super-resolved RSIs. Second, it incorporates low-rank adaptation into the diffusion UNet and multi-stage training process to address the domain gap caused by differences in data distributions. Third, an enhanced control mechanism is designed to process the content and edge information of RSIs, providing effective guidance during the diffusion process. Experimental results demonstrate that the proposed RSDiffSR achieves state-of-the-art performance in both quantitative and qualitative evaluations across multiple benchmarks.https://www.mdpi.com/2072-4292/17/8/1348diffusionneural networkremote sensing image super-resolution
spellingShingle Chao Zhu
Yong Liu
Shan Huang
Fei Wang
Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
Remote Sensing
diffusion
neural network
remote sensing image super-resolution
title Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
title_full Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
title_fullStr Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
title_full_unstemmed Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
title_short Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
title_sort taming a diffusion model to revitalize remote sensing image super resolution
topic diffusion
neural network
remote sensing image super-resolution
url https://www.mdpi.com/2072-4292/17/8/1348
work_keys_str_mv AT chaozhu tamingadiffusionmodeltorevitalizeremotesensingimagesuperresolution
AT yongliu tamingadiffusionmodeltorevitalizeremotesensingimagesuperresolution
AT shanhuang tamingadiffusionmodeltorevitalizeremotesensingimagesuperresolution
AT feiwang tamingadiffusionmodeltorevitalizeremotesensingimagesuperresolution