A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks

In recent years, with the development of deep learning technologies, Vision Transformers combined with Convolutional Neural Networks (CNNs) have made significant progress in the field of single-image super-resolution (SISR). However, existing methods still face issues such as incomplete high-frequen...

Full description

Saved in:
Bibliographic Details
Main Authors: Tengyun Jing, Cuiyin Liu, Yuanshuai Chen
Format: Article
Language:English
Published: MDPI AG 2025-02-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/4/1806
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850238873590824960
author Tengyun Jing
Cuiyin Liu
Yuanshuai Chen
author_facet Tengyun Jing
Cuiyin Liu
Yuanshuai Chen
author_sort Tengyun Jing
collection DOAJ
description In recent years, with the development of deep learning technologies, Vision Transformers combined with Convolutional Neural Networks (CNNs) have made significant progress in the field of single-image super-resolution (SISR). However, existing methods still face issues such as incomplete high-frequency information reconstruction, training instability caused by residual connections, and insufficient cross-window information exchange. To address these problems and better leverage both local and global information, this paper proposes a super-resolution reconstruction network based on the Parallel Connection of Convolution and Swin Transformer Block (PCCSTB) to model the local and global features of an image. Specifically, through a parallel structure of channel feature-enhanced convolution and Swin Transformer, the network extracts, enhances, and fuses the local and global information. Additionally, this paper designs a fusion module to integrate the global and local information extracted by CNNs. The experimental results show that the proposed network effectively balances SR performance and network complexity, achieving good results in the lightweight SR domain. For instance, in the 4× super-resolution experiment on the Urban100 dataset, the network achieves an inference speed of 55 frames per second under the same device conditions, which is more than seven times as fast as the state-of-the-art network Shifted Window-based Image Restoration (SwinIR). Moreover, the network’s Peak Signal-to-Noise Ratio (PSNR) outperforms SwinIR by 0.29 dB at a 4× scale on the Set5 dataset, indicating that the network efficiently performs high-resolution image reconstruction.
format Article
id doaj-art-bbb149637f3d4bbc8217b0fae65ce4e4
institution OA Journals
issn 2076-3417
language English
publishDate 2025-02-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-bbb149637f3d4bbc8217b0fae65ce4e42025-08-20T02:01:20ZengMDPI AGApplied Sciences2076-34172025-02-01154180610.3390/app15041806A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer BlocksTengyun Jing0Cuiyin Liu1Yuanshuai Chen2Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650000, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650000, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650000, ChinaIn recent years, with the development of deep learning technologies, Vision Transformers combined with Convolutional Neural Networks (CNNs) have made significant progress in the field of single-image super-resolution (SISR). However, existing methods still face issues such as incomplete high-frequency information reconstruction, training instability caused by residual connections, and insufficient cross-window information exchange. To address these problems and better leverage both local and global information, this paper proposes a super-resolution reconstruction network based on the Parallel Connection of Convolution and Swin Transformer Block (PCCSTB) to model the local and global features of an image. Specifically, through a parallel structure of channel feature-enhanced convolution and Swin Transformer, the network extracts, enhances, and fuses the local and global information. Additionally, this paper designs a fusion module to integrate the global and local information extracted by CNNs. The experimental results show that the proposed network effectively balances SR performance and network complexity, achieving good results in the lightweight SR domain. For instance, in the 4× super-resolution experiment on the Urban100 dataset, the network achieves an inference speed of 55 frames per second under the same device conditions, which is more than seven times as fast as the state-of-the-art network Shifted Window-based Image Restoration (SwinIR). Moreover, the network’s Peak Signal-to-Noise Ratio (PSNR) outperforms SwinIR by 0.29 dB at a 4× scale on the Set5 dataset, indicating that the network efficiently performs high-resolution image reconstruction.https://www.mdpi.com/2076-3417/15/4/1806image super-resolutionCNNfeature extractfeature fusionSwin Transformer
spellingShingle Tengyun Jing
Cuiyin Liu
Yuanshuai Chen
A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks
Applied Sciences
image super-resolution
CNN
feature extract
feature fusion
Swin Transformer
title A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks
title_full A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks
title_fullStr A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks
title_full_unstemmed A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks
title_short A Lightweight Single-Image Super-Resolution Method Based on the Parallel Connection of Convolution and Swin Transformer Blocks
title_sort lightweight single image super resolution method based on the parallel connection of convolution and swin transformer blocks
topic image super-resolution
CNN
feature extract
feature fusion
Swin Transformer
url https://www.mdpi.com/2076-3417/15/4/1806
work_keys_str_mv AT tengyunjing alightweightsingleimagesuperresolutionmethodbasedontheparallelconnectionofconvolutionandswintransformerblocks
AT cuiyinliu alightweightsingleimagesuperresolutionmethodbasedontheparallelconnectionofconvolutionandswintransformerblocks
AT yuanshuaichen alightweightsingleimagesuperresolutionmethodbasedontheparallelconnectionofconvolutionandswintransformerblocks
AT tengyunjing lightweightsingleimagesuperresolutionmethodbasedontheparallelconnectionofconvolutionandswintransformerblocks
AT cuiyinliu lightweightsingleimagesuperresolutionmethodbasedontheparallelconnectionofconvolutionandswintransformerblocks
AT yuanshuaichen lightweightsingleimagesuperresolutionmethodbasedontheparallelconnectionofconvolutionandswintransformerblocks