XTNSR: Xception-based transformer network for single image super resolution

Abstract Single image super resolution has significantly advanced by utilizing transformers-based deep learning algorithms. However, challenges still need to be addressed in handling grid-like image patches with higher computational demands and addressing issues like over-smoothing in visual patches...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jagrati Talreja, Supavadee Aramvith, Takao Onoye
Format:	Article
Language:	English
Published:	Springer 2025-01-01
Series:	Complex & Intelligent Systems
Subjects:	Single image super-resolution Local feature window transformer block Multi-layer feature fusion block Xception block
Online Access:	https://doi.org/10.1007/s40747-024-01760-1
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1823861498100842496
author	Jagrati Talreja Supavadee Aramvith Takao Onoye
author_facet	Jagrati Talreja Supavadee Aramvith Takao Onoye
author_sort	Jagrati Talreja
collection	DOAJ
description	Abstract Single image super resolution has significantly advanced by utilizing transformers-based deep learning algorithms. However, challenges still need to be addressed in handling grid-like image patches with higher computational demands and addressing issues like over-smoothing in visual patches. This paper presents a Deep Learning model for single-image super-resolution. In this paper, we present the XTNSR model, a novel multi-path network architecture that combines Local feature window transformers (LWFT) with Xception blocks for single-image super-resolution. The model processes grid-like image patches effectively and reduces computational complexity by integrating a Patch Embedding layer. Whereas the Xception blocks use depth-wise separable convolutions for hierarchical feature extraction, the LWFT blocks capture long-range dependencies and fine-grained qualities. A multi-layer feature fusion block with skip connections, part of this hybrid architecture, guarantees efficient local and global feature fusion. The experimental results show better performance in Peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and visual quality than the state-of-the-art techniques. By optimizing parameters, the suggested architecture also lowers computational complexity. Overall, the architecture presents a promising approach for advancing image super-resolution capabilities.
format	Article
id	doaj-art-bd4a1555b99d4f6ebd200c1d6f318ee9
institution	Kabale University
issn	2199-4536 2198-6053
language	English
publishDate	2025-01-01
publisher	Springer
record_format	Article
series	Complex & Intelligent Systems
spelling	doaj-art-bd4a1555b99d4f6ebd200c1d6f318ee92025-02-09T13:01:16ZengSpringerComplex & Intelligent Systems2199-45362198-60532025-01-0111212510.1007/s40747-024-01760-1XTNSR: Xception-based transformer network for single image super resolutionJagrati Talreja0Supavadee Aramvith1Takao Onoye2Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn UniversityMultimedia Data Analytics and Processing Unit, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn UniversityGraduate School of Information Science and Technology, Osaka UniversityAbstract Single image super resolution has significantly advanced by utilizing transformers-based deep learning algorithms. However, challenges still need to be addressed in handling grid-like image patches with higher computational demands and addressing issues like over-smoothing in visual patches. This paper presents a Deep Learning model for single-image super-resolution. In this paper, we present the XTNSR model, a novel multi-path network architecture that combines Local feature window transformers (LWFT) with Xception blocks for single-image super-resolution. The model processes grid-like image patches effectively and reduces computational complexity by integrating a Patch Embedding layer. Whereas the Xception blocks use depth-wise separable convolutions for hierarchical feature extraction, the LWFT blocks capture long-range dependencies and fine-grained qualities. A multi-layer feature fusion block with skip connections, part of this hybrid architecture, guarantees efficient local and global feature fusion. The experimental results show better performance in Peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and visual quality than the state-of-the-art techniques. By optimizing parameters, the suggested architecture also lowers computational complexity. Overall, the architecture presents a promising approach for advancing image super-resolution capabilities.https://doi.org/10.1007/s40747-024-01760-1Single image super-resolutionLocal feature window transformer blockMulti-layer feature fusion blockXception block
spellingShingle	Jagrati Talreja Supavadee Aramvith Takao Onoye XTNSR: Xception-based transformer network for single image super resolution Complex & Intelligent Systems Single image super-resolution Local feature window transformer block Multi-layer feature fusion block Xception block
title	XTNSR: Xception-based transformer network for single image super resolution
title_full	XTNSR: Xception-based transformer network for single image super resolution
title_fullStr	XTNSR: Xception-based transformer network for single image super resolution
title_full_unstemmed	XTNSR: Xception-based transformer network for single image super resolution
title_short	XTNSR: Xception-based transformer network for single image super resolution
title_sort	xtnsr xception based transformer network for single image super resolution
topic	Single image super-resolution Local feature window transformer block Multi-layer feature fusion block Xception block
url	https://doi.org/10.1007/s40747-024-01760-1
work_keys_str_mv	AT jagratitalreja xtnsrxceptionbasedtransformernetworkforsingleimagesuperresolution AT supavadeearamvith xtnsrxceptionbasedtransformernetworkforsingleimagesuperresolution AT takaoonoye xtnsrxceptionbasedtransformernetworkforsingleimagesuperresolution

XTNSR: Xception-based transformer network for single image super resolution

Similar Items