An explainable deep learning model for diabetic foot ulcer classification using swin transformer and efficient multi-scale attention-driven network

Bibliographic Details
Main Authors: R. Karthik, Armaano Ajay, Anshika Jhalani, Kruthik Ballari, Suganthi K
Format: Article
Language: English
Published: Nature Portfolio 2025-02-01
Series: Scientific Reports
Subjects: Swin transformer; Diabetic foot ulcer; CNN; Deep learning; Shuffle attention
Online Access: https://doi.org/10.1038/s41598-025-87519-1
Abstract: Diabetic Foot Ulcer (DFU) is a severe complication of diabetes mellitus that imposes significant health and socio-economic burdens on affected individuals. Severe cases can lead to lower-limb amputation, and diagnosis is a complex and costly process for medical professionals. Manual identification of DFUs is particularly difficult because of their diverse visual characteristics, so many cases go undiagnosed. Deep Learning (DL) methods offer an efficient, automated approach that can support timely treatment and improve patient outcomes. This research proposes a novel feature-fusion model with two parallel tracks for feature extraction. The first track uses the Swin transformer, which captures long-range dependencies through shifted windows and self-attention. The second track is the Efficient Multi-Scale Attention-Driven Network (EMADN), which uses Light-weight Multi-scale Deformable Shuffle (LMDS) and Global Dilated Attention (GDA) blocks to extract local features efficiently; these blocks dynamically adjust kernel sizes and apply attention modules. To the best of our knowledge, this is the first work to report a dual-track architecture for DFU classification that combines the Swin transformer and EMADN. The feature maps from both networks are concatenated and refined with shuffle attention at a reduced computational cost. The work also incorporates Grad-CAM-based Explainable Artificial Intelligence (XAI) to visualize and interpret the network's decisions. On the DFUC-2021 dataset, the proposed model achieved an accuracy of 78.79% and a macro F1-score of 80%, surpassing existing works and pre-trained CNN architectures.
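The abstract reports both accuracy and a macro F1-score. As a minimal sketch of how these two metrics are commonly computed for a multi-class classifier, the snippet below uses plain Python on a toy example. The labels `"a"`, `"b"`, `"c"` and the function names are hypothetical placeholders, not taken from the paper or from DFUC-2021, and this is not the authors' evaluation code.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    scores = []
    for c in labels:
        denom = 2 * tp[c] + fp[c] + fn[c]
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy, made-up labels (hypothetical; not DFUC-2021 data).
y_true = ["a", "a", "a", "a", "b", "b", "c", "c"]
y_pred = ["a", "a", "b", "c", "b", "b", "c", "c"]

acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
print(f"accuracy={acc:.4f} macro_f1={f1:.4f}")
```

Because macro F1 weights every class equally regardless of its size, it can come out higher or lower than accuracy on an imbalanced dataset; in this toy example it is slightly higher.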
ISSN: 2045-2322
Author Affiliations:
R. Karthik: Centre for Cyber Physical Systems, Vellore Institute of Technology
Armaano Ajay: School of Computer Science and Engineering, Vellore Institute of Technology
Anshika Jhalani: School of Electronics and Engineering, Vellore Institute of Technology
Kruthik Ballari: School of Computer Science and Engineering, Vellore Institute of Technology
Suganthi K: School of Electronics and Engineering, Vellore Institute of Technology