A dual encoder network with multiscale feature fusion and multiple pooling channel spatial attention for skin scar image segmentation

Abstract Skin scar is a prevalent dermatological concern that impacts both aesthetic appearance and psychological well-being, making precise delineation of scar tissue essential for clinical treatment. To address the challenge of scar image segmentation, this study introduces an innovative deep lear...

Full description

Saved in:
Bibliographic Details
Main Authors: Weiyuan Yang, Xiaolin Wang, Guangwei Chen, Jianming Wen, Dexing Kong, Jianfeng Zhang, Xinyang Ge, Hao Xu, Jianhua Qin
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-05239-y
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Skin scar is a prevalent dermatological concern that impacts both aesthetic appearance and psychological well-being, making precise delineation of scar tissue essential for clinical treatment. To address the challenge of scar image segmentation, this study introduces an innovative deep learning framework integrating CNN and Swin Transformer architectures. The proposed model leverages a multi-scale feature fusion module to combine hierarchical representations from both backbones, while a novel multi-pooling channel-spatial attention mechanism enhances feature refinement during skip connections. Comprehensive experiments demonstrate the model’s superior performance in scar segmentation, achieving metrics of 96.01% Accuracy, 77.43% Precision, 90.17% Recall, 71.38% Jaccard Index, and 83.21% Dice Coefficient, which compare favorably with mainstream methods, and our model performs well in all metrics, highlighting its potential for clinical adoption in scar analysis.
ISSN:2045-2322