DSAT: a dynamic sparse attention transformer for steel surface defect detection with hierarchical feature fusion
Abstract The rapid development of industrialization has led to a significant increase in the demand for steel, making the detection of surface defects in steel a critical challenge in industrial quality control. These defects exhibit diverse morphological characteristics and complex patterns, which...
Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-08-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-14935-8 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Abstract The rapid development of industrialization has led to a significant increase in the demand for steel, making the detection of surface defects in steel a critical challenge in industrial quality control. These defects exhibit diverse morphological characteristics and complex patterns, which pose substantial challenges to traditional detection models, particularly regarding multi-scale feature extraction and information retention across network depths. To address these limitations, we propose the Dynamic Sparse Attention Transformer (DSAT), a novel architecture that integrates two key innovations: (1) a Dynamic Sparse Attention (DSA) mechanism, which adaptively focuses on defect-salient regions while minimizing computational overhead; (2) an enhanced SPPF-GhostConv module, which combines Spatial Pyramid Pooling Fast with Ghost Convolution to achieve efficient hierarchical feature fusion. Extensive experimental evaluations on the NEU-DET and GC10-DE datasets demonstrate the superior performance of our approach. On the NEU-DET dataset, DSAT achieved an accuracy of 92.55%, with a mean Average Precision (mAP) of 83.14% at an IoU of 0.5 and 47.37% at an IoU of 0.5:0.95, significantly outperforming existing methods. Similar improvements were observed on the GC10-DE dataset, where the model attained an accuracy of 79.61% and a mAP of 67.27% at an IoU of 0.5, and a mAP of 34.09% at an IoU of 0.5:0.95. Comprehensive ablation studies validated the respective contributions of the DSA mechanism and the SPPF-GhostConv module, while visualization experiments demonstrated the model’s enhanced capability in detecting fine-grained defects. Overall, the proposed DSAT model exhibits significant advantages and potential in the field of steel surface defect detection. |
|---|---|
| ISSN: | 2045-2322 |