Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework

Early identification of concrete cracks and multi-class detection can help to avoid future deformation or collapse in concrete structures. Available traditional detection and methodologies require enormous effort and time. To overcome such difficulties, current vision-based deep learning models can...

Full description

Saved in:
Bibliographic Details
Main Authors: Ali Mahmoud Mayya, Nizar Faisal Alkayem
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/24/24/8095
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850037041166811136
author Ali Mahmoud Mayya
Nizar Faisal Alkayem
author_facet Ali Mahmoud Mayya
Nizar Faisal Alkayem
author_sort Ali Mahmoud Mayya
collection DOAJ
description Early identification of concrete cracks and multi-class detection can help to avoid future deformation or collapse in concrete structures. Available traditional detection and methodologies require enormous effort and time. To overcome such difficulties, current vision-based deep learning models can effectively detect and classify various concrete cracks. This study introduces a novel multi-stage deep learning framework for crack detection and type classification. First, the recently developed YOLOV10 model is trained to detect possible defective regions in concrete images. After that, a modified vision transformer (ViT) model is trained to classify concrete images into three main types: normal, simple cracks, and multi-branched cracks. The evaluation process includes feeding concrete test images into the trained YOLOV10 model, identifying the possible defect regions, and finally delivering the detected regions into the trained ViT model, which decides the appropriate crack type of those detected regions. Experiments are conducted using the individual ViT model and the proposed multi-stage framework. To improve the generation ability, multi-source datasets of concrete structures are used. For the classification part, a concrete crack dataset consisting of 12,000 images of three classes is utilized, while for the detection part, a dataset composed of various materials from historical buildings containing 1116 concrete images with their corresponding bounding boxes, is utilized. Results prove that the proposed multi-stage model accurately classifies crack types with 90.67% precision, 90.03% recall, and 90.34% F1-score. The results also show that the proposed model outperforms the individual classification model by 10.9%, 19.99%, and 19.2% for precision, recall, and F1-score, respectively. The proposed multi-stage YOLOV10-ViT model can be integrated into the construction systems which are based on crack materials to obtain early warning of possible future deformation in concrete structures.
format Article
id doaj-art-912bed01a564401eabcdca67c4a7f814
institution DOAJ
issn 1424-8220
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-912bed01a564401eabcdca67c4a7f8142025-08-20T02:56:58ZengMDPI AGSensors1424-82202024-12-012424809510.3390/s24248095Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT FrameworkAli Mahmoud Mayya0Nizar Faisal Alkayem1Computer and Automatic Control Engineering Department, Faculty of Mechanical and Electrical Engineering, Tishreen University, Lattakia 2230, SyriaCollege of Automation and College of Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210046, ChinaEarly identification of concrete cracks and multi-class detection can help to avoid future deformation or collapse in concrete structures. Available traditional detection and methodologies require enormous effort and time. To overcome such difficulties, current vision-based deep learning models can effectively detect and classify various concrete cracks. This study introduces a novel multi-stage deep learning framework for crack detection and type classification. First, the recently developed YOLOV10 model is trained to detect possible defective regions in concrete images. After that, a modified vision transformer (ViT) model is trained to classify concrete images into three main types: normal, simple cracks, and multi-branched cracks. The evaluation process includes feeding concrete test images into the trained YOLOV10 model, identifying the possible defect regions, and finally delivering the detected regions into the trained ViT model, which decides the appropriate crack type of those detected regions. Experiments are conducted using the individual ViT model and the proposed multi-stage framework. To improve the generation ability, multi-source datasets of concrete structures are used. For the classification part, a concrete crack dataset consisting of 12,000 images of three classes is utilized, while for the detection part, a dataset composed of various materials from historical buildings containing 1116 concrete images with their corresponding bounding boxes, is utilized. Results prove that the proposed multi-stage model accurately classifies crack types with 90.67% precision, 90.03% recall, and 90.34% F1-score. The results also show that the proposed model outperforms the individual classification model by 10.9%, 19.99%, and 19.2% for precision, recall, and F1-score, respectively. The proposed multi-stage YOLOV10-ViT model can be integrated into the construction systems which are based on crack materials to obtain early warning of possible future deformation in concrete structures.https://www.mdpi.com/1424-8220/24/24/8095crack detectioncrack classificationdeep learningYOLOV10vision transformermulti-stage model
spellingShingle Ali Mahmoud Mayya
Nizar Faisal Alkayem
Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework
Sensors
crack detection
crack classification
deep learning
YOLOV10
vision transformer
multi-stage model
title Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework
title_full Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework
title_fullStr Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework
title_full_unstemmed Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework
title_short Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework
title_sort enhance the concrete crack classification based on a novel multi stage yolov10 vit framework
topic crack detection
crack classification
deep learning
YOLOV10
vision transformer
multi-stage model
url https://www.mdpi.com/1424-8220/24/24/8095
work_keys_str_mv AT alimahmoudmayya enhancetheconcretecrackclassificationbasedonanovelmultistageyolov10vitframework
AT nizarfaisalalkayem enhancetheconcretecrackclassificationbasedonanovelmultistageyolov10vitframework