Predicting Software Perfection Through Advanced Models to Uncover and Prevent Defects

Software defect prediction is a critical task in software engineering, enabling organizations to proactively identify and address potential issues in software systems, thereby improving quality and reducing costs. In this study, we evaluated and compared various machine learning models, including lo...

Full description

Saved in:
Bibliographic Details
Main Authors: Tariq Shahzad, Sunawar Khan, Tehseen Mazhar, Wasim Ahmad, Khmaies Ouahada, Habib Hamam
Format: Article
Language:English
Published: Wiley 2025-01-01
Series:IET Software
Online Access:http://dx.doi.org/10.1049/sfw2/8832164
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Software defect prediction is a critical task in software engineering, enabling organizations to proactively identify and address potential issues in software systems, thereby improving quality and reducing costs. In this study, we evaluated and compared various machine learning models, including logistic regression (LR), random forest (RF), support vector machines (SVMs), convolutional neural networks (CNNs), and eXtreme Gradient Boosting (XGBoost), for software defect prediction using a combination of diverse datasets. The models were trained and tested on preprocessed and feature-selected data, followed by optimization through hyperparameter tuning. Performance evaluation metrics were employed to analyze the results comprehensively, including classification reports, confusion matrices, receiver operating characteristic–area under the curve (ROC-AUC) curves, precision–recall curves, and cumulative gain charts. The results revealed that XGBoost consistently outperformed other models, achieving the highest accuracy, precision, recall, and AUC scores across all metrics. This indicates its robustness and suitability for predicting software defects in real-world applications.
ISSN:1751-8814