Explainable AutoML models for predicting the strength of high-performance concrete using Optuna, SHAP and ensemble learning

Accurately predicting key engineering properties, such as compressive and tensile strength, remains a significant challenge in high-performance concrete (HPC) due to its complex and heterogeneous composition. Early selection of optimal components and the development of reliable machine learning (ML)...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Salman Khan, Tianbo Peng, Muhammad Adeel Khan, Asad Khan, Mahmood Ahmad, Kamran Aziz, Mohanad Muayad Sabri Sabri, N. S. Abd EL-Gawaad
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Materials
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmats.2025.1542655/full
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurately predicting key engineering properties, such as compressive and tensile strength, remains a significant challenge in high-performance concrete (HPC) due to its complex and heterogeneous composition. Early selection of optimal components and the development of reliable machine learning (ML) models can significantly reduce the time and cost associated with extensive experimentation. This study introduces four explainable Automated Machine Learning (AutoML) models that integrate Optuna for hyperparameter optimization, SHapley Additive exPlanations (SHAP) for interpretability, and ensemble learning algorithms such as Random Forest (RF), Extreme Gradient Boosting (XGB), Light Gradient Boosting Machine (LGB), and Categorical Gradient Boosting (CB). The resulting interpretable AutoML models O-RF, O-XGB, O-LGB, and O-CB are applied to predict the compressive and tensile strengths of HPC. Compared to a baseline model from the literature, O-LGB achieved significant improvements in predictive performance. For compressive strength, it reduced the Mean Absolute Error (MAE) by 87.69% and the Root Mean Squared Error (RMSE) by 71.93%. For tensile strength, it achieved a 99.41% improvement in MAE and a 96.67% reduction in RMSE, along with increases in R2. Furthermore, SHAP analysis identified critical factors influencing strength, such as cement content, water, and age for compressive strength, and curing age, water-binder ratio, and water-cement ratio for tensile strength. This approach provides civil engineers with a robust and interpretable tool for optimizing HPC properties, reducing experimentation costs, and supporting enhanced decision-making in structural design, risk assessment, and other applications.
ISSN:2296-8016