QSAR Models for Predicting the Antioxidant Potential of Chemical Substances

Antioxidants are widely studied compounds with significant applications in the nutraceutical and dietary industries. To enable the rapid screening of large libraries of substances for antioxidant activity and to provide a useful tool for the initial evaluation of substances of interest with unknown...

Full description

Saved in:
Bibliographic Details
Main Authors: Sofia Ghironi, Edoardo Luca Viganò, Gianluca Selvestrel, Emilio Benfenati
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Journal of Xenobiotics
Subjects:
Online Access:https://www.mdpi.com/2039-4713/15/3/80
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Antioxidants are widely studied compounds with significant applications in the nutraceutical and dietary industries. To enable the rapid screening of large libraries of substances for antioxidant activity and to provide a useful tool for the initial evaluation of substances of interest with unknown activity, we developed Quantitative Structure–Activity Relationship (QSAR) models to predict the antioxidant potential of chemical substances. We started from a dataset of 1911 antioxidant substances, retrieved from the AODB database by selecting the DPPH (1,1-diphenyl-2-picrylhydrazyl) radical scavenging activity assay and the experimental value of the half-maximal inhibitory concentration. Different machine learning algorithms were applied to build regression models, and the goodness-of-fit of each model was assessed using the statistical parameters of R squared (R<sup>2</sup>), the Root-Mean-Squared Error, and the Mean Absolute Error. The Extra Trees model outperformed the other models in both internal and external validations, achieving the highest R<sup>2</sup> of 0.77 and the lowest RMSE on the test set. Gradient Boosting and eXtreme Gradient Boosting also achieved promising results with R<sup>2</sup> values of 0.76 and 0.75, respectively. Given these results, we developed an integrated method that not only outperformed the individual models, achieving an R<sup>2</sup> of 0.78 on the external test set, but also provided valuable insights into the range of predicted values.
ISSN:2039-4705
2039-4713