Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
Abstract This study investigates the use of machine learning models to predict solubility of rivaroxaban in binary solvents based on temperature (T), mass fraction (w), and solvent type. Using a dataset with over 250 data points and including solvents encoded with one-hot encoding, four models were...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2025-01-01
|
Series: | Scientific Reports |
Subjects: | |
Online Access: | https://doi.org/10.1038/s41598-024-84741-1 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832594727194591232 |
---|---|
author | Mohammed Alqarni Ali Alqarni |
author_facet | Mohammed Alqarni Ali Alqarni |
author_sort | Mohammed Alqarni |
collection | DOAJ |
description | Abstract This study investigates the use of machine learning models to predict solubility of rivaroxaban in binary solvents based on temperature (T), mass fraction (w), and solvent type. Using a dataset with over 250 data points and including solvents encoded with one-hot encoding, four models were compared: Gradient Boosting (GB), Light Gradient Boosting (LGB), Extra Trees (ET), and Random Forest (RF). The Jellyfish Optimizer (JO) algorithm was applied to tune hyperparameters, enhancing model performance. The LGB model achieved the best results, with an R2 of 0.988 on the test set and low error rates (RMSE of 9.1284E-05 and MAE of 5.85322E-05), surpassing other models in predictive accuracy and generalizability. Parity plots confirmed the LGB model’s close alignment between predicted and actual solubility values, highlighting its robust performance. Furthermore, 3D surface plots and partial effect plots demonstrated LGB’s capacity to model solubility across different solvent systems, capturing complex interactions between T, w, and solvent effects. Finally, the LGB model predicted maximum solubility at a temperature of 305.76 K and a mass fraction of 0.753 in a dichloromethane + methanol mixture, providing valuable insights for solubility optimization in solvent selection. This work underscores the effectiveness of the LGB model for solubility prediction, with potential applications in formulation and experimental planning. |
format | Article |
id | doaj-art-e52899eeadcf4ef3b619e46afd8666e4 |
institution | Kabale University |
issn | 2045-2322 |
language | English |
publishDate | 2025-01-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj-art-e52899eeadcf4ef3b619e46afd8666e42025-01-19T12:24:35ZengNature PortfolioScientific Reports2045-23222025-01-0115111210.1038/s41598-024-84741-1Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallizationMohammed Alqarni0Ali Alqarni1Department of Pharmaceutical Chemistry, College of Pharmacy, Taif UniversityDepartment of Oral & Maxillofacial Surgery and Diagnostic Sciences, Faculty of Dentistry, Taif UniversityAbstract This study investigates the use of machine learning models to predict solubility of rivaroxaban in binary solvents based on temperature (T), mass fraction (w), and solvent type. Using a dataset with over 250 data points and including solvents encoded with one-hot encoding, four models were compared: Gradient Boosting (GB), Light Gradient Boosting (LGB), Extra Trees (ET), and Random Forest (RF). The Jellyfish Optimizer (JO) algorithm was applied to tune hyperparameters, enhancing model performance. The LGB model achieved the best results, with an R2 of 0.988 on the test set and low error rates (RMSE of 9.1284E-05 and MAE of 5.85322E-05), surpassing other models in predictive accuracy and generalizability. Parity plots confirmed the LGB model’s close alignment between predicted and actual solubility values, highlighting its robust performance. Furthermore, 3D surface plots and partial effect plots demonstrated LGB’s capacity to model solubility across different solvent systems, capturing complex interactions between T, w, and solvent effects. Finally, the LGB model predicted maximum solubility at a temperature of 305.76 K and a mass fraction of 0.753 in a dichloromethane + methanol mixture, providing valuable insights for solubility optimization in solvent selection. This work underscores the effectiveness of the LGB model for solubility prediction, with potential applications in formulation and experimental planning.https://doi.org/10.1038/s41598-024-84741-1Machine learningDrug solubilityCrystallizationRivaroxaban |
spellingShingle | Mohammed Alqarni Ali Alqarni Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization Scientific Reports Machine learning Drug solubility Crystallization Rivaroxaban |
title | Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization |
title_full | Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization |
title_fullStr | Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization |
title_full_unstemmed | Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization |
title_short | Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization |
title_sort | machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization |
topic | Machine learning Drug solubility Crystallization Rivaroxaban |
url | https://doi.org/10.1038/s41598-024-84741-1 |
work_keys_str_mv | AT mohammedalqarni machinelearninganalysisofrivaroxabansolubilityinmixedsolventsforapplicationinpharmaceuticalcrystallization AT alialqarni machinelearninganalysisofrivaroxabansolubilityinmixedsolventsforapplicationinpharmaceuticalcrystallization |