Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization

Abstract This study investigates the use of machine learning models to predict solubility of rivaroxaban in binary solvents based on temperature (T), mass fraction (w), and solvent type. Using a dataset with over 250 data points and including solvents encoded with one-hot encoding, four models were...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohammed Alqarni, Ali Alqarni
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-84741-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594727194591232
author Mohammed Alqarni
Ali Alqarni
author_facet Mohammed Alqarni
Ali Alqarni
author_sort Mohammed Alqarni
collection DOAJ
description Abstract This study investigates the use of machine learning models to predict solubility of rivaroxaban in binary solvents based on temperature (T), mass fraction (w), and solvent type. Using a dataset with over 250 data points and including solvents encoded with one-hot encoding, four models were compared: Gradient Boosting (GB), Light Gradient Boosting (LGB), Extra Trees (ET), and Random Forest (RF). The Jellyfish Optimizer (JO) algorithm was applied to tune hyperparameters, enhancing model performance. The LGB model achieved the best results, with an R2 of 0.988 on the test set and low error rates (RMSE of 9.1284E-05 and MAE of 5.85322E-05), surpassing other models in predictive accuracy and generalizability. Parity plots confirmed the LGB model’s close alignment between predicted and actual solubility values, highlighting its robust performance. Furthermore, 3D surface plots and partial effect plots demonstrated LGB’s capacity to model solubility across different solvent systems, capturing complex interactions between T, w, and solvent effects. Finally, the LGB model predicted maximum solubility at a temperature of 305.76 K and a mass fraction of 0.753 in a dichloromethane + methanol mixture, providing valuable insights for solubility optimization in solvent selection. This work underscores the effectiveness of the LGB model for solubility prediction, with potential applications in formulation and experimental planning.
format Article
id doaj-art-e52899eeadcf4ef3b619e46afd8666e4
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-e52899eeadcf4ef3b619e46afd8666e42025-01-19T12:24:35ZengNature PortfolioScientific Reports2045-23222025-01-0115111210.1038/s41598-024-84741-1Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallizationMohammed Alqarni0Ali Alqarni1Department of Pharmaceutical Chemistry, College of Pharmacy, Taif UniversityDepartment of Oral & Maxillofacial Surgery and Diagnostic Sciences, Faculty of Dentistry, Taif UniversityAbstract This study investigates the use of machine learning models to predict solubility of rivaroxaban in binary solvents based on temperature (T), mass fraction (w), and solvent type. Using a dataset with over 250 data points and including solvents encoded with one-hot encoding, four models were compared: Gradient Boosting (GB), Light Gradient Boosting (LGB), Extra Trees (ET), and Random Forest (RF). The Jellyfish Optimizer (JO) algorithm was applied to tune hyperparameters, enhancing model performance. The LGB model achieved the best results, with an R2 of 0.988 on the test set and low error rates (RMSE of 9.1284E-05 and MAE of 5.85322E-05), surpassing other models in predictive accuracy and generalizability. Parity plots confirmed the LGB model’s close alignment between predicted and actual solubility values, highlighting its robust performance. Furthermore, 3D surface plots and partial effect plots demonstrated LGB’s capacity to model solubility across different solvent systems, capturing complex interactions between T, w, and solvent effects. Finally, the LGB model predicted maximum solubility at a temperature of 305.76 K and a mass fraction of 0.753 in a dichloromethane + methanol mixture, providing valuable insights for solubility optimization in solvent selection. This work underscores the effectiveness of the LGB model for solubility prediction, with potential applications in formulation and experimental planning.https://doi.org/10.1038/s41598-024-84741-1Machine learningDrug solubilityCrystallizationRivaroxaban
spellingShingle Mohammed Alqarni
Ali Alqarni
Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
Scientific Reports
Machine learning
Drug solubility
Crystallization
Rivaroxaban
title Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
title_full Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
title_fullStr Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
title_full_unstemmed Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
title_short Machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
title_sort machine learning analysis of rivaroxaban solubility in mixed solvents for application in pharmaceutical crystallization
topic Machine learning
Drug solubility
Crystallization
Rivaroxaban
url https://doi.org/10.1038/s41598-024-84741-1
work_keys_str_mv AT mohammedalqarni machinelearninganalysisofrivaroxabansolubilityinmixedsolventsforapplicationinpharmaceuticalcrystallization
AT alialqarni machinelearninganalysisofrivaroxabansolubilityinmixedsolventsforapplicationinpharmaceuticalcrystallization