Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures
Abstract Analysis of small-molecule drug solubility in binary solvents at different temperatures was carried out via several machine learning models and integration of models to optimize. We investigated the solubility of rivaroxaban in both dichloromethane and a variety of primary alcohols at vario...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-08-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-13090-4 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849763530619748352 |
|---|---|
| author | Mohammed Alqarni Ali Alqarni |
| author_facet | Mohammed Alqarni Ali Alqarni |
| author_sort | Mohammed Alqarni |
| collection | DOAJ |
| description | Abstract Analysis of small-molecule drug solubility in binary solvents at different temperatures was carried out via several machine learning models and integration of models to optimize. We investigated the solubility of rivaroxaban in both dichloromethane and a variety of primary alcohols at various temperatures and concentrations of solvents to understand its behavior in mixed solvents. Given the complex, non-linear patterns in solubility behavior, three advanced regression approaches were utilized: Polynomial Curve Fitting, a Bayesian-based Neural Network (BNN), and the Neural Oblivious Decision Ensemble (NODE) method. To optimize model performance, hyperparameters were fine-tuned using the Stochastic Fractal Search (SFS) algorithm. Among the tested models, BNN obtained the best precision for fitting, with a test R² of 0.9926 and a MSE of 3.07 × 10⁻⁸, proving outstanding accuracy in fitting the rivaroxaban data. The NODE model followed BNN, showing a test R² of 0.9413 and the lowest MAPE of 0.1835. The Polynomial model yielded a lower test R² of 0.8200 and higher error rates, indicating its limitations in unravelling the underlying relationships for the solubility variations. This study shows that advanced machine learning models, particularly BNN and NODE, can predict pharmaceutical solubility and improve crystallization process design and optimization. |
| format | Article |
| id | doaj-art-edcc81a8a2544be8b6aa577de09c7b45 |
| institution | DOAJ |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-08-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-edcc81a8a2544be8b6aa577de09c7b452025-08-20T03:05:22ZengNature PortfolioScientific Reports2045-23222025-08-0115111310.1038/s41598-025-13090-4Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperaturesMohammed Alqarni0Ali Alqarni1Department of Pharmaceutical Chemistry, College of Pharmacy, Taif UniversityDepartment of Oral & Maxillofacial Surgery and Diagnostic Sciences, Faculty of Dentistry, Taif UniversityAbstract Analysis of small-molecule drug solubility in binary solvents at different temperatures was carried out via several machine learning models and integration of models to optimize. We investigated the solubility of rivaroxaban in both dichloromethane and a variety of primary alcohols at various temperatures and concentrations of solvents to understand its behavior in mixed solvents. Given the complex, non-linear patterns in solubility behavior, three advanced regression approaches were utilized: Polynomial Curve Fitting, a Bayesian-based Neural Network (BNN), and the Neural Oblivious Decision Ensemble (NODE) method. To optimize model performance, hyperparameters were fine-tuned using the Stochastic Fractal Search (SFS) algorithm. Among the tested models, BNN obtained the best precision for fitting, with a test R² of 0.9926 and a MSE of 3.07 × 10⁻⁸, proving outstanding accuracy in fitting the rivaroxaban data. The NODE model followed BNN, showing a test R² of 0.9413 and the lowest MAPE of 0.1835. The Polynomial model yielded a lower test R² of 0.8200 and higher error rates, indicating its limitations in unravelling the underlying relationships for the solubility variations. This study shows that advanced machine learning models, particularly BNN and NODE, can predict pharmaceutical solubility and improve crystallization process design and optimization.https://doi.org/10.1038/s41598-025-13090-4Drug solubilityPolynomial regressionBayesian neural networkNeural oblivious decision ensembleCrystallization process |
| spellingShingle | Mohammed Alqarni Ali Alqarni Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures Scientific Reports Drug solubility Polynomial regression Bayesian neural network Neural oblivious decision ensemble Crystallization process |
| title | Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures |
| title_full | Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures |
| title_fullStr | Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures |
| title_full_unstemmed | Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures |
| title_short | Development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures |
| title_sort | development of several machine learning based models for determination of small molecule pharmaceutical solubility in binary solvents at different temperatures |
| topic | Drug solubility Polynomial regression Bayesian neural network Neural oblivious decision ensemble Crystallization process |
| url | https://doi.org/10.1038/s41598-025-13090-4 |
| work_keys_str_mv | AT mohammedalqarni developmentofseveralmachinelearningbasedmodelsfordeterminationofsmallmoleculepharmaceuticalsolubilityinbinarysolventsatdifferenttemperatures AT alialqarni developmentofseveralmachinelearningbasedmodelsfordeterminationofsmallmoleculepharmaceuticalsolubilityinbinarysolventsatdifferenttemperatures |