Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models

Abstract We evaluated the properties of tablets by artificial intelligence and machine learning computational approach with integration of optimizer. A large dataset on formulations properties and corresponding tablet disintegration time was collected and the models were used to fit the dataset. Uti...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohammed Ghazwani, Umme Hani
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-15996-5
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849736218824146944
author Mohammed Ghazwani
Umme Hani
author_facet Mohammed Ghazwani
Umme Hani
author_sort Mohammed Ghazwani
collection DOAJ
description Abstract We evaluated the properties of tablets by artificial intelligence and machine learning computational approach with integration of optimizer. A large dataset on formulations properties and corresponding tablet disintegration time was collected and the models were used to fit the dataset. Utilizing a dataset of approximately 2,000 entries encompassing molecular properties, physical properties, excipient composition, and formulation characteristics, three ML models were evaluated: TabNet, Radial Basis Function Support Vector Regression (RBF-SVR), and Neural Oblivious Decision Ensembles (NODE). Data preprocessing involved Min-Max normalization, outlier detection via Elliptic Envelope, and feature selection using Conditional Mutual Information, with hyperparameters optimized through the Water Cycle Algorithm. Performance was assessed using R², RMSE, and MAE across train, validation, and test sets, with 95% confidence intervals confirming robust predictions. NODE demonstrated great accuracy for fitting the data, with the highest calculated test R² (0.9805) and the lowest RMSE (7.078) and MAE (5.913), outperforming TabNet (R²: 0.9657, RMSE: 9.382, MAE: 7.299) and RBF-SVR (R²: 0.9652, RMSE: 9.452, MAE: 7.127). These findings highlight NODE’s efficacy in modeling complex data relationships, offering significant potential for optimizing tablet formulations in pharmaceutical research to design proper fast-disintegrating tablets.
format Article
id doaj-art-4a255ddcffbe4db891dcd812c55b1068
institution DOAJ
issn 2045-2322
language English
publishDate 2025-08-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-4a255ddcffbe4db891dcd812c55b10682025-08-20T03:07:20ZengNature PortfolioScientific Reports2045-23222025-08-0115111310.1038/s41598-025-15996-5Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer modelsMohammed Ghazwani0Umme Hani1Department of Pharmaceutics, College of Pharmacy, King Khalid UniversityDepartment of Pharmaceutics, College of Pharmacy, King Khalid UniversityAbstract We evaluated the properties of tablets by artificial intelligence and machine learning computational approach with integration of optimizer. A large dataset on formulations properties and corresponding tablet disintegration time was collected and the models were used to fit the dataset. Utilizing a dataset of approximately 2,000 entries encompassing molecular properties, physical properties, excipient composition, and formulation characteristics, three ML models were evaluated: TabNet, Radial Basis Function Support Vector Regression (RBF-SVR), and Neural Oblivious Decision Ensembles (NODE). Data preprocessing involved Min-Max normalization, outlier detection via Elliptic Envelope, and feature selection using Conditional Mutual Information, with hyperparameters optimized through the Water Cycle Algorithm. Performance was assessed using R², RMSE, and MAE across train, validation, and test sets, with 95% confidence intervals confirming robust predictions. NODE demonstrated great accuracy for fitting the data, with the highest calculated test R² (0.9805) and the lowest RMSE (7.078) and MAE (5.913), outperforming TabNet (R²: 0.9657, RMSE: 9.382, MAE: 7.299) and RBF-SVR (R²: 0.9652, RMSE: 9.452, MAE: 7.127). These findings highlight NODE’s efficacy in modeling complex data relationships, offering significant potential for optimizing tablet formulations in pharmaceutical research to design proper fast-disintegrating tablets.https://doi.org/10.1038/s41598-025-15996-5Tablet disintegrationPharmaceuticsMachine learningOptimizationModeling
spellingShingle Mohammed Ghazwani
Umme Hani
Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
Scientific Reports
Tablet disintegration
Pharmaceutics
Machine learning
Optimization
Modeling
title Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
title_full Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
title_fullStr Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
title_full_unstemmed Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
title_short Determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
title_sort determination of disintegration time using formulation data for solid dosage oral formulations via advanced machine learning integrated optimizer models
topic Tablet disintegration
Pharmaceutics
Machine learning
Optimization
Modeling
url https://doi.org/10.1038/s41598-025-15996-5
work_keys_str_mv AT mohammedghazwani determinationofdisintegrationtimeusingformulationdataforsoliddosageoralformulationsviaadvancedmachinelearningintegratedoptimizermodels
AT ummehani determinationofdisintegrationtimeusingformulationdataforsoliddosageoralformulationsviaadvancedmachinelearningintegratedoptimizermodels