A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process

Abhirvey Iyer,1 Sundaravalli Narayanaswami2 1Department of Pharmaceutical Engineering & Technology, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, India; 2Public Systems Group, Indian Institute of Management Ahmedabad, Ahmedabad, Gujarat, IndiaCorrespondence: Sundaravall...

Full description

Saved in:
Bibliographic Details
Main Authors: Iyer A, Narayanaswami S
Format: Article
Language:English
Published: Dove Medical Press 2025-01-01
Series:ClinicoEconomics and Outcomes Research
Subjects:
Online Access:https://www.dovepress.com/a-novel-model-using-ml-techniques-for-clinical-trial-design-and-expedi-peer-reviewed-fulltext-article-CEOR
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841526597355569152
author Iyer A
Narayanaswami S
author_facet Iyer A
Narayanaswami S
author_sort Iyer A
collection DOAJ
description Abhirvey Iyer,1 Sundaravalli Narayanaswami2 1Department of Pharmaceutical Engineering & Technology, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, India; 2Public Systems Group, Indian Institute of Management Ahmedabad, Ahmedabad, Gujarat, IndiaCorrespondence: Sundaravalli Narayanaswami, Email sundaravallin@iima.ac.inIntroduction: Clinical trials are critical for drug development and patient care; however, they often need more efficient trial design and patient enrolment processes. This research explores integrating machine learning (ML) techniques to address these challenges. Specifically, the study investigates ML models for two critical aspects: (1) streamlining clinical trial design parameters (like the site of drug action, type of Interventional/Observational model, etc) and (2) optimizing patient/volunteer enrolment for trials through efficient classification techniques.Methods: The study utilized two datasets: the first, with 55,000 samples (from ClinicalTrials.gov), was divided into five subsets (10,000– 15,000 rows each) for model evaluation, focusing on trial parameter optimization. The second dataset targeted patient eligibility classification (from the UCI ML Repository). Five ML models—XGBoost, Random Forest, Support Vector Classifier (SVC), Logistic Regression, and Decision Tree—were applied to both datasets, alongside Artificial Neural Networks (ANN) for the second dataset. Model performance was evaluated using precision, recall, balanced accuracy, ROC-AUC, and weighted F1-score, with results averaged across k-fold cross-validation.Results: In the first phase, XGBoost and Random Forest emerged as the best-performing models across all five subsets, achieving an average balanced accuracy of 0.71 and an average ROC-AUC of 0.7. The second dataset analysis revealed that while SVC and ANN performed well, ANN was preferred for its scalability to larger datasets. ANN achieved a test accuracy of 0.73714, demonstrating its potential for real-world implementation in patient streamlining.Discussion: The study highlights the effectiveness of ML models in improving clinical trial workflows. XGBoost and Random Forest demonstrated robust performance for large clinical datasets in optimizing trial parameters, while ANN proved advantageous for patient eligibility classification due to its scalability. These findings underscore the potential of ML to enhance decision-making, reduce delays, and improve the accuracy of clinical trial outcomes. As ML technology continues to evolve, its integration into clinical research could drive innovation and improve patient care.Keywords: clinical trials, site selection, machine learning, patient onboarding, feature engineering, feature encoding
format Article
id doaj-art-be2b8c48aa25467f866d9fe6979861ed
institution Kabale University
issn 1178-6981
language English
publishDate 2025-01-01
publisher Dove Medical Press
record_format Article
series ClinicoEconomics and Outcomes Research
spelling doaj-art-be2b8c48aa25467f866d9fe6979861ed2025-01-16T16:17:13ZengDove Medical PressClinicoEconomics and Outcomes Research1178-69812025-01-01Volume 1711899306A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding ProcessIyer ANarayanaswami SAbhirvey Iyer,1 Sundaravalli Narayanaswami2 1Department of Pharmaceutical Engineering & Technology, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, India; 2Public Systems Group, Indian Institute of Management Ahmedabad, Ahmedabad, Gujarat, IndiaCorrespondence: Sundaravalli Narayanaswami, Email sundaravallin@iima.ac.inIntroduction: Clinical trials are critical for drug development and patient care; however, they often need more efficient trial design and patient enrolment processes. This research explores integrating machine learning (ML) techniques to address these challenges. Specifically, the study investigates ML models for two critical aspects: (1) streamlining clinical trial design parameters (like the site of drug action, type of Interventional/Observational model, etc) and (2) optimizing patient/volunteer enrolment for trials through efficient classification techniques.Methods: The study utilized two datasets: the first, with 55,000 samples (from ClinicalTrials.gov), was divided into five subsets (10,000– 15,000 rows each) for model evaluation, focusing on trial parameter optimization. The second dataset targeted patient eligibility classification (from the UCI ML Repository). Five ML models—XGBoost, Random Forest, Support Vector Classifier (SVC), Logistic Regression, and Decision Tree—were applied to both datasets, alongside Artificial Neural Networks (ANN) for the second dataset. Model performance was evaluated using precision, recall, balanced accuracy, ROC-AUC, and weighted F1-score, with results averaged across k-fold cross-validation.Results: In the first phase, XGBoost and Random Forest emerged as the best-performing models across all five subsets, achieving an average balanced accuracy of 0.71 and an average ROC-AUC of 0.7. The second dataset analysis revealed that while SVC and ANN performed well, ANN was preferred for its scalability to larger datasets. ANN achieved a test accuracy of 0.73714, demonstrating its potential for real-world implementation in patient streamlining.Discussion: The study highlights the effectiveness of ML models in improving clinical trial workflows. XGBoost and Random Forest demonstrated robust performance for large clinical datasets in optimizing trial parameters, while ANN proved advantageous for patient eligibility classification due to its scalability. These findings underscore the potential of ML to enhance decision-making, reduce delays, and improve the accuracy of clinical trial outcomes. As ML technology continues to evolve, its integration into clinical research could drive innovation and improve patient care.Keywords: clinical trials, site selection, machine learning, patient onboarding, feature engineering, feature encodinghttps://www.dovepress.com/a-novel-model-using-ml-techniques-for-clinical-trial-design-and-expedi-peer-reviewed-fulltext-article-CEORclinical trialssite selectionmachine learningpatient onboardingfeature engineeringfeature encoding
spellingShingle Iyer A
Narayanaswami S
A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process
ClinicoEconomics and Outcomes Research
clinical trials
site selection
machine learning
patient onboarding
feature engineering
feature encoding
title A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process
title_full A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process
title_fullStr A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process
title_full_unstemmed A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process
title_short A Novel Model Using ML Techniques for Clinical Trial Design and Expedited Patient Onboarding Process
title_sort novel model using ml techniques for clinical trial design and expedited patient onboarding process
topic clinical trials
site selection
machine learning
patient onboarding
feature engineering
feature encoding
url https://www.dovepress.com/a-novel-model-using-ml-techniques-for-clinical-trial-design-and-expedi-peer-reviewed-fulltext-article-CEOR
work_keys_str_mv AT iyera anovelmodelusingmltechniquesforclinicaltrialdesignandexpeditedpatientonboardingprocess
AT narayanaswamis anovelmodelusingmltechniquesforclinicaltrialdesignandexpeditedpatientonboardingprocess
AT iyera novelmodelusingmltechniquesforclinicaltrialdesignandexpeditedpatientonboardingprocess
AT narayanaswamis novelmodelusingmltechniquesforclinicaltrialdesignandexpeditedpatientonboardingprocess