SCARS-LOGISTIC: A novel variable selection approach for binary classification model to identify the significant determinants of sexually transmitted infections.

Variable selection methods are very popular, especially in the field of big data with large predictors. These procedures improve the accuracy and performance of the model by eliminating irrelevant and redundant variables. The main contribution of this study is to couple a logit model with a novel va...

Full description

Saved in:
Bibliographic Details
Main Authors: Maryam Sadiq, Nasser A Alsadhan, Ramla Shah, Sidra Younas, Zahid Rasheed
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0324395
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Variable selection methods are very popular, especially in the field of big data with large predictors. These procedures improve the accuracy and performance of the model by eliminating irrelevant and redundant variables. The main contribution of this study is to couple a logit model with a novel variable selection approach, "Stability Competitive Adaptive Re-weighted Sampling" to address binary response. The efficiency of the proposed method is compared with the traditional logistic regression model based on eight model assessment criteria over real data from sexually transmitted infections in Indian men. Due to higher stability, the proposed method outperformed having a lower Akaike information criterion, and the Bayesian information criterion, as well as higher R-squared measures. The finally selected proposed model identified essential information regarding sexually transmitted infections in India for policymakers.
ISSN:1932-6203