SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence

PolyCystic Ovarian Syndrome (PCOS) poses significant challenges to women's reproductive health due to its diagnostic complexity arising from a variety of symptoms, including hirsutism, anovulation, pain, obesity, hyperandrogenism, and oligomenorrhea, necessitating multiple clinical tests. Lever...

Full description

Saved in:
Bibliographic Details
Main Authors: Umaa Mahesswari G, Uma Maheswari P
Format: Article
Language:English
Published: Elsevier 2024-10-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S240584402415236X
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850194121867657216
author Umaa Mahesswari G
Uma Maheswari P
author_facet Umaa Mahesswari G
Uma Maheswari P
author_sort Umaa Mahesswari G
collection DOAJ
description PolyCystic Ovarian Syndrome (PCOS) poses significant challenges to women's reproductive health due to its diagnostic complexity arising from a variety of symptoms, including hirsutism, anovulation, pain, obesity, hyperandrogenism, and oligomenorrhea, necessitating multiple clinical tests. Leveraging Artificial Intelligence (AI) in healthcare offers several benefits that can significantly impact patient care, streamline operations, and improve medical outcomes overall. This study presents an Explainable Artificial Intelligence (XAI)-driven PCOS smart predictor, structured as a hierarchical ensemble consisting of two tiers of Random Forest classifiers following extensive analysis of seven conventional classifiers and two additional stacking ensemble classifiers. An open-source data set comprising numerical parametric features linked to PCOS for classifier training was used. Moreover, to identify essential features for PCOS prediction three feature selection methods: Threshold-driven Optimized Principal Component Analysis (TOPCA), Optimized Salp Swarm (OSSM), and Threshold-driven Optimized Mutual Information Method (TOMIM) were fine-tuned through thresholding and improvisation to detect diverse attribute sets with varying numbers and combinations. Notably, the two-level Random Forest classifier model outperformed others with a remarkable 99.31 % accuracy by employing the top 17 features selected through the Threshold-driven Optimized Mutual Information Method (TOMIM) along with anoverallaccuracy of 99.32 % with 8 fold cross validation for 25 runs. The Smart predictor, constructed using Shapash - a Python library for Explainable Artificial Intelligence - was utilized to deploy the two-level Random Forest classifier model. Ensuring transparency and result reliability, visualizations from robust Explainable AI libraries were employed at different prediction stages for all considered classifiers in this study.
format Article
id doaj-art-b8f2af9a55ca47c4a870e8a65c43501a
institution OA Journals
issn 2405-8440
language English
publishDate 2024-10-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj-art-b8f2af9a55ca47c4a870e8a65c43501a2025-08-20T02:14:03ZengElsevierHeliyon2405-84402024-10-011020e3920510.1016/j.heliyon.2024.e39205SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial IntelligenceUmaa Mahesswari G0Uma Maheswari P1Corresponding author.; Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai, 600025, Tamil Nadu, IndiaDepartment of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai, 600025, Tamil Nadu, IndiaPolyCystic Ovarian Syndrome (PCOS) poses significant challenges to women's reproductive health due to its diagnostic complexity arising from a variety of symptoms, including hirsutism, anovulation, pain, obesity, hyperandrogenism, and oligomenorrhea, necessitating multiple clinical tests. Leveraging Artificial Intelligence (AI) in healthcare offers several benefits that can significantly impact patient care, streamline operations, and improve medical outcomes overall. This study presents an Explainable Artificial Intelligence (XAI)-driven PCOS smart predictor, structured as a hierarchical ensemble consisting of two tiers of Random Forest classifiers following extensive analysis of seven conventional classifiers and two additional stacking ensemble classifiers. An open-source data set comprising numerical parametric features linked to PCOS for classifier training was used. Moreover, to identify essential features for PCOS prediction three feature selection methods: Threshold-driven Optimized Principal Component Analysis (TOPCA), Optimized Salp Swarm (OSSM), and Threshold-driven Optimized Mutual Information Method (TOMIM) were fine-tuned through thresholding and improvisation to detect diverse attribute sets with varying numbers and combinations. Notably, the two-level Random Forest classifier model outperformed others with a remarkable 99.31 % accuracy by employing the top 17 features selected through the Threshold-driven Optimized Mutual Information Method (TOMIM) along with anoverallaccuracy of 99.32 % with 8 fold cross validation for 25 runs. The Smart predictor, constructed using Shapash - a Python library for Explainable Artificial Intelligence - was utilized to deploy the two-level Random Forest classifier model. Ensuring transparency and result reliability, visualizations from robust Explainable AI libraries were employed at different prediction stages for all considered classifiers in this study.http://www.sciencedirect.com/science/article/pii/S240584402415236XPolycystic ovarian syndrome (PCOS)Explainable artificial intelligenceeXplainable artificial intelligence (XAI)Machine learningClassificationEnsemble model
spellingShingle Umaa Mahesswari G
Uma Maheswari P
SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence
Heliyon
Polycystic ovarian syndrome (PCOS)
Explainable artificial intelligence
eXplainable artificial intelligence (XAI)
Machine learning
Classification
Ensemble model
title SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence
title_full SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence
title_fullStr SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence
title_full_unstemmed SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence
title_short SmartScanPCOS: A feature-driven approach to cutting-edge prediction of Polycystic Ovary Syndrome using Machine Learning and Explainable Artificial Intelligence
title_sort smartscanpcos a feature driven approach to cutting edge prediction of polycystic ovary syndrome using machine learning and explainable artificial intelligence
topic Polycystic ovarian syndrome (PCOS)
Explainable artificial intelligence
eXplainable artificial intelligence (XAI)
Machine learning
Classification
Ensemble model
url http://www.sciencedirect.com/science/article/pii/S240584402415236X
work_keys_str_mv AT umaamahesswarig smartscanpcosafeaturedrivenapproachtocuttingedgepredictionofpolycysticovarysyndromeusingmachinelearningandexplainableartificialintelligence
AT umamaheswarip smartscanpcosafeaturedrivenapproachtocuttingedgepredictionofpolycysticovarysyndromeusingmachinelearningandexplainableartificialintelligence