Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI

Healthcare fraud is a critical challenge, contributing significantly to rising healthcare costs and financial losses. This article proposes a hybrid architecture for healthcare fraud detection, combining deep learning-based feature representation with gradient boosting classification and explainable...

Full description

Saved in:

Bibliographic Details
Main Authors:	Mohammad Balayet Hossain Sakil, Md Amit Hasan, Md Shahin Alam Mozumder, Md Rokibul Hasan, Shafiul Ajam Opee, M. F. Mridha, Zeyar Aung
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Healthcare fraud detection hybrid deep learning convolutional neural networks transformers XGBoost SHAP explainability
Online Access:	https://ieeexplore.ieee.org/document/10971341/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849420936735883264
author	Mohammad Balayet Hossain Sakil Md Amit Hasan Md Shahin Alam Mozumder Md Rokibul Hasan Shafiul Ajam Opee M. F. Mridha Zeyar Aung
author_facet	Mohammad Balayet Hossain Sakil Md Amit Hasan Md Shahin Alam Mozumder Md Rokibul Hasan Shafiul Ajam Opee M. F. Mridha Zeyar Aung
author_sort	Mohammad Balayet Hossain Sakil
collection	DOAJ
description	Healthcare fraud is a critical challenge, contributing significantly to rising healthcare costs and financial losses. This article proposes a hybrid architecture for healthcare fraud detection, combining deep learning-based feature representation with gradient boosting classification and explainable AI techniques. The framework integrates convolutional neural networks (CNNs), transformers, and XGBoost to capture intricate patterns in claims data while maintaining interpretability through Shapley additive explanations. The model we proposed was tested on two datasets: the Medicare Provider Fraud dataset and the Healthcare Providers dataset. On the Medicare dataset, the framework achieved an F1-score of 0.95 on the training set and 0.92 on the test set, with an AUC-ROC of 0.98 and 0.97, respectively, outperforming state-of-the-art models such as LightGBM and CatBoost. On the Healthcare Providers dataset, the framework attained a test F1-score of 0.92 and an AUC-ROC of 0.96, consistently surpassing traditional models like Support Vector Machines and Random Forest. Key contributions include integrating domain-specific features, such as provider-patient interaction graphs and temporal patterns, and using explainability techniques to enhance trustworthiness. Furthermore, the framework demonstrated computational efficiency, with a training time of 150 seconds on the primary dataset, making it suitable for real-world deployment.
format	Article
id	doaj-art-385695bf791b4090845968539b85aeb4
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-385695bf791b4090845968539b85aeb42025-08-20T03:31:37ZengIEEEIEEE Access2169-35362025-01-0113796097962210.1109/ACCESS.2025.356257710971341Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AIMohammad Balayet Hossain Sakil0Md Amit Hasan1Md Shahin Alam Mozumder2Md Rokibul Hasan3Shafiul Ajam Opee4https://orcid.org/0009-0009-9526-647XM. F. Mridha5https://orcid.org/0000-0001-5738-1631Zeyar Aung6https://orcid.org/0000-0001-5990-9305College of Graduate and Professional Studies, Trine University, Angola, IN, USAMaster of Science in Information Technology (MSIT), Washington University of Science and Technology, Alexandria, VA, USAMaster of Science in Information Technology (MSIT), Washington University of Science and Technology, Alexandria, VA, USAHarrison College of Business and Computing, Southeast Missouri State University, Cape Girardeau, MO, USADepartment of Computer Science, American International University-Bangladesh (AIUB), Dhaka, BangladeshDepartment of Computer Science, American International University-Bangladesh (AIUB), Dhaka, BangladeshDepartment of Computer Science, Khalifa University, Abu Dhabi, United Arab EmiratesHealthcare fraud is a critical challenge, contributing significantly to rising healthcare costs and financial losses. This article proposes a hybrid architecture for healthcare fraud detection, combining deep learning-based feature representation with gradient boosting classification and explainable AI techniques. The framework integrates convolutional neural networks (CNNs), transformers, and XGBoost to capture intricate patterns in claims data while maintaining interpretability through Shapley additive explanations. The model we proposed was tested on two datasets: the Medicare Provider Fraud dataset and the Healthcare Providers dataset. On the Medicare dataset, the framework achieved an F1-score of 0.95 on the training set and 0.92 on the test set, with an AUC-ROC of 0.98 and 0.97, respectively, outperforming state-of-the-art models such as LightGBM and CatBoost. On the Healthcare Providers dataset, the framework attained a test F1-score of 0.92 and an AUC-ROC of 0.96, consistently surpassing traditional models like Support Vector Machines and Random Forest. Key contributions include integrating domain-specific features, such as provider-patient interaction graphs and temporal patterns, and using explainability techniques to enhance trustworthiness. Furthermore, the framework demonstrated computational efficiency, with a training time of 150 seconds on the primary dataset, making it suitable for real-world deployment.https://ieeexplore.ieee.org/document/10971341/Healthcare fraud detectionhybrid deep learningconvolutional neural networkstransformersXGBoostSHAP explainability
spellingShingle	Mohammad Balayet Hossain Sakil Md Amit Hasan Md Shahin Alam Mozumder Md Rokibul Hasan Shafiul Ajam Opee M. F. Mridha Zeyar Aung Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI IEEE Access Healthcare fraud detection hybrid deep learning convolutional neural networks transformers XGBoost SHAP explainability
title	Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI
title_full	Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI
title_fullStr	Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI
title_full_unstemmed	Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI
title_short	Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI
title_sort	enhancing medicare fraud detection with a cnn transformer xgboost framework and explainable ai
topic	Healthcare fraud detection hybrid deep learning convolutional neural networks transformers XGBoost SHAP explainability
url	https://ieeexplore.ieee.org/document/10971341/
work_keys_str_mv	AT mohammadbalayethossainsakil enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai AT mdamithasan enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai AT mdshahinalammozumder enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai AT mdrokibulhasan enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai AT shafiulajamopee enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai AT mfmridha enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai AT zeyaraung enhancingmedicarefrauddetectionwithacnntransformerxgboostframeworkandexplainableai

Enhancing Medicare Fraud Detection With a CNN-Transformer-XGBoost Framework and Explainable AI

Similar Items