MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction

Recent trendy applications of Artificial Intelligence are Machine Learning (ML) algorithms, which have been extensively utilized for processes like pattern recognition, object classification, effective prediction of disease etc. However, ML techniques are reasonable solutions to computation methods...

Full description

Saved in:
Bibliographic Details
Main Authors: R. Ramani, S. Edwin Raja, D. Dhinakaran, S. Jagan, G. Prabaharan
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:MethodsX
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2215016125000573
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849419535470297088
author R. Ramani
S. Edwin Raja
D. Dhinakaran
S. Jagan
G. Prabaharan
author_facet R. Ramani
S. Edwin Raja
D. Dhinakaran
S. Jagan
G. Prabaharan
author_sort R. Ramani
collection DOAJ
description Recent trendy applications of Artificial Intelligence are Machine Learning (ML) algorithms, which have been extensively utilized for processes like pattern recognition, object classification, effective prediction of disease etc. However, ML techniques are reasonable solutions to computation methods and modeling, especially when the data size is enormous. These facts are established due to the reason that big data field has received considerable attention from both the industrial experts and academicians. The computation process must be accelerated to achieve early disease prediction in order to accomplish the prospects of ML for big data applications. In this paper, a method named “Associative Kruskal Wallis and MapReduce Poly Kernel (AKW-MRPK)'' is presented for early disease prediction. Initially, significant attributes are selected by applying Associative Kruskal Wallis Feature Selection model. This study parallelizes polynomial kernel vector using MapReduce based on the significant qualities gained, which will become a significant computing model to facilitate the early prognosis of disease. The proposed AKW-MRPK framework achieves up to 92 % accuracy, reduces computational time to as low as 0.875 ms for 25 patients, and demonstrates superior speedup efficiency with a value of 1.9 ms using two computational nodes, consistently outperforming supervised machine learning algorithms and Hadoop-based clusters across these critical metrics. • The AKW-MRPK method selects attributes and accelerates computations for predictions. • Parallelizing polynomial kernels improves accuracy and speed in healthcare data analysis.
format Article
id doaj-art-dda0b13912e44565b6567ce829f6a1ac
institution Kabale University
issn 2215-0161
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series MethodsX
spelling doaj-art-dda0b13912e44565b6567ce829f6a1ac2025-08-20T03:32:03ZengElsevierMethodsX2215-01612025-06-011410321010.1016/j.mex.2025.103210MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease predictionR. Ramani0S. Edwin Raja1D. Dhinakaran2S. Jagan3G. Prabaharan4Department of Artificial Intelligence and Data Science, P.S.R Engineering College, Sivakasi, IndiaDepartment of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India; Corresponding author.Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, IndiaDepartment of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, IndiaDepartment of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, IndiaRecent trendy applications of Artificial Intelligence are Machine Learning (ML) algorithms, which have been extensively utilized for processes like pattern recognition, object classification, effective prediction of disease etc. However, ML techniques are reasonable solutions to computation methods and modeling, especially when the data size is enormous. These facts are established due to the reason that big data field has received considerable attention from both the industrial experts and academicians. The computation process must be accelerated to achieve early disease prediction in order to accomplish the prospects of ML for big data applications. In this paper, a method named “Associative Kruskal Wallis and MapReduce Poly Kernel (AKW-MRPK)'' is presented for early disease prediction. Initially, significant attributes are selected by applying Associative Kruskal Wallis Feature Selection model. This study parallelizes polynomial kernel vector using MapReduce based on the significant qualities gained, which will become a significant computing model to facilitate the early prognosis of disease. The proposed AKW-MRPK framework achieves up to 92 % accuracy, reduces computational time to as low as 0.875 ms for 25 patients, and demonstrates superior speedup efficiency with a value of 1.9 ms using two computational nodes, consistently outperforming supervised machine learning algorithms and Hadoop-based clusters across these critical metrics. • The AKW-MRPK method selects attributes and accelerates computations for predictions. • Parallelizing polynomial kernels improves accuracy and speed in healthcare data analysis.http://www.sciencedirect.com/science/article/pii/S2215016125000573Associative Kruskal Wallis and MapReduce Poly Kernel
spellingShingle R. Ramani
S. Edwin Raja
D. Dhinakaran
S. Jagan
G. Prabaharan
MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
MethodsX
Associative Kruskal Wallis and MapReduce Poly Kernel
title MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
title_full MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
title_fullStr MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
title_full_unstemmed MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
title_short MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
title_sort mapreduce based big data framework using associative kruskal poly kernel classifier for diabetic disease prediction
topic Associative Kruskal Wallis and MapReduce Poly Kernel
url http://www.sciencedirect.com/science/article/pii/S2215016125000573
work_keys_str_mv AT rramani mapreducebasedbigdataframeworkusingassociativekruskalpolykernelclassifierfordiabeticdiseaseprediction
AT sedwinraja mapreducebasedbigdataframeworkusingassociativekruskalpolykernelclassifierfordiabeticdiseaseprediction
AT ddhinakaran mapreducebasedbigdataframeworkusingassociativekruskalpolykernelclassifierfordiabeticdiseaseprediction
AT sjagan mapreducebasedbigdataframeworkusingassociativekruskalpolykernelclassifierfordiabeticdiseaseprediction
AT gprabaharan mapreducebasedbigdataframeworkusingassociativekruskalpolykernelclassifierfordiabeticdiseaseprediction