Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm

Healthcare datasets are very sensitive datasets. In case of unauthorized access, sensitive datasets could potentially cause damage, discrimination and unsolicited scrutiny. The patients’ health details constitute private personal information. They should not be disclosed. However, data might get sto...

Full description

Saved in:
Bibliographic Details
Main Authors: Dipanwita Sen, Bhupati Bhusan Mishra, Prasant Kumar Pattnaik
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Franklin Open
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2773186325000568
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849471456794116096
author Dipanwita Sen
Bhupati Bhusan Mishra
Prasant Kumar Pattnaik
author_facet Dipanwita Sen
Bhupati Bhusan Mishra
Prasant Kumar Pattnaik
author_sort Dipanwita Sen
collection DOAJ
description Healthcare datasets are very sensitive datasets. In case of unauthorized access, sensitive datasets could potentially cause damage, discrimination and unsolicited scrutiny. The patients’ health details constitute private personal information. They should not be disclosed. However, data might get stored in cloud without any protection. That is why, privacy preservation of data for healthcare dataset is a significant consideration. In this work, the Wisconsin Prognostic Breast Cancer (WBC) dataset has been used. At first, a privacy preserving schema making use of perturbation implementing Jaya Algorithm has been elucidated. Out of 30 numerical attributes in the dataset, 6 are chosen for perturbation based on their relatively high Pearson’s Correlation coefficient values. They form the initial population of Jaya Algorithm. The objective function is defined and we opt for a minimization problem. After each iteration, the algorithm generates a new population from the previous population. Thereafter, the accuracies obtained by a few traditional classification algorithms as well as classifiers based on some meta-heuristic algorithms, are observed . The classical classifiers used are Decision Tree, Random Forest, AdaBoost, KNN and GNB. The standard evaluation metrics are recorded thereafter. For privacy, the evaluation metrics used are Secrecy, Value Difference(VD), RP, RK, CP and CK. For utility, the metrics are Accuracy, Precision, Recall, F1-Score and Area Under the Curve(AUC). Jaya Algorithm is then compared with traditional perturbation algorithms like 2DRT and 3DRT. It is seen that Jaya preserves more privacy and retains more utility as suggested by mean Friedman Test Rankings.. Among the metaheuristic optimization based classifiers, only the Dragonfly inspired Classifier(DIC) classifies over 90 % of the records correctly for the perturbed dataset. For classification, the perturbed dataset is fed into the DIC as the original population. The target is to minimize the distance between the testing dataset points and the centroids assigned to them. The new centroids are calculated using the updated training set points only. The updated dragonflies are assigned new centroids at each stage. All these simulations have been implemented in Python environment.
format Article
id doaj-art-e92898649e2c43e49fe0c2c86e243208
institution Kabale University
issn 2773-1863
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series Franklin Open
spelling doaj-art-e92898649e2c43e49fe0c2c86e2432082025-08-20T03:24:48ZengElsevierFranklin Open2773-18632025-06-011110026610.1016/j.fraope.2025.100266Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired AlgorithmDipanwita Sen0Bhupati Bhusan Mishra1Prasant Kumar Pattnaik2KIIT Deemed To Be University, Bhubaneswar, Odisha, 751024, IndiaKIIT Deemed To Be University, Bhubaneswar, Odisha, 751024, IndiaCorresponding author.; KIIT Deemed To Be University, Bhubaneswar, Odisha, 751024, IndiaHealthcare datasets are very sensitive datasets. In case of unauthorized access, sensitive datasets could potentially cause damage, discrimination and unsolicited scrutiny. The patients’ health details constitute private personal information. They should not be disclosed. However, data might get stored in cloud without any protection. That is why, privacy preservation of data for healthcare dataset is a significant consideration. In this work, the Wisconsin Prognostic Breast Cancer (WBC) dataset has been used. At first, a privacy preserving schema making use of perturbation implementing Jaya Algorithm has been elucidated. Out of 30 numerical attributes in the dataset, 6 are chosen for perturbation based on their relatively high Pearson’s Correlation coefficient values. They form the initial population of Jaya Algorithm. The objective function is defined and we opt for a minimization problem. After each iteration, the algorithm generates a new population from the previous population. Thereafter, the accuracies obtained by a few traditional classification algorithms as well as classifiers based on some meta-heuristic algorithms, are observed . The classical classifiers used are Decision Tree, Random Forest, AdaBoost, KNN and GNB. The standard evaluation metrics are recorded thereafter. For privacy, the evaluation metrics used are Secrecy, Value Difference(VD), RP, RK, CP and CK. For utility, the metrics are Accuracy, Precision, Recall, F1-Score and Area Under the Curve(AUC). Jaya Algorithm is then compared with traditional perturbation algorithms like 2DRT and 3DRT. It is seen that Jaya preserves more privacy and retains more utility as suggested by mean Friedman Test Rankings.. Among the metaheuristic optimization based classifiers, only the Dragonfly inspired Classifier(DIC) classifies over 90 % of the records correctly for the perturbed dataset. For classification, the perturbed dataset is fed into the DIC as the original population. The target is to minimize the distance between the testing dataset points and the centroids assigned to them. The new centroids are calculated using the updated training set points only. The updated dragonflies are assigned new centroids at each stage. All these simulations have been implemented in Python environment.http://www.sciencedirect.com/science/article/pii/S2773186325000568Privacy preserving data mining (PPDM)PerturbationJaya AlgorithmMeta-heuristic optimizationData mining (DM)Independent component analysis (ICA)
spellingShingle Dipanwita Sen
Bhupati Bhusan Mishra
Prasant Kumar Pattnaik
Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm
Franklin Open
Privacy preserving data mining (PPDM)
Perturbation
Jaya Algorithm
Meta-heuristic optimization
Data mining (DM)
Independent component analysis (ICA)
title Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm
title_full Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm
title_fullStr Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm
title_full_unstemmed Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm
title_short Perturbation based privacy preservation and classification using Jaya Algorithm and Dragonfly Inspired Algorithm
title_sort perturbation based privacy preservation and classification using jaya algorithm and dragonfly inspired algorithm
topic Privacy preserving data mining (PPDM)
Perturbation
Jaya Algorithm
Meta-heuristic optimization
Data mining (DM)
Independent component analysis (ICA)
url http://www.sciencedirect.com/science/article/pii/S2773186325000568
work_keys_str_mv AT dipanwitasen perturbationbasedprivacypreservationandclassificationusingjayaalgorithmanddragonflyinspiredalgorithm
AT bhupatibhusanmishra perturbationbasedprivacypreservationandclassificationusingjayaalgorithmanddragonflyinspiredalgorithm
AT prasantkumarpattnaik perturbationbasedprivacypreservationandclassificationusingjayaalgorithmanddragonflyinspiredalgorithm