(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy

In recent years, there has been a tremendous rise in both the volume and variety of big data, providing enormous potential benefits to businesses that seek to utilize consumer experiences for research or commercial purposes. The general data protection regulation (GDPR) implementation, on the other...

Full description

Saved in:

Bibliographic Details
Main Authors:	Burak Cem Kara, Can Eyupoglu, Oktay Karakus
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Privacy-preserving data publishing data anonymity ε-differential privacy k-anonymity
Online Access:	https://ieeexplore.ieee.org/document/10960292/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850141353019703296
author	Burak Cem Kara Can Eyupoglu Oktay Karakus
author_facet	Burak Cem Kara Can Eyupoglu Oktay Karakus
author_sort	Burak Cem Kara
collection	DOAJ
description	In recent years, there has been a tremendous rise in both the volume and variety of big data, providing enormous potential benefits to businesses that seek to utilize consumer experiences for research or commercial purposes. The general data protection regulation (GDPR) implementation, on the other hand, has introduced extensive control over the use of individuals’ personal information and placed many limits. Data anonymization technologies have become an important solution for businesses trying to generate value from data while adhering to GDPR limitations. To address these challenges, researchers have developed various methods, including k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy, offering solutions for both industry and academia. However, protecting individuals’ privacy against diverse attack attempts presents significant challenges for anonymization models that rely solely on a single technique, highlighting the need for more adaptable and hybrid approaches. In this study, a new hybrid anonymization algorithm called (r, k, <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>)-anonymization has been proposed, which combines k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy models in a consistent framework and provides stronger privacy guarantees compared to existing privacy-preserving models. The proposed algorithm is capable of overcoming well-known shortcomings of the k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy models, and it has been confirmed by extensive tests on real-world datasets. The proposed (r, k, <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>)-anonymization algorithm outperforms k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy in terms of the average error rate measure, achieving data utility increases of 31.74% and 26.99%, respectively.
format	Article
id	doaj-art-d3269b17a1894d3eb4da133296f8d7b2
institution	OA Journals
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-d3269b17a1894d3eb4da133296f8d7b22025-08-20T02:29:27ZengIEEEIEEE Access2169-35362025-01-0113704227043510.1109/ACCESS.2025.355941010960292(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential PrivacyBurak Cem Kara0https://orcid.org/0000-0001-9225-454XCan Eyupoglu1https://orcid.org/0000-0002-6133-8617Oktay Karakus2https://orcid.org/0000-0001-8009-9319Department of Computer Engineering, Turkish Air Force Academy, National Defence University, İstanbul, TürkiyeDepartment of Computer Engineering, Turkish Air Force Academy, National Defence University, İstanbul, TürkiyeSchool of Computer Science and Informatics, Cardiff University, Cardiff, U.K.In recent years, there has been a tremendous rise in both the volume and variety of big data, providing enormous potential benefits to businesses that seek to utilize consumer experiences for research or commercial purposes. The general data protection regulation (GDPR) implementation, on the other hand, has introduced extensive control over the use of individuals’ personal information and placed many limits. Data anonymization technologies have become an important solution for businesses trying to generate value from data while adhering to GDPR limitations. To address these challenges, researchers have developed various methods, including k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy, offering solutions for both industry and academia. However, protecting individuals’ privacy against diverse attack attempts presents significant challenges for anonymization models that rely solely on a single technique, highlighting the need for more adaptable and hybrid approaches. In this study, a new hybrid anonymization algorithm called (r, k, <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>)-anonymization has been proposed, which combines k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy models in a consistent framework and provides stronger privacy guarantees compared to existing privacy-preserving models. The proposed algorithm is capable of overcoming well-known shortcomings of the k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy models, and it has been confirmed by extensive tests on real-world datasets. The proposed (r, k, <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>)-anonymization algorithm outperforms k-anonymity and <inline-formula> <tex-math notation="LaTeX">$\varepsilon $ </tex-math></inline-formula>-differential privacy in terms of the average error rate measure, achieving data utility increases of 31.74% and 26.99%, respectively.https://ieeexplore.ieee.org/document/10960292/Privacy-preserving data publishingdata anonymityε-differential privacyk-anonymity
spellingShingle	Burak Cem Kara Can Eyupoglu Oktay Karakus (<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy IEEE Access Privacy-preserving data publishing data anonymity ε-differential privacy k-anonymity
title	(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy
title_full	(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy
title_fullStr	(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy
title_full_unstemmed	(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy
title_short	(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy
title_sort	italic r k x03b5 italic anonymization privacy preserving data publishing algorithm based on multi dimensional outlier detection italic k italic anonymity and italic x03b5 italic differential privacy
topic	Privacy-preserving data publishing data anonymity ε-differential privacy k-anonymity
url	https://ieeexplore.ieee.org/document/10960292/
work_keys_str_mv	AT burakcemkara italicrkx03b5italicanonymizationprivacypreservingdatapublishingalgorithmbasedonmultidimensionaloutlierdetectionitalickitalicanonymityanditalicx03b5italicdifferentialprivacy AT caneyupoglu italicrkx03b5italicanonymizationprivacypreservingdatapublishingalgorithmbasedonmultidimensionaloutlierdetectionitalickitalicanonymityanditalicx03b5italicdifferentialprivacy AT oktaykarakus italicrkx03b5italicanonymizationprivacypreservingdatapublishingalgorithmbasedonmultidimensionaloutlierdetectionitalickitalicanonymityanditalicx03b5italicdifferentialprivacy

(<italic>r, k, &#x03B5;</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>&#x03B5;</italic>-Differential Privacy

Similar Items

(<italic>r, k, ε</italic>)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, <italic>k</italic>-Anonymity, and <italic>ε</italic>-Differential Privacy