Unitanony: a fine-grained and practical anonymization framework for better data utility

Abstract In order to share data without revealing private information, privacy-preserving data publishing techniques are proposed. K-anonymity and l-diversity secure against identity and attribute disclosure. Anonymization algorithms enforce the above models and are willing to reach two primary goal...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuguang Yuan, Jing Yu, Tengfei Yang, Chi Chen
Format: Article
Language:English
Published: SpringerOpen 2025-07-01
Series:Cybersecurity
Subjects:
Online Access:https://doi.org/10.1186/s42400-024-00345-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849238290650103808
author Shuguang Yuan
Jing Yu
Tengfei Yang
Chi Chen
author_facet Shuguang Yuan
Jing Yu
Tengfei Yang
Chi Chen
author_sort Shuguang Yuan
collection DOAJ
description Abstract In order to share data without revealing private information, privacy-preserving data publishing techniques are proposed. K-anonymity and l-diversity secure against identity and attribute disclosure. Anonymization algorithms enforce the above models and are willing to reach two primary goals: achieving the privacy objective while maximizing data utility. Even though anonymization has been studied for decades, finding efficient techniques to improve data utility is an open question. It is a crucial challenge that impacts many anonymized data on the web, cloud, and IoT environments. However, some factors incur huge information loss for existing works. The original dataset may be transformed into generalized data to an excessive extent. To address this problem, we give a new framework and propose a heuristic algorithm called UnitAnony. It builds a full-coverage hierarchy for more generalization candidates and proposes an interval-mapping technique for fine-grained generalization extent. However, these improvements raise another challenge. It is the vast cost because more generalization will derive many operations for forming new values, grouping records, and verifying anonymization models. Therefore, we designed a data structure unit to generalize records at low costs and implemented a skipping strategy to execute the algorithm within an acceptable time. Besides, our algorithm can support many models. By evaluating well-known K-anonymity and l-diversity on real-world datasets, i.e., Adult and Census datasets, the experimental results demonstrate that our algorithm outperforms the existing algorithms (e.g., Mondrian, Top-down, Improved-Clustering, Flash, and Incognito) regarding data utility and effectiveness.
format Article
id doaj-art-3fc0cb83794d4cba9e9335e6843c6642
institution Kabale University
issn 2523-3246
language English
publishDate 2025-07-01
publisher SpringerOpen
record_format Article
series Cybersecurity
spelling doaj-art-3fc0cb83794d4cba9e9335e6843c66422025-08-20T04:01:41ZengSpringerOpenCybersecurity2523-32462025-07-018111710.1186/s42400-024-00345-2Unitanony: a fine-grained and practical anonymization framework for better data utilityShuguang Yuan0Jing Yu1Tengfei Yang2Chi Chen3Institute of Information Engineering, Chinese Academy of ScienceInstitute of Information Engineering, Chinese Academy of ScienceNational Computer Network Emergency Response Technical Team/Coordination Center of ChinaInstitute of Information Engineering, Chinese Academy of ScienceAbstract In order to share data without revealing private information, privacy-preserving data publishing techniques are proposed. K-anonymity and l-diversity secure against identity and attribute disclosure. Anonymization algorithms enforce the above models and are willing to reach two primary goals: achieving the privacy objective while maximizing data utility. Even though anonymization has been studied for decades, finding efficient techniques to improve data utility is an open question. It is a crucial challenge that impacts many anonymized data on the web, cloud, and IoT environments. However, some factors incur huge information loss for existing works. The original dataset may be transformed into generalized data to an excessive extent. To address this problem, we give a new framework and propose a heuristic algorithm called UnitAnony. It builds a full-coverage hierarchy for more generalization candidates and proposes an interval-mapping technique for fine-grained generalization extent. However, these improvements raise another challenge. It is the vast cost because more generalization will derive many operations for forming new values, grouping records, and verifying anonymization models. Therefore, we designed a data structure unit to generalize records at low costs and implemented a skipping strategy to execute the algorithm within an acceptable time. Besides, our algorithm can support many models. By evaluating well-known K-anonymity and l-diversity on real-world datasets, i.e., Adult and Census datasets, the experimental results demonstrate that our algorithm outperforms the existing algorithms (e.g., Mondrian, Top-down, Improved-Clustering, Flash, and Incognito) regarding data utility and effectiveness.https://doi.org/10.1186/s42400-024-00345-2Anonymization frameworkGeneralizationFine-grainedUnitAnony
spellingShingle Shuguang Yuan
Jing Yu
Tengfei Yang
Chi Chen
Unitanony: a fine-grained and practical anonymization framework for better data utility
Cybersecurity
Anonymization framework
Generalization
Fine-grained
UnitAnony
title Unitanony: a fine-grained and practical anonymization framework for better data utility
title_full Unitanony: a fine-grained and practical anonymization framework for better data utility
title_fullStr Unitanony: a fine-grained and practical anonymization framework for better data utility
title_full_unstemmed Unitanony: a fine-grained and practical anonymization framework for better data utility
title_short Unitanony: a fine-grained and practical anonymization framework for better data utility
title_sort unitanony a fine grained and practical anonymization framework for better data utility
topic Anonymization framework
Generalization
Fine-grained
UnitAnony
url https://doi.org/10.1186/s42400-024-00345-2
work_keys_str_mv AT shuguangyuan unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility
AT jingyu unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility
AT tengfeiyang unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility
AT chichen unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility