Unitanony: a fine-grained and practical anonymization framework for better data utility
Abstract In order to share data without revealing private information, privacy-preserving data publishing techniques are proposed. K-anonymity and l-diversity secure against identity and attribute disclosure. Anonymization algorithms enforce the above models and are willing to reach two primary goal...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2025-07-01
|
| Series: | Cybersecurity |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s42400-024-00345-2 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849238290650103808 |
|---|---|
| author | Shuguang Yuan Jing Yu Tengfei Yang Chi Chen |
| author_facet | Shuguang Yuan Jing Yu Tengfei Yang Chi Chen |
| author_sort | Shuguang Yuan |
| collection | DOAJ |
| description | Abstract In order to share data without revealing private information, privacy-preserving data publishing techniques are proposed. K-anonymity and l-diversity secure against identity and attribute disclosure. Anonymization algorithms enforce the above models and are willing to reach two primary goals: achieving the privacy objective while maximizing data utility. Even though anonymization has been studied for decades, finding efficient techniques to improve data utility is an open question. It is a crucial challenge that impacts many anonymized data on the web, cloud, and IoT environments. However, some factors incur huge information loss for existing works. The original dataset may be transformed into generalized data to an excessive extent. To address this problem, we give a new framework and propose a heuristic algorithm called UnitAnony. It builds a full-coverage hierarchy for more generalization candidates and proposes an interval-mapping technique for fine-grained generalization extent. However, these improvements raise another challenge. It is the vast cost because more generalization will derive many operations for forming new values, grouping records, and verifying anonymization models. Therefore, we designed a data structure unit to generalize records at low costs and implemented a skipping strategy to execute the algorithm within an acceptable time. Besides, our algorithm can support many models. By evaluating well-known K-anonymity and l-diversity on real-world datasets, i.e., Adult and Census datasets, the experimental results demonstrate that our algorithm outperforms the existing algorithms (e.g., Mondrian, Top-down, Improved-Clustering, Flash, and Incognito) regarding data utility and effectiveness. |
| format | Article |
| id | doaj-art-3fc0cb83794d4cba9e9335e6843c6642 |
| institution | Kabale University |
| issn | 2523-3246 |
| language | English |
| publishDate | 2025-07-01 |
| publisher | SpringerOpen |
| record_format | Article |
| series | Cybersecurity |
| spelling | doaj-art-3fc0cb83794d4cba9e9335e6843c66422025-08-20T04:01:41ZengSpringerOpenCybersecurity2523-32462025-07-018111710.1186/s42400-024-00345-2Unitanony: a fine-grained and practical anonymization framework for better data utilityShuguang Yuan0Jing Yu1Tengfei Yang2Chi Chen3Institute of Information Engineering, Chinese Academy of ScienceInstitute of Information Engineering, Chinese Academy of ScienceNational Computer Network Emergency Response Technical Team/Coordination Center of ChinaInstitute of Information Engineering, Chinese Academy of ScienceAbstract In order to share data without revealing private information, privacy-preserving data publishing techniques are proposed. K-anonymity and l-diversity secure against identity and attribute disclosure. Anonymization algorithms enforce the above models and are willing to reach two primary goals: achieving the privacy objective while maximizing data utility. Even though anonymization has been studied for decades, finding efficient techniques to improve data utility is an open question. It is a crucial challenge that impacts many anonymized data on the web, cloud, and IoT environments. However, some factors incur huge information loss for existing works. The original dataset may be transformed into generalized data to an excessive extent. To address this problem, we give a new framework and propose a heuristic algorithm called UnitAnony. It builds a full-coverage hierarchy for more generalization candidates and proposes an interval-mapping technique for fine-grained generalization extent. However, these improvements raise another challenge. It is the vast cost because more generalization will derive many operations for forming new values, grouping records, and verifying anonymization models. Therefore, we designed a data structure unit to generalize records at low costs and implemented a skipping strategy to execute the algorithm within an acceptable time. Besides, our algorithm can support many models. By evaluating well-known K-anonymity and l-diversity on real-world datasets, i.e., Adult and Census datasets, the experimental results demonstrate that our algorithm outperforms the existing algorithms (e.g., Mondrian, Top-down, Improved-Clustering, Flash, and Incognito) regarding data utility and effectiveness.https://doi.org/10.1186/s42400-024-00345-2Anonymization frameworkGeneralizationFine-grainedUnitAnony |
| spellingShingle | Shuguang Yuan Jing Yu Tengfei Yang Chi Chen Unitanony: a fine-grained and practical anonymization framework for better data utility Cybersecurity Anonymization framework Generalization Fine-grained UnitAnony |
| title | Unitanony: a fine-grained and practical anonymization framework for better data utility |
| title_full | Unitanony: a fine-grained and practical anonymization framework for better data utility |
| title_fullStr | Unitanony: a fine-grained and practical anonymization framework for better data utility |
| title_full_unstemmed | Unitanony: a fine-grained and practical anonymization framework for better data utility |
| title_short | Unitanony: a fine-grained and practical anonymization framework for better data utility |
| title_sort | unitanony a fine grained and practical anonymization framework for better data utility |
| topic | Anonymization framework Generalization Fine-grained UnitAnony |
| url | https://doi.org/10.1186/s42400-024-00345-2 |
| work_keys_str_mv | AT shuguangyuan unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility AT jingyu unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility AT tengfeiyang unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility AT chichen unitanonyafinegrainedandpracticalanonymizationframeworkforbetterdatautility |