arXiv:2305.07415 [cs.LG]

Comparison of machine learning models applied on anonymized data with different techniques

Judith Sáinz-Pardo Díaz, Álvaro López García

Published 2023-05-12, Version 1

Anonymization techniques based on obfuscating the quasi-identifiers by means of value generalization hierarchies are widely used to achieve preset levels of privacy. To prevent different types of attacks against database privacy, it is necessary to apply several anonymization techniques beyond the classical k-anonymity or $\ell$-diversity. However, applying these methods reduces the utility of the anonymized data in prediction and decision-making tasks. In this work we study four classical machine learning methods currently used for classification, and analyze their results as a function of the anonymization techniques applied and the parameters selected for each of them. The performance of these models is studied when varying the value of k for k-anonymity; additional tools such as $\ell$-diversity, t-closeness and $\delta$-disclosure privacy are also deployed on the well-known adult dataset.
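To make the terms in the abstract concrete, the following is a minimal Python sketch of how k-anonymity and $\ell$-diversity can be measured before and after generalizing a quasi-identifier. It is an illustration only, not the authors' code: the toy records, the 10-year age intervals, and the function names (`generalize_age`, `k_anonymity`, `l_diversity`) are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical toy records (not taken from the adult dataset).
RECORDS = [
    {"age": 23, "sex": "F", "income": "<=50K"},
    {"age": 27, "sex": "F", "income": ">50K"},
    {"age": 25, "sex": "F", "income": "<=50K"},
    {"age": 41, "sex": "M", "income": ">50K"},
    {"age": 44, "sex": "M", "income": "<=50K"},
    {"age": 48, "sex": "M", "income": ">50K"},
]
QUASI_IDENTIFIERS = ["age", "sex"]  # assumed quasi-identifiers
SENSITIVE = "income"                # assumed sensitive attribute


def generalize_age(age, width=10):
    """One level of an assumed generalization hierarchy: age -> interval."""
    low = (age // width) * width
    return f"[{low}, {low + width})"


def equivalence_classes(records, quasi_identifiers):
    """Group records that share the same quasi-identifier values."""
    classes = defaultdict(list)
    for record in records:
        key = tuple(record[q] for q in quasi_identifiers)
        classes[key].append(record)
    return classes


def k_anonymity(records, quasi_identifiers):
    """Largest k satisfied: the size of the smallest equivalence class."""
    classes = equivalence_classes(records, quasi_identifiers)
    return min(len(group) for group in classes.values())


def l_diversity(records, quasi_identifiers, sensitive):
    """Largest l satisfied (distinct variant): the fewest distinct
    sensitive values found in any equivalence class."""
    classes = equivalence_classes(records, quasi_identifiers)
    return min(len({r[sensitive] for r in group}) for group in classes.values())


# Apply the generalization to every record, leaving other fields untouched.
GENERALIZED = [{**r, "age": generalize_age(r["age"])} for r in RECORDS]

if __name__ == "__main__":
    print("k before:", k_anonymity(RECORDS, QUASI_IDENTIFIERS))
    print("k after: ", k_anonymity(GENERALIZED, QUASI_IDENTIFIERS))
    print("l after: ", l_diversity(GENERALIZED, QUASI_IDENTIFIERS, SENSITIVE))
```

In this toy case every raw record is unique on its quasi-identifiers, while the generalized table groups the records into two classes of three. The exact ages that a classifier could have used are lost in the process, which is the privacy and utility trade-off the abstract refers to.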

Comments: Accepted for publication: IEEE International Conference in Cyber Security and Resilience 2023 (IEEE CSR)

Categories: cs.LG, cs.CR, cs.DB

Keywords: machine learning models, anonymized data, anonymization techniques, machine learning methods, comparison

Tags: conference paper

Related articles:

arXiv:2307.02973 [cs.LG] (Published 2023-07-06)

Pruning vs Quantization: Which is Better?

Andrey Kuzmin, Markus Nagel, Mart van Baalen, Arash Behboodi, Tijmen Blankevoort

arXiv:2105.01282 [cs.LG] (Published 2021-05-04)

Comparison of Machine Learning Methods for Predicting Winter Wheat Yield in Germany

Amit Kumar Srivastava et al.

arXiv:1703.00512 [cs.LG] (Published 2017-03-01)

PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison

Randal S. Olson, William La Cava, Patryk Orzechowski, Ryan J. Urbanowicz, Jason H. Moore
