k -Anonymity in practice: How generalisation and suppression affect machine learning classifiers - Citegraph

Paper Info

Title
k -Anonymity in practice: How generalisation and suppression affect machine learning classifiers

Abstract
The protection of private information is a crucial issue in data-driven research and business contexts. Typically, techniques like anonymisation or (selective) deletion are introduced in order to allow data sharing, e. g. in the case of collaborative research endeavours. For use with anonymisation techniques, the k-anonymity criterion is one of the most popular, with numerous scientific publications on different algorithms and metrics. Anonymisation tech-niques often require changing the data and thus necessarily affect the results of machine learning models trained on the underlying data. In this work, we conduct a systematic com-parison and detailed investigation into the effects of different k-anonymisation algorithms on the results of machine learning models. We investigate a set of popular k-anonymisation algorithms with different classifiers and evaluate them on different real-world datasets. Our systematic evaluation shows that with an increasingly strong k-anonymity constraint, the classification performance generally degrades, but to varying degrees and strongly depend-ing on the dataset and anonymisation method. Furthermore, Mondrian can be considered as the method with the most appealing properties for subsequent classification. (c) 2021 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )

Year	DOI	Venue
2021	10.1016/j.cose.2021.102488	COMPUTERS & SECURITY
Keywords	DocType	Volume
k-Anonymity, Machine learning, Anonymisation, Generalisation, Suppression	Journal	111
ISSN	Citations	PageRank
0167-4048	0	0.34
References	Authors
0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Djordje Slijepcevic	1	0	0.34
Maximilian Henzl	2	0	0.34
Lukas Daniel Klausner	3	0	0.34
Tobias Dam	4	0	0.34
Peter Kieseberg	5	187	29.39
Matthias Zeppelzauer	6	186	21.35

1