Classifying data from protected statistical datasets - Citegraph

Paper Info

Title
Classifying data from protected statistical datasets

Abstract
Statistical Disclosure Control (SDC) is an active research area in the recent years. The goal is to transform an original dataset X into a protected one X', such that X' does not reveal any relation between confidential and (quasi-)identifier attributes and such that X' can be used to compute reliable statistical information about X. Many specific protection methods have been proposed and analyzed, with respect to the levels of privacy and utility that they offer. However, when measuring utility, only differences between the statistical values of X and X' are considered. This would indicate that datasets protected by SDC methods can be used only for statistical purposes. We show in this paper that this is not the case, because a protected dataset X' can be used to construct good classifiers for future data. To do so, we describe an extensive set of experiments that we have run with different SDC protection methods and different (real) datasets. In general, the resulting classifiers are very good, which is good news for both the SDC and the Privacy-preserving Data Mining communities. In particular, our results question the necessity of some specific protection methods that have appeared in the privacy-preserving data mining (PPDM) literature with the clear goal of providing good classification.

Year	DOI	Venue
2010	10.1016/j.cose.2010.05.005	Computers and Security
Keywords	Field	DocType
disclosure risk,statistical disclosure control,weka experiments,classification methods,information loss	Data mining,Information loss,Identifier,Confidentiality,Computer science,Computer security,Statistical disclosure control	Journal
Volume	Issue	ISSN
29	8	Computers & Security
Citations	PageRank	References
14	0.58	26
Authors
4

Authors (4 rows)

Cited by (14 rows)

References (26 rows)

Name	Order	Citations	PageRank
Javier Herranz	1	107	8.73
Stan Matwin	2	3025	344.20
Jordi Nin	3	311	26.53
Vicenç Torra	4	2666	234.27

1