Abstract | ||
---|---|---|
We proposed a new k-anonymity algorithm to publish datasets with privacy protection. We improved clustering techniquesto lower data distort and enhance diversity of sensitive attributes values. Our algorithm includes four phases. Tuples are distributed to several groups in phase one. Tuples in a group own same sensitive value. In phase two, groups smaller than the threshold merge and then they are partitioned into several clusters according to quasi-identifier attributes. Each cluster would become an equivalence class. In phase three, remainder tuples are distributed to clusters evenly to satisfy L-diversity. Finally, quasi-identifier attributes values in each cluster are generalized to satisfy k-anonymity. We used OCC dataset to compare our algorithm with classic method based on clustering. Empirical results showed that our algorithm could be used to publish datasets with high security and limited information loss. © Springer-Verlag Berlin Heidelberg 2013. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1007/978-3-642-35795-4_46 | ISCTCS |
Keywords | Field | DocType |
cluster,k-anonymity,l-diversity,privacy protection | Canopy clustering algorithm,CURE data clustering algorithm,Data stream clustering,Correlation clustering,Affinity propagation,Computer science,Algorithm,FLAME clustering,k-medoids,Cluster analysis | Conference |
Volume | Issue | ISSN |
320 | null | null |
Citations | PageRank | References |
2 | 0.39 | 7 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fei Liu | 1 | 2 | 0.39 |
Yan Jia | 2 | 5 | 2.79 |
Weihong Han | 3 | 8 | 1.95 |