Title
Cocoa: A Synthetic Data Generator For Testing Anonymization Techniques
Abstract
Conducting extensive testing of anonymization techniques is critical to assess their robustness and identify the scenarios where they are most suitable. However, the access to real microdata is highly restricted and the one that is publicly-available is usually anonymized or aggregated; hence, reducing its value for testing purposes. In this paper, we present a framework (COCOA) for the generation of realistic synthetic microdata that allows to define multi-attribute relationships in order to preserve the functional dependencies of the data. We prove how COCOA is useful to strengthen the testing of anonymization techniques by broadening the number and diversity of the test scenarios. Results also show how COCOA is practical to generate large datasets.
Year
DOI
Venue
2016
10.1007/978-3-319-45381-1_13
PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY
Field
DocType
Volume
Data mining,Computer science,Robustness (computer science),Functional dependency,Synthetic data,Scenario testing,Microdata (HTML),Garbage collection,Execution time,Information privacy
Conference
9867
ISSN
Citations 
PageRank 
0302-9743
4
0.44
References 
Authors
10
4
Name
Order
Citations
PageRank
Vanessa Ayala-Rivera1183.96
A. Omar Portillo-Dominguez2255.68
Liam Murphy381174.94
Christina Thorpe4539.00