Title
Challenges in Measuring Utility for Fully Synthetic Data
Abstract
Evaluating the utility of the generated data is a pivotal step in any synthetic data project. Most projects start by exploring various synthesis approaches trying to identify the most suitable synthesis strategy for the data at hand. Utility evaluations are also always necessary to decide whether the data are of sufficient quality to be released. Various utility measures have been proposed for this purpose in the literature. However, as I will show in this paper, some of these measures can be misleading when considered in isolation while others seem to be inappropriate to assess whether the synthetic data are suitable to be released. This illustrates that a detailed validity assessment looking at various dimensions of utility will always be inevitable to find the optimal synthesis strategy.
Year
DOI
Venue
2022
10.1007/978-3-031-13945-1_16
Privacy in Statistical Databases
Keywords
DocType
Volume
Confidence interval overlap, Confidentiality, Global utility, pMSE, Privacy
Conference
13463
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
1
Name
Order
Citations
PageRank
Drechsler Jörg100.34