Title
On the Tradeoff Between Data-Privacy and Utility for Data Publishing
Abstract
A typical method for privacy-preserving data publishing mechanism is to add random noise to the original data for publishing. No matter what kind of noise is added, there is a chance that the original state can be estimated in a certain accuracy. The probability of the original data inferred by the malicious receiver in a given interval is measured by <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$(\alpha,\ \beta)$</tex> -data-privacy. With random noise added to the original data, the utility of the published data will decrease. In this paper, we investigate the tradeoff between data privacy and data utility under <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$(\alpha,\beta)$</tex> - data-privacy, aiming to seek an optimal noise distribution. To maximize the weighted sum of privacy and utility we prove that when the added noise is symmetric and the data utility is measured by <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\ell^{1}$</tex> - or <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\ell^2$</tex> -norm function, the optimal noise follows the uniform distribution. Then we further investigate the optimal noise to maximize data utility with a certain privacy guarantee and we derive that the optimal noise is a group of impulse functions. Finally, we compare <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$(\alpha, \beta)$</tex> -data-privacy with differential privacy and obtain the inequality relationship between the two privacy parameters. Simulations are conducted to validate the correctness of the obtained results.
Year
DOI
Venue
2018
10.1109/PADSW.2018.8644952
2018 IEEE 24th International Conference on Parallel and Distributed Systems (ICPADS)
Keywords
Field
DocType
Estimation,Publishing,Differential privacy,Privacy,Probability density function,Distributed databases
Applied mathematics,Differential privacy,Computer science,Correctness,Uniform distribution (continuous),Impulse (physics),Real-time computing,Data publishing,Distributed database,Information privacy,Probability density function
Conference
ISSN
ISBN
Citations 
1521-9097
978-1-5386-7308-9
1
PageRank 
References 
Authors
0.36
0
5
Name
Order
Citations
PageRank
Wenjing Liao110.36
Jianping He2196.39
Shanying Zhu313021.54
Cai-Lian Chen483198.98
Wenjing Liao5103.53