Title
Towards a Unified Framework for Data Cleaning and Data Privacy.
Abstract
Data quality has become a pervasive challenge for organizations as they wrangle with large, heterogeneous datasets to extract value. Existing data cleaning solutions have focused on scalable techniques to resolve inconsistencies quickly. However, given the proliferation of sensitive, confidential user information, data privacy concerns have largely remained unexplored in data cleaning techniques. In this work, we present a new privacy-aware data cleaning framework that aims to resolve data inconsistencies while minimizing the amount of information disclosed. We present a set of data disclosure operations that facilitate the data cleaning process, and propose two information-theoretic measures for privacy loss and data utility that are used to correct inconsistencies in the data.
Year: 2015
Venue: WISE
Field: Data mining, Data quality, Confidentiality, Computer science, User information, Information privacy, Database, Scalability
DocType: Conference
Citations: 1
PageRank: 0.35
References: 2
Authors: 2
Name          Order   Citations   PageRank
Yu Huang      1       56          19.96
Fei Chiang    2       256         19.02