Abstract | ||
---|---|---|
Population informatics is the systematic study of populations via secondary analysis of massive data collections about people, called the social genome. A major challenge in building the social genome is the difficulty in data integration of heterogeneous and uncoordinated data while protecting the confidentiality of the data subjects. Here, we present our work in designing a flexible computerized third party linkage platform, Secure Decoupled Linkage (SDLink), which can provide both privacy protection and accurate high quality integrated data using a hybrid human-machine data integration system. Our evaluation results show that chaffing used in combination with universe manipulation is very effective in blocking inferences during the clerical review process. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/BigData.2013.6691789 | BigData Conference |
Keywords | Field | DocType |
heterogeneous data,massive data collections secondary analysis,data privacy,privacy preserving record linkage,social genome,universe manipulation,decoupled data,data analysis,sdlink,flexible computerized third party linkage platform,demography,population informatics,secure decoupled linkage,hybrid human-machine data integration system,uncoordinated data,clerical review process,data integration,social sciences computing,privacy protection | Data integration,Genome,Data mining,World Wide Web,Confidentiality,Computer science,Population informatics,Third party,Information privacy | Conference |
ISSN | Citations | PageRank |
2639-1589 | 0 | 0.34 |
References | Authors | |
3 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hye-Chung Kum | 1 | 114 | 12.99 |
Ashok Krishnamurthy | 2 | 455 | 56.47 |
Darshana Pathak | 3 | 0 | 0.34 |
Michael K. Reiter | 4 | 8695 | 764.03 |
Stanley C. Ahalt | 5 | 435 | 54.14 |