Title
Secure, Privacy-Preserving Analysis of Distributed Databases
Abstract
In industrial and government settings, there is often a need to perform statistical analyses that require data stored in multiple distributed databases. However, the barriers to literally integrating these data can be substantial, even insurmountable. In this article we show how tools from information technology-specifically, secure multiparty computation and networking-can be used to perform statistically valid analyses of distributed databases. The common characteristic of these methods is that the owners share sufficient statistics computed on the local databases in a way that protects each owner's data from the other owners. Our focus is on horizontally partitioned data, in which data records rather than attributes are spread among the databases. We present protocols for securely performing regression, maximum likelihood estimation, and Bayesian analysis, as well as secure construction of contingency tables. We outline three current research directions: a software system implementing the protocols, secure EM algorithms, and partially trusted third parties, which reduce incentives for owners to be dishonest.
Year
DOI
Venue
2007
10.1198/004017007000000209
TECHNOMETRICS
Keywords
Field
DocType
data confidentiality,distributed databases,secure multiparty computation
Secure multi-party computation,Confidentiality,Expectation–maximization algorithm,Information technology,Software system,Distributed database,Information privacy,Statistics,Sufficient statistic,Mathematics
Journal
Volume
Issue
ISSN
49
3
0040-1706
Citations 
PageRank 
References 
20
1.15
8
Authors
6
Name
Order
Citations
PageRank
Alan F. Karr1100576.93
William J. Fulp2321.90
Francisco Vera3252.56
S. Stanley Young4649.85
Xiaodong Lin5805.34
Jerome P. Reiter621622.12