Title
Efficient privacy preserving k-means clustering
Abstract
This paper introduces an efficient privacy-preserving protocol for distributed K-means clustering over an arbitrary partitioned data, shared among N parties. Clustering is one of the fundamental algorithms used in the field of data mining. Advances in data acquisition methodologies have resulted in collection and storage of vast quantities of user’s personal data. For mutual benefit, organizations tend to share their data for analytical purposes, thus raising privacy concerns for the users. Over the years, numerous attempts have been made to introduce privacy and security at the expense of massive additional communication costs. The approaches suggested in the literature make use of the cryptographic protocols such as Secure Multiparty Computation (SMC) and/or homomorphic encryption schemes like Paillier’s encryption. Methods using such schemes have proven communication overheads. And in practice are found to be slower by a factor of more than 106. In light of the practical limitations posed by privacy using the traditional approaches, we explore a paradigm shift to side-step the expensive protocols of SMC. In this work, we use the paradigm of secret sharing, which allows the data to be divided into multiple shares and processed separately at different servers. Using the paradigm of secret sharing, allows us to design a provably-secure, cloud computing based solution which has negligible communication overhead compared to SMC and is hence over a million times faster than similar SMC based protocols.
Year
DOI
Venue
2010
10.1007/978-3-642-13601-6_17
PAISI
Keywords
Field
DocType
k-means clustering,data mining,data acquisition methodology,secret sharing,paradigm shift,negligible communication,personal data,privacy concern,massive additional communication cost,efficient privacy,similar smc,arbitrary partitioned data,cloud computing,data acquisition,cryptographic protocol,homomorphic encryption,k means clustering,secure multiparty computation,provable security
Data mining,Homomorphic encryption,Secure multi-party computation,Secret sharing,Cryptographic protocol,Computer security,Computer science,Server,Encryption,Cluster analysis,Cloud computing
Conference
Volume
ISSN
ISBN
6122
0302-9743
3-642-13600-1
Citations 
PageRank 
References 
17
0.74
24
Authors
4
Name
Order
Citations
PageRank
Maneesh Upmanyu1331.78
Anoop M. Namboodiri225526.36
Kannan Srinathan342241.70
C. V. Jawahar41700148.58