Abstract | ||
---|---|---|
Privacy-preserving data mining--developing models without seeing the data --- is receiving growing attention. This paper assumes a privacy-preserving distributed data mining scenario: data sources collaborate to develop a global model, but must not disclose their data to others. The problem of secure distributed classification is an important one. In many situations, data is split between multiple organizations. These organizations may want to utilize all of the data to create more accurate predictive models while revealing neither their training data/databases nor the instances to be classified. Naïve Bayes is often used as a baseline classifier, consistently providing reasonable classification performance. This paper brings privacy-preservation to that baseline, presenting protocols to develop a Naïve Bayes classifier on both vertically as well as horizontally partitioned data. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1007/s00778-006-0041-y | The Vldb Journal |
Keywords | DocType | Volume |
Data mining,Privacy,Security,Naïve Bayes,Distributed computing | Journal | 17 |
Issue | ISSN | Citations |
4 | 1066-8888 | 69 |
PageRank | References | Authors |
2.39 | 30 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jaideep Vaidya | 1 | 2778 | 171.18 |
Murat Kantarcıoğlu | 2 | 69 | 2.39 |
Chris Clifton | 3 | 3327 | 544.44 |