Title
Pattern discovery in distributed databases
Abstract
Most algorithms for learning and pattern discovery in data assume that all the needed data is available on one computer at a single site. This assumption does not hold in situations where a number of independent databases reside on geographically distributed nodes of a computer network. These databases cannot be moved to a single site due to size, security, privacy and data-ownership concerns but all of them together constitute the dataset in which patterns must be discovered. Some pattern discovery algorithms can be adapted to such situations and some others become inefficient or inapplicable. In this paper we show how a decision-tree induction algorithm may be adapted for distributed data situations. We also discuss some general issues relating to the adaptability of other pattern discovery algorithms to distributed data situations
Year
Venue
Keywords
1997
AAAI/IAAI
single site,data-ownership concern,data situation,pattern discovery algorithm,pattern discovery,general issue,decision-tree induction algorithm,independent databases,computer network,needed data,distributed database
Field
DocType
ISBN
Adaptability,Data mining,Information retrieval,Computer science,Artificial intelligence,Distributed database,Machine learning
Conference
0-262-51095-2
Citations 
PageRank 
References 
15
1.44
8
Authors
2
Name
Order
Citations
PageRank
Raj Bhatnagar119139.25
Sriram Srinivasan237927.92