Title
Efficient Mining of Statistical Dependencies
Abstract
The Multi-Stream Dependency Detection algorithm finds rules that capture statistical dependencies between patterns in multivariate time series of categorical data [Oates and Cohen, 1996c]. Rule strength is measured by the G statistic [Wickens, 1989], and an upper bound on the value of G for the descendants of a node allows MSDD'S search space to be pruned. However, in the worst case, the algorithm will explore exponentially many rules. This paper presents and empirically evaluates two ways of addressing this problem. The first is a set of three methods for reducing the size of MSDD'S search space based on information collected during the search process. Second, we discuss an implementation of MSDD that distributes its computations over multiple machines on a network.
Year
Venue
Keywords
1999
AISTATS
statistical dependencies,statistical dependency,multivariate time series,efficient mining,rule strength,multiple machine,search space,categorical data,search process,g statistic,paper present,multi-stream dependency detection algorithm,upper bound
Field
DocType
ISBN
Data mining,Statistic,Upper and lower bounds,Multivariate statistics,Categorical variable,Computer science,Artificial intelligence,Machine learning,Computation
Conference
1-55860-613-0
Citations 
PageRank 
References 
7
0.89
4
Authors
3
Name
Order
Citations
PageRank
Tim Oates11069190.77
Matthew D. Schmill29814.67
paul r cohen31927460.49