Abstract | ||
---|---|---|
Temporal data, a sequence of data tuples measured at successive time instances, is typically very large. Hence, instead of mining the entire data set, we divide it into several smaller intervals of interest, which we call temporal neighborhoods. In this paper we propose an approach to generate temporal neighborhoods through unequal depth discretization. We describe two novel algorithms: (a) Similarity based Merging (SMerg) and (b) Stationary distribution based Merging (StMerg). These algorithms are based on the robust framework of Markov models and the Markov stationary distribution, respectively. We identify temporal neighborhoods with distinct demarcations based on unequal depth discretization of the data. We discuss detailed experimental results on both synthetic and real-world data. Specifically, we show (i) the efficacy of our approach through precision and recall of labeled bins, (ii) ground truth validation on real-world datasets, and (iii) knowledge discovery in the temporal neighborhoods, such as global anomalies. Our results, validated against ground truth from real-world traffic data, indicate that the approach identifies valuable knowledge. |
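
The abstract names the Markov stationary distribution as the basis of StMerg but does not spell out the computation. As a rough illustration of that concept only, and not the authors' algorithm, the sketch below approximates the stationary distribution of a small row-stochastic transition matrix by power iteration. The function name `stationary_distribution`, the tolerance, and the 3-state matrix `P` are hypothetical choices made for this example.

```python
def stationary_distribution(P, tol=1e-12, max_iter=10_000):
    """Approximate pi such that pi = pi * P for a row-stochastic matrix P.

    Illustrative power iteration; assumes the chain is irreducible and
    aperiodic so that the iteration converges to a unique distribution.
    """
    n = len(P)
    pi = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(max_iter):
        # One step of the chain: nxt[j] = sum_i pi[i] * P[i][j]
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


# Hypothetical transition matrix over three adjacent bins (each row sums to 1).
P = [
    [0.50, 0.50, 0.00],
    [0.25, 0.50, 0.25],
    [0.00, 0.50, 0.50],
]
pi = stationary_distribution(P)
print([round(p, 4) for p in pi])
```

For this particular matrix the long-run mass concentrates on the middle state. How the paper builds its transition matrix from the discretized bins, and how it uses the resulting distribution to merge them, is not stated in the abstract and is not assumed here.
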
Year | DOI | Venue |
---|---|---|
2009 | 10.1109/ICDM.2009.26 | ICDM |
Keywords | Field | DocType |
entire data, temporal neighborhood discovery, real world data, huge data, markov models, ground truth validation, temporal data, markov stationary distribution, real world datasets, real world traffic data, unequal depth discretization, temporal neighborhood, knowledge discovery, ground truth, markov processes, stationary distribution, merging, statistical distributions, bismuth, data mining, discretization, markov model, knowledge base, data models, computational modeling | Data mining, Data modeling, Markov process, Computer science, Temporal database, Artificial intelligence, Pattern recognition, Markov model, Markov chain, Precision and recall, Ground truth, Knowledge extraction, Machine learning | Conference |
ISSN | Citations | PageRank |
1550-4786 | 4 | 0.45 |
References | Authors | |
9 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sandipan Dey | 1 | 54 | 6.68 |
Vandana P. Janeja | 2 | 141 | 18.93 |
Aryya Gangopadhyay | 3 | 391 | 112.49 |