Title
Aggregation-Aware Compression of Probabilistic Streaming Time Series
Abstract
In recent years, there has been a growing interest for probabilistic data management. We focus on probabilistic time series where a main characteristic is the high volumes of data, calling for efficient compression techniques. To date, most work on probabilistic data reduction has provided synopses that minimize the error of representation w.r.t. the original data. However, in most cases, the compressed data will be meaningless for usual queries involving aggregation operators such as SUM or AVG. We propose PHA Probabilistic Histogram Aggregation, a compression technique whose objective is to minimize the error of such queries over compressed probabilistic data. We incorporate the aggregation operator given by the end-user directly in the compression technique, and obtain much lower error in the long term. We also adopt a global error aware strategy in order to manage large sets of probabilistic time series, where the available memory is carefully balanced between the series, according to their individual variability.
Year
DOI
Venue
2015
10.1007/978-3-319-21024-7_16
Machine Learning and Data Mining in Pattern Recognition
Field
DocType
Volume
Divergence-from-randomness model,Histogram,Data mining,Computer science,Probabilistic analysis of algorithms,Compression ratio,Operator (computer programming),Artificial intelligence,Probabilistic logic,Data management,Machine learning,Probabilistic database
Conference
9166
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
11
2
Name
Order
Citations
PageRank
Reza Akbarinia125425.77
Florent Masseglia240843.08