Title
Ares: Automatic Disaggregation of Historical Data
Abstract
We address the challenge of reconstructing historical counts from aggregated, possibly overlapping historical reports. For example, given the monthly and weekly sums, how can we find the daily counts of people infected with flu? We propose an approach, called ARES (Automatic REStoration), that performs automatic data reconstruction in two phases: (1) first, it estimates the sequence of historical counts utilizing domain knowledge, such as smoothness and periodicity of historical events; (2) then, it uses the estimated sequence to learn notable patterns in the target sequence to refine the reconstructed time series. In order to derive such patterns, ARES uses an annihilating filter technique. The idea is to learn a linear shift-invariant operator whose response to the desired sequence is (approximately) zero-yielding a set of null-space equations that the desired signal should satisfy, without the need for the accompanying data. The reconstruction accuracy can be further improved by applying the second phase iteratively. We evaluate ARES on the real epidemiological data from the Tycho project and demonstrate that ARES recovers historical data from aggregated reports with high accuracy. In particular, it considerably outperforms top competitors, including least squares approximation and the more advanced H-FUSE method (42% and 34% improvement based on average RMSE, respectively).
Year
DOI
Venue
2018
10.1109/ICDE.2018.00016
2018 IEEE 34th International Conference on Data Engineering (ICDE)
Keywords
Field
DocType
Information Fusion,Information Disaggregation,Annihilating Filter
Least squares,Data mining,Data reconstruction,Domain knowledge,Computer science,Mean squared error,Operator (computer programming),Smoothness,Information fusion
Conference
ISSN
ISBN
Citations 
1063-6382
978-1-5386-5521-4
1
PageRank 
References 
Authors
0.36
6
6
Name
Order
Citations
PageRank
Fan Yang121.39
Hyun Ah Song29810.75
Zongge Liu321.39
Christos Faloutsos4279724490.38
Vladimir Zadorozhny532.09
Nicholas D. Sidiropoulos61644131.55