Abstract | ||
---|---|---|
We address the challenge of reconstructing historical counts from aggregated, possibly overlapping historical reports. For example, given the monthly and weekly sums, how can we find the daily counts of people infected with flu? We propose an approach, called ARES (Automatic REStoration), that performs automatic data reconstruction in two phases: (1) first, it estimates the sequence of historical counts utilizing domain knowledge, such as smoothness and periodicity of historical events; (2) then, it uses the estimated sequence to learn notable patterns in the target sequence to refine the reconstructed time series. In order to derive such patterns, ARES uses an annihilating filter technique. The idea is to learn a linear shift-invariant operator whose response to the desired sequence is (approximately) zero-yielding a set of null-space equations that the desired signal should satisfy, without the need for the accompanying data. The reconstruction accuracy can be further improved by applying the second phase iteratively. We evaluate ARES on the real epidemiological data from the Tycho project and demonstrate that ARES recovers historical data from aggregated reports with high accuracy. In particular, it considerably outperforms top competitors, including least squares approximation and the more advanced H-FUSE method (42% and 34% improvement based on average RMSE, respectively). |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/ICDE.2018.00016 | 2018 IEEE 34th International Conference on Data Engineering (ICDE) |
Keywords | Field | DocType |
Information Fusion,Information Disaggregation,Annihilating Filter | Least squares,Data mining,Data reconstruction,Domain knowledge,Computer science,Mean squared error,Operator (computer programming),Smoothness,Information fusion | Conference |
ISSN | ISBN | Citations |
1063-6382 | 978-1-5386-5521-4 | 1 |
PageRank | References | Authors |
0.36 | 6 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fan Yang | 1 | 2 | 1.39 |
Hyun Ah Song | 2 | 98 | 10.75 |
Zongge Liu | 3 | 2 | 1.39 |
Christos Faloutsos | 4 | 27972 | 4490.38 |
Vladimir Zadorozhny | 5 | 3 | 2.09 |
Nicholas D. Sidiropoulos | 6 | 1644 | 131.55 |