Title
Efficient Querying of Correlated Uncertain Data with Cached Results.
Abstract
Although there have been many efforts for management of uncertain data, evaluating probabilistic inference queries, a known NP-hard problem, is still a big challenge, especially for querying data with highly correlations. The state-of-art exact algorithms for accelerating the evaluation of inference queries are based on special indices. Besides, with the observation of the existence of many frequent queries, some researchers try to improve efficiency by reusing previously queried results. Indexing depends on the static properties like data distributions, whereas caching is in favor of the dynamic features like query workload. In this paper we propose a new approach for speeding up the evaluation of inference queries by caching frequent results in a junction tree-based hierarchical index. To the best of our knowledge, this is the first effort on utilizing both the static (data) and dynamic (query workload) properties to efficiently evaluate probabilistic inference queries. Moreover, according to our experience, different caching strategies may significantly affect the query performance. Basically a good caching strategy needs to have high cache hit ratio with limited space budget.Based on these considerations, we propose a novel caching approach, called FVEC, and present corresponding algorithms for efficiently querying correlated uncertain data. We further conduct a series of extensive experiments on large uncertain datasets in order to illustrate the effectiveness and efficiency of our proposed approaches. As illustrated by the results, compared with previous solutions, our method could greatly improve the query performance. © Springer-Verlag 2013.
Year
DOI
Venue
2013
10.1007/978-3-642-37487-6_35
DASFAA
Field
DocType
Volume
Probabilistic inference,Data mining,Inference,Computer science,Workload,Reuse,Cache,Search engine indexing,Uncertain data,Database
Conference
7825 LNCS
Issue
ISSN
Citations 
PART 1
16113349
0
PageRank 
References 
Authors
0.34
16
4
Name
Order
Citations
PageRank
Jinchuan Chen139018.64
Min Zhang213438.40
Xike Xie3294.07
Xiaoyong Du4882123.29