Title
CLARO: modeling and processing uncertain data streams
Abstract
Uncertain data streams, where data are incomplete and imprecise, have been observed in many environments. Feeding such data streams to existing stream systems produces results of unknown quality, which is of paramount concern to monitoring applications. In this paper, we present the claro system that supports stream processing for uncertain data naturally captured using continuous random variables. claro employs a unique data model that is flexible and allows efficient computation. Built on this model, we develop evaluation techniques for relational operators by exploring statistical theory and approximation. We also consider query planning for complex queries given an accuracy requirement. Evaluation results show that our techniques can achieve high performance while satisfying accuracy requirements and outperform state-of-the-art sampling methods.
Year
DOI
Venue
2012
10.1007/s00778-011-0261-7
VLDB J.
Keywords
Field
DocType
Uncertain data streams,Continuous uncertainty,Data models,Query processing
Data mining,Data modeling,Data stream mining,Computer science,Uncertain data,Sampling (statistics),Relational operator,Statistical theory,Stream processing,Data model,Database
Journal
Volume
Issue
ISSN
21
5
1066-8888
Citations 
PageRank 
References 
24
0.85
25
Authors
5
Name
Order
Citations
PageRank
Thanh T. Tran1241.19
Liping Peng21077.50
Yanlei Diao32234108.95
Andrew Mcgregor4134064.31
Anna Liu544134.75