Title
Buffering in query evaluation over XML streams
Abstract
All known algorithms for evaluating advanced XPath queries (e.g., ones with predicates or with closure axes) on XML streams employ buffers to temporarily store fragments of the document stream. In many cases, these buffers grow very large and constitute a major memory bottleneck. In this paper, we identify two broad classes of evaluation problems that independently necessitate the use of large memory buffers in evaluation of queries over XML streams: (1) full-fledged evaluation (as opposed to just filtering) of queries with predicates; (2) evaluation (whether full-fledged or filtering) of queries with "multi-variate" predicates. We prove quantitative lower bounds on the amount of memory required in each of these scenarios. The bounds are stated in terms of novel document properties that we define. We show that these scenarios, in combination with query evaluation over recursive documents, cover the cases in which large buffers are required. Finally, we present algorithms that match the lower bounds for an important fragment of XPath.
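To illustrate the first scenario (full-fledged evaluation of a query with a predicate), here is a minimal sketch. It is not the paper's algorithm. It assumes a toy query `/a[b]/c` and a stream of hypothetical `("start", name)` / `("end", name)` events. The function `evaluate` and the helper `doc` are illustrative names only. Each `c` child seen before any `b` child must be held in a buffer, because the predicate `[b]` is still unresolved at that point in the stream.

```python
def doc(children):
    """Build the event stream for <a> with the given empty child elements."""
    events = [("start", "a")]
    for name in children:
        events += [("start", name), ("end", name)]
    events.append(("end", "a"))
    return events


def evaluate(events):
    """Evaluate the toy query /a[b]/c in one pass over the events.

    Returns (results, peak): the 1-based ordinals of the selected c
    children, and the largest number of c candidates buffered at once.
    """
    results, buffer = [], []
    peak = 0
    depth = 0
    in_a = False      # is the root element named "a"?
    seen_b = False    # has a b child of the root been seen yet?
    c_count = 0       # ordinal of the most recent c child
    for kind, name in events:
        if kind == "start":
            depth += 1
            if depth == 1:
                in_a = (name == "a")
                seen_b = False
            elif depth == 2 and in_a:
                if name == "b" and not seen_b:
                    # Predicate resolved: flush all pending candidates.
                    seen_b = True
                    results.extend(buffer)
                    buffer.clear()
                elif name == "c":
                    c_count += 1
                    if seen_b:
                        results.append(c_count)   # can be output at once
                    else:
                        buffer.append(c_count)    # fate still unknown
                        peak = max(peak, len(buffer))
        else:  # "end" event
            if depth == 1:
                # Root closed without resolving the predicate: discard.
                buffer.clear()
                in_a = False
            depth -= 1
    return results, peak


print(evaluate(doc(["c", "c", "c", "b", "c"])))  # b arrives late
print(evaluate(doc(["b", "c", "c"])))            # b arrives first
```

In this sketch the peak buffer size depends on how many `c` candidates precede the first `b` in the document, not on the query alone. A filtering variant of the same toy query would only need the `seen_b` flag, which is consistent with the abstract's distinction between filtering and full-fledged evaluation.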
Year: 2005
DOI: 10.1145/1065167.1065195
Venue: PODS
Keywords: full-fledged evaluation, evaluation problem, xml stream, major memory bottleneck, large memory buffer, query evaluation, document stream, large buffer, lower bound, advanced xpath query, consistency, concurrency control, serializability, anomaly, two phase locking, snapshot isolation
Field: Bottleneck, Snapshot isolation, Serializability, XML, Concurrency control, Computer science, Two-phase locking, Theoretical computer science, XPath, Recursion
DocType: Conference
ISBN: 1-59593-062-0
Citations: 39
PageRank: 1.42
References: 23
Authors: 3
Name              Order  Citations  PageRank
Ziv Bar-Yossef    1      1776       118.00
Marcus Fontoura   2      1116       61.74
Vanja Josifovski  3      2265       148.84