Title
XQStream++: Fast tuple extraction algorithm for streaming XML data.
Abstract
Tuple extraction from streaming XML should be cost effective for real-time query evaluation. Recently, StreamTX exhibits a good performance in terms of both running time and memory usage to support the tuple extraction queries for streaming XML. However, we empirically observe that StreamTX incurs computational overhead unnecessarily, since it builds on TwigStack, an XML query processing algorithm originally developed for stored XML. In this paper, we first design a non-recursive XQStream algorithm to handle inefficient recursive calls of StreamTX. Subsequently, we extend the basic XQStream by incorporating two novel schemes: (1) the relational pointer to efficiently and effectively evaluate the structural relationship of elements, and (2) the pattern reuse to reduce redundant path evaluations for pattern matching. The performance evaluation on various datasets provides new empirical findings. First, XQStream++, which incorporates the relational pointer and the pattern reuse scheme into XQStream, significantly outperforms the state-of-the-art algorithms in running time with a small, nearly constant memory usage. Second, the most recently released XQuery engines outperform StreamTX in running time.
Year
DOI
Venue
2015
10.1016/j.ins.2014.06.041
Information Sciences
Keywords
Field
DocType
Streaming XML,Tuple extraction,Relational pointer,Pattern reuse
Pointer (computer programming),Overhead (computing),Data mining,Computer science,XML database,Artificial intelligence,Streaming XML,XML,Tuple,Parallel computing,Pattern matching,Machine learning,XQuery
Journal
Volume
ISSN
Citations 
314
0020-0255
0
PageRank 
References 
Authors
0.34
21
3
Name
Order
Citations
PageRank
Byunggul Ryu192.24
JongWoo Ha2556.79
Sangkeun Lee349865.59