Title
Caching techniques for XML message filtering
Abstract
An XML publish/subscribe system is based on filtering XML message streams for a large number of subscriptions expressed in XPath. A major issue on an XML-based publish/subscribe system is its performance. As the number of XML documents and XPath-based subscriptions increases in the system, to provide XML filtering efficiently becomes a challenging problem. Hence, there is an urgent need for optimization techniques to meet this challenge. There are many existing approaches on designing efficient XML filtering engine. Most existing research efforts focus on efficient filtering algorithms for achieving a high system performance or supporting more complex XPath syntax. Each proposed scheme has its advantages and limitations. Not much research, however, has considered using caching in the context of XML filtering. In this paper, we propose two caching schemes to be used in conjunction with an XML filtering engine. First, we present a complete message caching algorithm that is a strict caching policy to reduce the computation cost that accrues from multiple filtering of the same messages, by reusing results of previously processed messages. Second, we investigate a structure-based caching method that is an approximate caching policy for messages sharing the same structure. Performance evaluation for synthetic data and real data both show that complete message caching and structure-based caching schemes are able to achieve significantly better filtering performance (up to 80% for both caching schemes for the message streams experimented with).
Year
DOI
Venue
2009
10.1109/PCCC.2009.5403839
IPCCC
Keywords
Field
DocType
optimisation,caching policy,structure-based caching method,xml documents,xml message streams,xml,cache storage,xml-based publish/subscribe system,information filtering,optimization techniques,xml message filtering,xml publish/subscribe system,caching techniques,messages sharing,xml filtering engine,filtering algorithms,performance evaluation,middleware,message passing,caching,publish/subscribe,xpath-based subscriptions,document handling,xpath syntax,message caching algorithm,publish subscribe,xml document,system performance,synthetic data
Middleware,XML Encryption,Efficient XML Interchange,Streaming XML,XML,Computer science,Computer network,XPath,Message passing,XML Signature,Distributed computing
Conference
ISSN
ISBN
Citations 
1097-2641
978-1-4244-5737-3
0
PageRank 
References 
Authors
0.34
14
3
Name
Order
Citations
PageRank
Yang Cao142.16
Shikharesh Majumdar243575.95
Chung-Horng Lung368780.56