Title
Using Evolutive Summary Counters for Efficient Cooperative Caching in Search Engines
Abstract
We propose and analyze a distributed cooperative caching strategy based on the Evolutive Summary Counters (ESC), a new data structure that stores an approximated record of the data accesses in each computing node of a search engine. The ESC capture the frequency of accesses to the elements of a data collection, and the evolution of the access patterns for each node in a network of computers. The ESC can be efficiently summarized into what we call ESC-summaries to obtain approximate statistics of the document entries accessed by each computing node. We use the ESC-summaries to introduce two algorithms that manage our distributed caching strategy, one for the distribution of the cache contents, ESC-placement, and another one for the search of documents in the distributed cache, ESC-search. While the former improves the hit rate of the system and keeps a large ratio of data accesses local, the latter reduces the network traffic by restricting the number of nodes queried to find a document. We show that our cooperative caching approach outperforms state-of-the-art models in both hit rate, throughput, and location recall for multiple scenarios, i.e., different query distributions and systems with varying degrees of complexity.
Year
DOI
Venue
2012
10.1109/TPDS.2011.162
IEEE Trans. Parallel Distrib. Syst.
Keywords
Field
DocType
hit rate,data collection,cache content,efficient cooperative caching,caching strategy,data access,computing node,cooperative caching approach,search engines,evolutive summary counters,new data structure,cooperative caching strategy,document entry,data structures,distributed systems,evolutionary computation,approximation theory,search engine,distributed system,radiation detectors,statistics,data structure,radiation detector,distributed processing
Hit rate,Data structure,Data collection,Search engine,Computer science,Cache,Distributed cache,Computer network,Evolutionary computation,Throughput,Distributed computing
Journal
Volume
Issue
ISSN
23
4
1045-9219
Citations 
PageRank 
References 
2
0.39
0
Authors
4
Name
Order
Citations
PageRank
David Dominguez-Sal118916.35
Josep Aguilar-Saborit2868.01
Mihai Surdeanu32582174.69
Josep Lluis Larriba-Pey4102.64