Title
Modeling Randomized Data Streams In Caching, Data Processing, And Crawling Applications
Abstract
Many BigData applications (e.g., MapReduce, web caching, search in large graphs) process streams of random key-value records that follow highly skewed frequency distributions. In this work, we first develop stochastic models for the probability to encounter unique keys during exploration of such streams and their growth rate over time. We then apply these models to the analysis of LRU caching, MapReduce overhead, and various crawl properties (e.g., node-degree bias, frontier size) in random graphs.
Year
Venue
Field
2015
2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (INFOCOM)
Graph,Data processing,Data stream mining,Frequency distribution,Random graph,Crawling,Computer science,Stochastic modelling,Big data,Distributed computing
DocType
ISSN
Citations 
Conference
0743-166X
1
PageRank 
References 
Authors
0.35
9
2
Name
Order
Citations
PageRank
Sarker Tanzir Ahmed161.11
Dmitri Loguinov2129891.08