Title
On the bias of traceroute sampling: Or, power-law degree distributions in regular graphs
Abstract
Understanding the structure of the Internet graph is a crucial step for building accurate network models and designing efficient algorithms for Internet applications. Yet, obtaining its graph structure is a surprisingly difficult task, as edges cannot be explicitly queried. Instead, empirical studies rely on traceroutes to build what are essentially single-source, all-destinations, shortest-path trees. These trees only sample a fraction of the network's edges, and a recent paper by Lakhina et al. found empirically that the resuting sample is intrinsically biased. For instance, the observed degree distribution under traceroute sampling exhibits a power law even when the underlying degree distribution is Poisson.In this paper, we study the bias of traceroute sampling systematically, and, for a very general class of underlying degree distributions, calculate the likely observed distributions explicitly. To do this, we use a continuous-time realization of the process of exposing the BFS tree of a random graph with a given degree distribution, calculate the expected degree distribution of the tree, and show that it is sharply concentrated. As example applications of our machinery, we show how traceroute sampling finds power-law degree distributions in both δ-regular and Poisson-distributed random graphs. Thus, our work puts the observations of Lakhina et al. on a rigorous footing, and extends them to nearly arbitrary degree distributions.
Year
DOI
Venue
2005
10.1145/1538902.1538905
symposium on the theory of computing
Keywords
DocType
Volume
traceroute,degree distribution,accurate network model,power-law degree distribution,Internet Protocol,traceroute sampling,internet topology,Internet application,Poisson-distributed random graph,arbitrary degree distribution,graph structure,sampling bias,regular graph,underlying degree distribution
Journal
56
Issue
ISSN
ISBN
4
Proc. 37th ACM Symposium on Theory of Computing (STOC) 2005
1-58113-960-8
Citations 
PageRank 
References 
89
16.88
23
Authors
4
Name
Order
Citations
PageRank
Dimitris Achlioptas12037174.89
Aaron Clauset22033146.18
David Kempe35891403.41
Cristopher Moore41765160.55