Title
Factor Analysis Of Internet Traffic Destinations From Similar Source Networks
Abstract
Purpose - This study aims to assess whether similar user populations in the Internet produce similar geographical traffic destination patterns on a per-country basis.Design/methodology/approach - The authors collected a country-wide NetFlow trace, which encompasses the whole Spanish academic network. Such a trace comprises several similar campus networks in terms of population size and structure. To compare their behaviors, the authors propose a mixture model, which is primarily based on the Zipf-Mandelbrot power law to capture the heavy-tailed nature of the per-country traffic distribution. Then, factor analysis is performed to understand the relation between the response variable, number of bytes or packets per day, with dependent variables such as the source IP network, traffic direction, and country.Findings - Surprisingly, the results show that the geographical distribution is strongly dependent on the source IP network. Furthermore, even though there are thousands of users in a typical campus network, it turns out that the aggregation level which is required to observe a stable geographical pattern is even larger.Practical implications - Based on these findings, conclusions drawn for one network cannot be directly extrapolated to different ones. Therefore, ISPs' traffic measurement campaigns should include an extensive set of networks to cope with the space diversity, and also encompass a significant period of time due to the large transient time.Originality/value - Current state of the art includes some analysis of geographical patterns, but not comparisons between networks with similar populations. Such comparison can be useful for the design of content distribution networks and the cost-optimization of peering agreements.
Year
DOI
Venue
2012
10.1108/10662241211199951
INTERNET RESEARCH
Keywords
Field
DocType
Content distribution networks, Factor analysis, Geographical characterization, Heavy-hitters, Internet remote host location, Zipf-Mandelbrot, Geography
Byte,Computer science,NetFlow,Network packet,Computer network,Internet protocol suite,Variables,Mixture model,Internet traffic,The Internet
Journal
Volume
Issue
ISSN
22
1
1066-2243
Citations 
PageRank 
References 
1
0.36
33
Authors
4
Name
Order
Citations
PageRank
Felipe Mata1142.93
José Luis García-Dorado29513.01
Javier Aracil321342.23
Jorge E. López de Vergara418726.98