Title
Investigation of network traffic in geo-distributed data centers
Abstract
Understanding characteristics of network traffic in a Hadoop cluster is a key to check existing problems in order to improve them for better performance of MapReduce operations. However, current works is focusing on analyzing MapReduce performance within one single data center, but network traffic in a geo-distributed data centers environment has not been well-studied yet. In this paper, we study the network traffic characteristics in geo-distributed data centers and identify some interesting results. We first construct geo-distributed data centers by adding latency among data center clusters with 18 data nodes. Then we collect traffic log data by running MapReduce applications on the geo-distributed data centers. Finally, by analyzing the log data, we found some interesting results for our future research.
Year
DOI
Venue
2015
10.1109/ICAwST.2015.7314042
2015 IEEE 7th International Conference on Awareness Science and Technology (iCAST)
Keywords
Field
DocType
Hadoop,big data infrastructure,MapReduce,Geo-distributed data center,network traffic
Data mining,Latency (engineering),Computer science,Data center,Database
Conference
ISSN
Citations 
PageRank 
2325-5986
1
0.35
References 
Authors
4
5
Name
Order
Citations
PageRank
yutaka koshiba110.35
Wuhui Chen230734.07
yuichi yamada310.35
Takazumi Tanaka461.47
Incheon Paik524138.80