Abstract | ||
---|---|---|
Understanding characteristics of network traffic in a Hadoop cluster is a key to check existing problems in order to improve them for better performance of MapReduce operations. However, current works is focusing on analyzing MapReduce performance within one single data center, but network traffic in a geo-distributed data centers environment has not been well-studied yet. In this paper, we study the network traffic characteristics in geo-distributed data centers and identify some interesting results. We first construct geo-distributed data centers by adding latency among data center clusters with 18 data nodes. Then we collect traffic log data by running MapReduce applications on the geo-distributed data centers. Finally, by analyzing the log data, we found some interesting results for our future research. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/ICAwST.2015.7314042 | 2015 IEEE 7th International Conference on Awareness Science and Technology (iCAST) |
Keywords | Field | DocType |
Hadoop,big data infrastructure,MapReduce,Geo-distributed data center,network traffic | Data mining,Latency (engineering),Computer science,Data center,Database | Conference |
ISSN | Citations | PageRank |
2325-5986 | 1 | 0.35 |
References | Authors | |
4 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
yutaka koshiba | 1 | 1 | 0.35 |
Wuhui Chen | 2 | 307 | 34.07 |
yuichi yamada | 3 | 1 | 0.35 |
Takazumi Tanaka | 4 | 6 | 1.47 |
Incheon Paik | 5 | 241 | 38.80 |