Title
The Implications of Diverse Applications and Scalable Data Sets in Benchmarking Big Data Systems
Abstract
Now we live in an era of big data, and big data applications are becoming more and more pervasive. How to benchmark data center computer systems running big data applications in short big data systems is a hot topic. In this paper, we focus on measuring the performance impacts of diverse applications and scalable volumes of data sets on big data systems. For four typical data analysis applications--an important class of big data applications, we find two major results through experiments: first, the data scale has a significant impact on the performance of big data systems, so we must provide scalable volumes of data sets in big data benchmarks. Second, for the four applications, even all of them use the simple algorithms, the performance trends are different with increasing data scales, and hence we must consider not only variety of data sets but also variety of applications in benchmarking big data systems.
Year
DOI
Venue
2012
10.1007/978-3-642-53974-9-5
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Keywords
DocType
Volume
benchmarking,big data,scalable data
Conference
8163 LNCS
Issue
ISSN
Citations 
null
16113349
5
PageRank 
References 
Authors
0.56
14
8
Name
Order
Citations
PageRank
Zhen Jia133817.82
Runlin Zhou270.94
Chunge Zhu3143.13
Lei Wang457746.85
Wanling Gao529919.12
Yingjie Shi628512.17
Jianfeng Zhan776762.86
Lixin Zhang857145.96