Abstract | ||
---|---|---|
It is a fact that the attention of research community in computer science, business executives, and decision makers is drastically drawn by big data. As the volume of data becomes bigger, it needs performance-oriented data-intensive processing frameworks such as MapReduce, which can scale computation on large commodity clusters. Hadoop MapReduce processes data in Hadoop Distributed File System as jobs scheduled according to YARN fair scheduler and capacity scheduler. However, with advancement and dynamic changes in hardware and operating environments, the performance of clusters is greatly affected. Various efforts in literature have been made to address the issues of heterogeneity (i.e., clusters consisting of virtual machines and machines with different hardware), network communication, data locality, better resource utilization, and run-time scheduling. In this paper, we present a survey to discuss various research efforts made so far to improve Hadoop MapReduce scheduling. We classify scheduling algorithms and techniques proposed in the literature so far based on their addressing areas and present a taxonomy. Furthermore, we also discuss various aspects of open issues and challenges in the scheduling of MapReduce to improve its performance. Copyright (c) 2015 John Wiley & Sons, Ltd. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1002/cpe.3578 | CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE |
Keywords | Field | DocType |
scheduling,task scheduling,job scheduling,data-intensive computing,big data,cloud | Data-intensive computing,Fair-share scheduling,Computer science,Scheduling (computing),Parallel computing,Two-level scheduling,Job scheduler,Dynamic priority scheduling,Big data,Distributed computing,Cloud computing | Journal |
Volume | Issue | ISSN |
27 | 17 | 1532-0626 |
Citations | PageRank | References |
5 | 0.39 | 33 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Muhammad Idris | 1 | 35 | 5.85 |
Shujaat Hussain | 2 | 84 | 8.87 |
Maqbool Ali | 3 | 39 | 6.89 |
Arsen Abdulali | 4 | 5 | 0.73 |
Muhammad Hameed Siddiqi | 5 | 105 | 10.85 |
Byeong Ho Kang | 6 | 541 | 72.76 |
Sungyoung Lee | 7 | 2932 | 279.41 |