Title
Research On Scheduling Scheme For Hadoop Clusters
Abstract
In this paper, we import a prefetching mechanism into MapReduce model while retaining compatibility with the native Hadoop. Given a data-intensive application running on a Hadoop MapReduce cluster, our approach estimates the execution time of each task and adaptively preloads an amount of data to the memory before the new task is assigned to the computing node. We implement a predictive schedule and prefetching (PSP) mechanism, which is integrated into the native MapReduce runtime system. We also evaluate performance on a 10-node cluster using two popular benchmarks-grep and wordcount. The PSP mechanism reduces the execution time of grep and wordcount up to 28 % with an average of 19%. Moreover, the PSP model increases the overall throughput and improves the I/O utilization. Because of the limitation of length, we did not present the experiment result detail in this paper.
Year
DOI
Venue
2013
10.1016/j.procs.2013.05.423
2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE
Keywords
Field
DocType
MapReduce, Prefetching, Schedule
Cluster (physics),Scheduling (computing),Computer science,Parallel computing,Execution time,Throughput,Runtime system
Conference
Volume
ISSN
Citations 
18
1877-0509
9
PageRank 
References 
Authors
0.57
5
6
Name
Order
Citations
PageRank
Jiong Xie116110.15
Fanjun Meng2255.43
Hailong Wang390.57
HongFang Pan490.57
JinHong Cheng590.57
Xiao Qin61836125.69