Research On Scheduling Scheme For Hadoop Clusters - Citegraph

Paper Info

Title
Research On Scheduling Scheme For Hadoop Clusters

Abstract
In this paper, we import a prefetching mechanism into MapReduce model while retaining compatibility with the native Hadoop. Given a data-intensive application running on a Hadoop MapReduce cluster, our approach estimates the execution time of each task and adaptively preloads an amount of data to the memory before the new task is assigned to the computing node. We implement a predictive schedule and prefetching (PSP) mechanism, which is integrated into the native MapReduce runtime system. We also evaluate performance on a 10-node cluster using two popular benchmarks-grep and wordcount. The PSP mechanism reduces the execution time of grep and wordcount up to 28 % with an average of 19%. Moreover, the PSP model increases the overall throughput and improves the I/O utilization. Because of the limitation of length, we did not present the experiment result detail in this paper.

Year	DOI	Venue
2013	10.1016/j.procs.2013.05.423	2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE
Keywords	Field	DocType
MapReduce, Prefetching, Schedule	Cluster (physics),Scheduling (computing),Computer science,Parallel computing,Execution time,Throughput,Runtime system	Conference
Volume	ISSN	Citations
18	1877-0509	9
PageRank	References	Authors
0.57	5	6

Authors (6 rows)

Cited by (9 rows)

References (5 rows)

Name	Order	Citations	PageRank
Jiong Xie	1	161	10.15
Fanjun Meng	2	25	5.43
Hailong Wang	3	9	0.57
HongFang Pan	4	9	0.57
JinHong Cheng	5	9	0.57
Xiao Qin	6	1836	125.69

1