Title
NativeTask: A Hadoop compatible framework for high performance
Abstract
Although Hadoop MapReduce provides good programming abstractions and horizontal scalability, it is often blamed for its poor single node performance. In the meantime, MapReduce has already achieved a large install base, thus any performance improvement should keep the compatibility. In this paper, we address the challenges via several approaches guided by low-level performance analysis. And we materialize the approaches via NativeTask, a high-performance, fully compatible MapReduce execution engine. We evaluate its performance with representative HiBench workloads. The results show that the speedup NativeTask achieves ranges from 10% to 160%, and it paves the way for a better MapReduce that excels on both single node performance and scalability. In the future, hardware acceleration can also be applied to further improve the system's efficiency.
Year
DOI
Venue
2013
10.1109/BigData.2013.6691703
BigData Conference
Keywords
Field
DocType
nativetask,high performance,c++ implementation,low-level performance analysis,cache-oblivious sort,hardware acceleration,programming abstractions,representative hibench workloads,cpu-bound application,system efficiency,hadoop compatible framework,software performance evaluation,compatibility,hadoop,horizontal scalability,hadoop mapreduce,distributed processing
Compatibility (mechanics),Computer science,Parallel computing,Hardware acceleration,Speedup,Performance improvement,Scalability
Conference
Volume
Issue
ISSN
null
null
2639-1589
Citations 
PageRank 
References 
2
0.39
13
Authors
9
Name
Order
Citations
PageRank
Dong Yang1595.36
Xiang Zhong250.86
Dong Yan3176.40
Fangqin Dai420.39
Xusen Yin551.19
Cheng Lian63129.99
Zhongliang Zhu720.39
Weihua Jiang820.39
Gansha Wu91079.06