Title
A Counter Based Approach For Reducer Placement With Augmented Hadoop Rack Awareness
Abstract
As the data-driven paradigm for intelligent systems design is gaining prominence, performance requirements have become very stringent, leading to numerous fine-tuned versions of Hadoop and its MapReduce programming model. However, very few researchers have investigated the effect of intelligent reducer placement on Hadoop's performance. This paper delves into this much ignored reducer placement phase for improving Hadoop's performance and proposes to spawn reduce phase of Hadoop tasks in an asynchronous fashion across nodes in a Hadoop cluster. The main contributions of this paper are: (i) to track when map phase of tasks are completed, (ii) to count the number of maps completed, and finally (iii) assign reducers to Hadoop nodes based on map counts such that run-time data copying is minimized. To this end, this paper presents a novel counter based reducer placement (CBRP) algorithm based on the counter values maintained by JobTracker at the rack and node levels. Experiments conducted demonstrate the merit of the proposed reducer placement with average improvements ranging between 5% and 17% experienced across different benchmarks with both late shuffle and early shuffle.
Year
DOI
Venue
2021
10.3906/elk-2001-106
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES
Keywords
DocType
Volume
MapReduce, HDFS, Hadoop rack awareness, reducer placement
Journal
29
Issue
ISSN
Citations 
1
1300-0632
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Wajahat Hussain Mir100.34
K Hemant Kumar Reddy200.34
Diptendu Sinha Roy300.34