Title
Using the Macroflow Abstraction to Minimize Machine Slot-time Spent on Networking in Hadoop.
Abstract
Machine slot-time spent on data transmission has a direct impact on average job completion time (JCT). In this paper, we propose Macroflow, a networking abstraction that captures the primitive scheduling granularity of machine slot-time. We demonstrate that minimizing machine slot-time is equivalent to minimizing the average macroflow completion time (MCT). We prove that minimizing MCT is strongly NP-hard and therefore focus on developing effective heuristics. We propose the Smallest-Macroflow-First (SMF) and Smallest-Average-Macroflow-First (SAMF) heuristics, which greedily schedule macroflows based on their network footprint. To work with existing commodity switches, we discretize priorities, classifying macroflows into a small number of priority queues.
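The smallest-first idea and the priority discretization step mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the abstract does not define a macroflow's network footprint, so the code assumes it is the total number of bytes in a group of flows, assumes a single bottleneck link that serves one macroflow at a time, and uses made-up queue thresholds. The names `Macroflow`, `smallest_first`, `average_completion_time`, and `discretize` are hypothetical.

```python
import bisect
from dataclasses import dataclass


@dataclass
class Macroflow:
    """Hypothetical container: a named group of flows with sizes in bytes."""
    name: str
    flow_sizes: list

    @property
    def footprint(self) -> int:
        # Assumption: the footprint is the total number of bytes in the group.
        return sum(self.flow_sizes)


def smallest_first(macroflows):
    """Return macroflows ordered by ascending footprint."""
    return sorted(macroflows, key=lambda m: m.footprint)


def average_completion_time(order, capacity):
    """Average completion time when macroflows run one at a time on one link."""
    elapsed = 0.0
    total = 0.0
    for m in order:
        elapsed += m.footprint / capacity  # time at which this macroflow finishes
        total += elapsed
    return total / len(order)


def discretize(macroflows, thresholds):
    """Map each macroflow to a queue index; queue 0 has the highest priority.

    thresholds must be sorted in ascending order; len(thresholds) + 1 queues result.
    """
    return {m.name: bisect.bisect_right(thresholds, m.footprint) for m in macroflows}


arrivals = [
    Macroflow("A", [300, 300]),
    Macroflow("B", [100]),
    Macroflow("C", [200, 100]),
]
ordered = smallest_first(arrivals)
queues = discretize(arrivals, thresholds=[200, 500])
```

In this toy setting, serving the macroflows smallest-first lowers the average completion time compared with serving them in arrival order, and the three macroflows fall into three distinct queues. Real switches expose only a handful of queues, which is why a continuous ranking has to be collapsed into a few priority classes.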
Year: 2018
Venue: APNet
DocType: Conference
ISBN: 978-1-4503-6395-2
Citations: 0
PageRank: 0.34
References: 11
Authors: 10
Name            Order  Citations  PageRank
Bingchuan Tian  1      12         4.22
Chen Tian       2      37         8.36
Jiajun Sun      3      1          2.06
Junhua Yan      4      0          0.34
Yizhou Tang     5      5          1.15
Wei Wang        6      1458       82.69
Dai Haipeng     7      419        55.44
Nai Xia         8      5          1.14
Guihai Chen     9      3537       317.28
Wanchun Dou     10     878        96.01