Title
Efficient Skew Handling for Outer Joins in a Cloud Computing Environment.
Abstract
Outer joins are ubiquitous in many workloads and Big Data systems. The question of how to best execute outer joins in large parallel systems is particularly challenging, as real world datasets are characterized by data skew leading to performance issues. Although skew handling techniques have been extensively studied for inner joins, there is little published work solving the corresponding problem...
Year
DOI
Venue
2018
10.1109/TCC.2015.2487965
IEEE Transactions on Cloud Computing
Keywords
Field
DocType
Cloud computing,Density estimation robust algorithm,Sparks,Tin,Silicon,Histograms,Load management
Load management,Joins,Spark (mathematics),Computer science,Load balancing (computing),Real-time computing,Skew,Hash function,Scalability,Cloud computing,Distributed computing
Journal
Volume
Issue
ISSN
6
2
2168-7161
Citations 
PageRank 
References 
3
0.39
0
Authors
2
Name
Order
Citations
PageRank
Long Cheng19116.99
Spyros Kotoulas259046.46