Abstract | ||
---|---|---|
Outer joins are ubiquitous in many workloads and Big Data systems. The question of how to best execute outer joins in large parallel systems is particularly challenging, as real world datasets are characterized by data skew leading to performance issues. Although skew handling techniques have been extensively studied for inner joins, there is little published work solving the corresponding problem... |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/TCC.2015.2487965 | IEEE Transactions on Cloud Computing |
Keywords | Field | DocType |
Cloud computing,Density estimation robust algorithm,Sparks,Tin,Silicon,Histograms,Load management | Load management,Joins,Spark (mathematics),Computer science,Load balancing (computing),Real-time computing,Skew,Hash function,Scalability,Cloud computing,Distributed computing | Journal |
Volume | Issue | ISSN |
6 | 2 | 2168-7161 |
Citations | PageRank | References |
3 | 0.39 | 0 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Long Cheng | 1 | 91 | 16.99 |
Spyros Kotoulas | 2 | 590 | 46.46 |