Title |
---|
TOD-Tree: Task-Overlapped Direct Send Tree Image Compositing for Hybrid MPI Parallelism and GPUs |
Abstract |
---|
Modern supercomputers have thousands of nodes, each with CPUs and/or GPUs capable of several teraflops. However, the network connecting these nodes is relatively slow, on the order of gigabits per second. For time-critical workloads such as interactive visualization, the bottleneck is no longer computation but communication. In this paper, we present an image compositing algorithm that works on both CPU-only and GPU-accelerated supercomputers and focuses on communication avoidance and overlapping communication with computation at the expense of evenly balancing the workload. The algorithm has three stages: a parallel direct send stage, followed by a tree compositing stage and a gather stage. We compare our algorithm with radix-k and binary-swap from the IceT library in a hybrid OpenMP/MPI setting on the Stampede and Edison supercomputers, show strong scaling results and explain how we generally achieve better performance than these two algorithms. We developed a GPU-based image compositing algorithm where we use CUDA kernels for computation and GPU Direct RDMA for inter-node GPU communication. We tested the algorithm on the Piz Daint GPU-accelerated supercomputer and show that we achieve performance on par with CPUs. Last, we introduce a workflow in which both rendering and compositing are done on the GPU. |
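
The three stages named in the abstract (direct send, tree compositing, gather) can be illustrated with a small serial simulation. This is only a sketch under stated assumptions, not the authors' implementation: it assumes ranks are already ordered front to back, that groups consist of `group_size` consecutive ranks, that pixels are premultiplied (color, alpha) pairs, and that the tree stage is binary. All function names are hypothetical, and no MPI, OpenMP, or CUDA communication is modeled.

```python
from functools import reduce


def over(front, back):
    """Front-to-back 'over' operator on premultiplied (color, alpha) pixels."""
    cf, af = front
    cb, ab = back
    return (cf + (1.0 - af) * cb, af + (1.0 - af) * ab)


def blend(front_img, back_img):
    """Composite two equally sized pixel lists, front over back."""
    return [over(f, b) for f, b in zip(front_img, back_img)]


def split(n_pixels, parts):
    """Return (start, stop) bounds of `parts` near-equal contiguous regions."""
    bounds = [n_pixels * i // parts for i in range(parts + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


def composite_three_stage(images, group_size):
    """Simulate direct send, tree compositing, and gather on one process.

    images[r] is the full image rendered by rank r; rank 0 is frontmost.
    """
    n_ranks, n_pixels = len(images), len(images[0])
    assert n_ranks % group_size == 0, "sketch assumes equal-sized groups"
    regions = split(n_pixels, group_size)
    groups = [images[g:g + group_size] for g in range(0, n_ranks, group_size)]

    # Stage 1 (direct send): inside each group, member j collects region j
    # from every member and composites the pieces in depth order.
    partial = [
        [reduce(blend, [img[lo:hi] for img in group]) for (lo, hi) in regions]
        for group in groups
    ]

    # Stage 2 (tree): for each region, the owners from all groups merge
    # their partial results pairwise, keeping depth order.
    final_regions = []
    for j in range(group_size):
        level = [partial[g][j] for g in range(len(groups))]
        while len(level) > 1:
            merged = [blend(level[i], level[i + 1])
                      for i in range(0, len(level) - 1, 2)]
            if len(level) % 2:  # odd one out moves up unchanged
                merged.append(level[-1])
            level = merged
        final_regions.append(level[0])

    # Stage 3 (gather): the finished regions are concatenated on one rank.
    return [px for region in final_regions for px in region]


def composite_reference(images):
    """Plain sequential front-to-back compositing, used as a check."""
    return reduce(blend, images)
```

Because the over operator is associative, the staged result should match `composite_reference` up to floating-point rounding, which is what makes regrouping the work into stages legitimate in this simplified setting.
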
Year | DOI | Venue |
---|---|---|
2017 | 10.1109/TVCG.2016.2542069 | IEEE Trans. Vis. Comput. Graph. |
Keywords | Field | DocType |
Graphics processing units, Rendering (computer graphics), Supercomputers, Parallel processing, Data visualization, Loading, Message systems | Bottleneck, Data visualization, Supercomputer, Computer science, CUDA, Parallel computing, Theoretical computer science, Remote direct memory access, Rendering (computer graphics), Compositing, Hardware architecture | Journal |
Volume | Issue | ISSN |
23 | 6 | 1077-2626 |
Citations | PageRank | References |
2 | 0.38 | 22 |
Authors |
---|
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
A. V. Pascal Grosset | 1 | 22 | 2.52 |
Manasa Prasad | 2 | 4 | 2.72 |
Cameron Christensen | 3 | 19 | 3.50 |
Aaron Knoll | 4 | 300 | 21.33 |
Charles Hansen | 5 | 1574 | 95.02 |