Abstract | ||
---|---|---|
Multidimensional discrete Fourier transforms (DFTs) are typically decomposed into multiple one-dimensional (1D) transforms. Hence, parallel implementations of any multidimentional DFT focus on parallelizing within or across the 1D DFT. Existing DFT packages exploit the inherent parallelism across the 1D DFTs and offer rigid frameworks, that cannot be extended to incorporate both forms of parallelism and various data layouts to enable some of the parallelism. However, in the era of exascale, where systems have thousand of nodes and intricate network topologies, flexibility and parallel efficiency are key aspects all multidimentional DFT frameworks need to have in order to map and scale the computation appropriately. In this work, we show the need for a versatile parallel framework that facilitates the development of a family of parallel multidimentional DFT algorithms by (1) using different data layouts to distribute the data across the compute nodes, (2) exploiting the two different parallelization schemes to different degrees, and (3) unifying the two parallelization schemes within a single framework. We show that the flexibility of selecting different parallel multidimentional DFT algorithms allows for almost linear strong scaling results for problem sizes of 1024(3) on two supercomputers, namely, RIKEN's K-Computer and Oakridge's Summit. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1137/19M1288401 | SIAM JOURNAL ON SCIENTIFIC COMPUTING |
Keywords | DocType | Volume |
3D DFTs, distributed systems, performance, scalability | Journal | 42 |
Issue | ISSN | Citations |
5 | 1064-8275 | 0 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Doru Thom Popovici | 1 | 0 | 0.34 |
Martin D Schatz | 2 | 0 | 0.34 |
Franz Franchetti | 3 | 974 | 88.39 |
Tze Meng Low | 4 | 0 | 0.34 |