Title | ||
---|---|---|
An Extended Roofline Model with Communication-Awareness for Distributed-Memory HPC Systems. |
Abstract | ||
---|---|---|
Performance modeling of parallel applications on distributed memory systems is a challenging task due to the effects of CPU speed, memory access time, and communication cost. In this paper, we propose a simple and intuitive graphical model, which extends the widely used Roofline performance model to include the communication cost in addition to the memory access time and the peak CPU performance. This new performance model inherits the simplicity of the original Roofline model and enables performance evaluation on a third dimension of communication performance. Such a model will greatly facilitate and expedite the analysis, development and optimization of parallel programs on high-end computer systems. We empirically validate the extended new Roofline model usingfl oating-point-computation-bound, memory-bound, and communication-bound applications. Three distinct high-end computing platforms have been tested: 1) high performance computing (HPC) systems, 2) high throughput computing systems, and 3) cloud computing systems. Our experimental results with four different parallel applications show that the new model can approximately evaluate the performance of different programs on various distributed-memory systems. Furthermore, the extended new model is able to provide insight into how the problem size can affect the upper bound performance of parallel applications, which is a special property revealed by the new dimension of communication cost analysis.
|
Year | DOI | Venue |
---|---|---|
2019 | 10.1145/3293320.3293321 | HPC Asia |
Keywords | Field | DocType |
Roofline model, performance prediction, visualization | Computer performance,Access time,Supercomputer,High-throughput computing,Computer science,Distributed memory,Graphical model,Instructions per second,Performance prediction,Distributed computing | Conference |
ISBN | Citations | PageRank |
978-1-4503-6632-8 | 0 | 0.34 |
References | Authors | |
13 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
David Cardwell | 1 | 0 | 0.34 |
Fengguang Song | 2 | 232 | 19.88 |