Abstract |
---|
We develop efficient CPU kernels for multiphase compressible flows and evaluate different optimization strategies. The presented software achieves up to 48% of peak performance on shared-memory architectures, outperforming what is considered the state of the art by 9-14X. On 48-core CPUs we observe speedups of 40-45X and measure up to 360 GFLOP/s out of a peak of 840 GFLOP/s. |

Year | DOI | Venue |
---|---|---|
2012 | 10.1007/978-3-642-38718-0_22 | HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2012 |

Field | DocType | Volume |
---|---|---|
Compressibility, Memory bandwidth, Shared memory, Computer science, Parallel computing, Software, Compressible flow | Conference | 7851 |

ISSN | Citations | PageRank |
---|---|---|
0302-9743 | 0 | 0.34 |

References | Authors |
---|---|
10 | 4 |

Name | Order | Citations | PageRank |
---|---|---|---|
Babak Hejazialhosseini | 1 | 68 | 6.10 |
Christian Conti | 2 | 24 | 2.70 |
Diego Rossinelli | 3 | 119 | 10.43 |
Petros Koumoutsakos | 4 | 1065 | 84.99 |