Abstract | ||
---|---|---|
Lattice Boltzmann method (LBM) is an important computational fluid dynamics (CFD) approach to solving the Naiver-Stokes equations and simulating complex fluid flows. LBM is also well known as a memory bound problem and its performance is limited by the memory access time on modern computer systems. In this paper, we design and develop both sequential and parallel memory-aware algorithms to optimize the performance of LBM. The new memory-aware algorithms can enhance data reuses across multiple time steps to further improve the performance of the original and fused LBM. We theoretically analyze the algorithms to provide an insight into how data reuses occur in each algorithm. Finally, we conduct experiments and detailed performance analysis on two different manycore systems. Based on the experimental results, the parallel memory-aware LBM algorithm can outperform the fused LBM by up to 292% on the Intel Haswell system when using 28 cores, and by 302 % on the Intel Skylake system when using 48 cores. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/CAHPC.2018.8645909 | 2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) |
Keywords | Field | DocType |
Lattice Boltzmann method,Memory aware parallel algorithms,Multicore systems | Access time,Computer science,Parallel computing,Lattice Boltzmann methods,Algorithm,Complex fluid,Computational fluid dynamics,Multicore systems | Conference |
ISSN | ISBN | Citations |
1550-6533 | 978-1-5386-7769-8 | 1 |
PageRank | References | Authors |
0.35 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yuankun Fu | 1 | 1 | 0.69 |
Feng Li | 2 | 338 | 49.66 |
Fengguang Song | 3 | 232 | 19.88 |
Luoding Zhu | 4 | 31 | 5.28 |