Abstract | ||
---|---|---|
This paper presents benchmark results of three different parallel-programming paradigms on an unstructured shock capturing numerical code for transient problems. The three parallel programming methods include: (1) a shared-memory programming of OpenMP using cache coherent non-uniform memory access (CC-NUMA) of SGI Origin2000, (2) an MPI (Message Passing Interface) implementation and (3) a SHMEM implementation using the parallel library called "Shared Memory Access Library". The methods (2) and (3) are both based on distributed memory architecture. SGI Origin2000 is used throughout the current study. It is found that the scalability of the programming (1) is so poor that its usage for the unstructuredCFD code is impractical. The scalabilities of programming (2) and (3) are much better than programming (1) and the computational speed of giga-flops range can be achieved with 16 CPUs. The parallel programming with SHMEM libraries is approximately twice as fast as the one with MPI. |
Year | DOI | Venue |
---|---|---|
2001 | 10.1109/IPDPS.2001.924935 | IPDPS |
Keywords | Field | DocType |
unstructured shock,shmem implementation,parallel library,parallel programming method,numerical code,parallelization methods,cache coherent non-uniform memory,memory architecture,parallel programming,sgi origin2000,shmem library,shared-memory programming,cache coherence,computational fluid dynamics,non uniform memory access,message passing interface,numerical simulation,shared memory,shock waves,message passing,concurrent computing,scalability,hardware,benchmark | Shared memory,Computer science,Cache,Parallel computing,Message Passing Interface,Concurrent computing,SHMEM,Message passing,Memory architecture,Distributed computing,Scalability | Conference |
ISBN | Citations | PageRank |
0-7695-0990-8 | 0 | 0.34 |
References | Authors | |
1 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tsutomu Saito | 1 | 0 | 0.34 |
Atsushi Abe | 2 | 13 | 1.34 |
Kazuyoshi Takayama | 3 | 13 | 4.70 |