Abstract | ||
---|---|---|
Partitioned Global Address Space (PGAS) programming models like Open SHMEM and hybrid models like Open SHMEM+MPI can deliver high performance and improved programmability. However, current implementations of Open SHMEM assume a fully-connected process model which affects their performance and scalability. We address this critical issue by designing on-demand connection management support for Open SHMEM which significantly improves the startup performance and reduces the resource usage. We further enhance the Open SHMEM startup performance by utilizing non-blocking out-of-band communication APIs. We evaluate our designs using a set of micro benchmarks and applications and observe 30 times reduction in Open SHMEM initialization time and 8.3 times improvement in execution time of a Hello World application at 8,192 processes. In particular, when sufficient work can be overlapped, we show that use of non-blocking out-of-band communication APIs allow for a constant initialization cost of Open SHMEM jobs at different core counts. We also obtain up to 90% reduction in number of network endpoints and up to 35% improvement in application execution time with NAS Parallel Benchmarks. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/IPDPSW.2015.104 | IPDPS Workshops |
Keywords | Field | DocType |
On-demand Connection Management, OpenSHMEM, PGAS, Job Launch, InfiniBand | Programming paradigm,InfiniBand,Computer science,Server,Parallel computing,Memory management,Initialization,Partitioned global address space,SHMEM,Operating system,Scalability,Distributed computing | Conference |
Citations | PageRank | References |
1 | 0.36 | 20 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sourav Chakraborty | 1 | 381 | 49.27 |
Hari Subramoni | 2 | 466 | 50.51 |
Jonathan L. Perkins | 3 | 30 | 3.77 |
Ammar Ahmad Awan | 4 | 91 | 10.84 |
Dhabaleswar K. Panda | 5 | 5366 | 446.70 |