Title
Building Scalable PGAS Communication Subsystem on Blue Gene/Q
Abstract
This paper presents a design of a scalable Partitioned Global Address Space (PGAS) communication subsystem on the recently proposed Blue Gene/Q architecture. The proposed design provides an in-depth modeling of the communication infrastructure using the Parallel Active Messaging Interface (PAMI). The communication infrastructure is used to design time- and space-efficient communication protocols for frequently used data types (contiguous and uniformly non-contiguous) with Remote Direct Memory Access (RDMA) get/put primitives. The proposed design accelerates load-balance counters by using asynchronous threads, which are required because the network hardware lacks support for generic Atomic Memory Operations (AMOs). Under the proposed design, synchronization traffic is reduced by tracking conflicting memory accesses in distributed memory, at the cost of a slight increase in space complexity. An evaluation with simple communication benchmarks shows an adjacent-node get latency of 2.89 µs and a peak bandwidth of 1775 MB/s, resulting in 99% communication efficiency. The evaluation also shows a reduction in execution time of up to 30% for the NWChem self-consistent field calculation on 4096 processes using the proposed asynchronous-thread-based design.
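The load-balance counter idea in the abstract can be illustrated with a minimal sketch. It assumes a single process in which a dedicated thread stands in for the asynchronous thread that services fetch-and-add requests in software when the network offers no generic AMOs. The names `CounterServer`, `fetch_and_add`, and `run_workers` are hypothetical, and Python threads and queues replace PAMI entirely; this is not the authors' implementation.

```python
import queue
import threading


class CounterServer:
    """Hypothetical owner of a load-balance counter.

    A dedicated thread services fetch-and-add requests, standing in for
    the asynchronous thread used when hardware AMOs are unavailable.
    """

    def __init__(self, start=0):
        self._value = start
        self._requests = queue.Queue()
        self._thread = threading.Thread(target=self._serve, daemon=True)
        self._thread.start()

    def _serve(self):
        # Only this thread touches the counter, so no lock is needed.
        while True:
            item = self._requests.get()
            if item is None:  # shutdown sentinel
                break
            increment, reply = item
            old = self._value
            self._value += increment
            reply.put(old)  # return the value seen before the add

    def fetch_and_add(self, increment=1):
        # Emulates a remote atomic: send a request, block for the reply.
        reply = queue.Queue(maxsize=1)
        self._requests.put((increment, reply))
        return reply.get()

    def shutdown(self):
        self._requests.put(None)
        self._thread.join()


def run_workers(num_workers=4, num_tasks=100):
    """Workers claim task ids from the shared counter until it runs out."""
    server = CounterServer()
    claimed = [[] for _ in range(num_workers)]

    def worker(rank):
        while True:
            task = server.fetch_and_add(1)
            if task >= num_tasks:
                break
            claimed[rank].append(task)

    threads = [threading.Thread(target=worker, args=(r,))
               for r in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    server.shutdown()
    return claimed
```

Calling `run_workers(4, 100)` returns one list of claimed task ids per worker. Because every request passes through the single servicing thread, each task id is handed out exactly once, which is the property a dynamic load-balancing counter has to provide.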
Year: 2013
DOI: 10.1109/IPDPSW.2013.262
Venue: IPDPS Workshops
Keywords: computational complexity, file organisation, parallel architectures, parallel machines, protocols, synchronisation, system-on-chip, AMO, Blue Gene/Q architecture, NWChem self consistent field calculation, PAMI, RDMA, asynchronous thread based design, communication infrastructure, conflicting memory accesses, generic atomic memory operations, get-put primitives, network hardware support, parallel active messaging interface, partitioned global address space, remote direct memory access, scalable PGAS communication subsystem, space complexity, synchronization traffic, time-space efficient communication protocols, Blue Gene/Q, Communication, PGAS
Field: Asynchronous communication, Synchronization, Computer science, Parallel computing, Distributed memory, Thread (computing), Remote direct memory access, Partitioned global address space, Scalability, Communications protocol
DocType: Conference
Citations: 1
PageRank: 0.35
References: 23
Authors: 4
Name | Order | Citations | PageRank
Abhinav Vishnu | 1 | 635 | 47.28
Darren J. Kerbyson | 2 | 1102 | 104.36
Kevin Barker | 3 | 51 | 4.46
Hubertus van Dam | 4 | 10 | 1.23