Title
Virtual Topologies for Scalable Resource Management and Contention Attenuation in a Global Address Space Model on the Cray XT5
Abstract
Global Address Space (GAS) programming models enable a convenient, shared-memory style addressing model, and support completely asynchronous data movement. Their underlying runtime systems face critical challenges in (1) scalably managing resources (such as memory for communication buffers), and (2) gracefully handling unpredictable communication patterns and any associated contention. In this research, we investigate these challenges for a popular GAS runtime library, Aggregate Remote Memory Copy Interface (ARMCI) on, large-scale Cray XT5 systems. We represent the management of communication resources as directed graphs, and propose two new scalable virtual topologies, Meshed Fully Connected Graphs (MFCG) and Cubic Fully Connected Graphs (CFCG), for scalable resource management and contention attenuation. To ensure deadlock-free communication in these multi-dimensional topologies, we design and develop Lowest Dimension First (LDF) forwarding to support fully- or partially-populated MFCG and CFCG on any number of nodes. We have extensively evaluated the benefits of these virtual topologies on the petascale Jaguar Cray XT5 system at Oak Ridge National Laboratory. Our experimental results demonstrate MFCG as the most suitable virtual topology because of its benefits in resource management, contention mitigation, and the resulting benefit to scientific applications.
Year
DOI
Venue
2011
10.1109/ICPP.2011.38
ICPP
Keywords
Field
DocType
communication resource,contention attenuation,connected graphs,xt5 system,virtual topologies,global address space model,cray xt5,communication buffer,scalable resource management,deadlock-free communication,contention mitigation,new scalable virtual topology,unpredictable communication pattern,associated contention,connected graph,directed graph,directed graphs,shared memory,programming model,resource manager,contention,gas,resource allocation
Resource management,Asynchronous communication,Computer science,Parallel computing,Computer network,Network topology,Runtime library,Resource allocation,Cray XT5,Petascale computing,Distributed computing,Scalability
Conference
Citations 
PageRank 
References 
0
0.34
14
Authors
4
Name
Order
Citations
PageRank
Weikuan Yu1104277.40
Vinod Tipparaju263846.25
Xinyu Que312411.81
Vetter, Jeffrey42383186.44