Title
NUMA-aware Scalable Graph Traversal on SGI UV Systems.
Abstract
Breadth-first search (BFS) is one of the most fundamental processing algorithms in graph theory. We previously presented a scalable BFS algorithm based on Beamer's direction-optimizing algorithm for non-uniform memory access (NUMA)-based systems, in which the NUMA architecture was carefully considered. This paper presents our new implementation that reduces remote memory access in a top-down direction of direction-optimizing algorithm. We also discuss numerical results obtained on the SGI UV 2000 and UV 300 systems, which are shared-memory supercomputers based on a cache coherent (cc)-NUMA architecture that can handle thousands of threads on a single operating system. Our implementation has achieved performance rates of 219 billion edges per second on a Kronecker graph with 234 vertices and 238 edges on a rack of an SGI UV 300 system with 1,152 threads. This result exceeds the fastest entry for a shared-memory system on the current Graph500 list presented in November 2015, which includes our previous implementation.
Year
DOI
Venue
2016
10.1145/2915516.2915522
HPGP@HPDC
Keywords
DocType
Citations 
Graph algorithm, NUMA-aware, Graph500 benchmark
Conference
6
PageRank 
References 
Authors
0.44
15
6
Name
Order
Citations
PageRank
Yuichiro Yasui160.44
Katsuki Fujisawa224828.63
Eng Lim Goh360.78
John Baron460.44
Atsushi Sugiura560.44
Uchiyama, T.6286.18