Title
NAS Integer Sort on Multi-threaded Shared Memory Machines
Abstract
Multi-threaded shared memory machines, like the commercial Tera MTA or the experimental SB-PRAM, have an extremely good performance on the Integer Sort benchmark of the NAS Parallel Benchmark Suite and are expected to scale. The number of CPU cycles is an order of magnitude lower than the numbers reported of general purpose distributed memory or shared memory machines; even vector computers are slower. The reasons for this behavior are investigated. It turns out that both machines can take advantage of a fetch-and-add operation and that due to multi-threading no time is lost waiting for memory accesses to complete. Except for non-scalable vector computers, the Cray T3E, which supports fetch-and-add but not multi-threading, is the only parallel computer that could challenge these machines.
Year
DOI
Venue
1998
10.1007/BFb0057960
Euro-Par
Keywords
Field
DocType
nas integer sort,multi-threaded shared memory machines,parallel computer,distributed memory,shared memory
Interleaved memory,Uniform memory access,Shared memory,Computer science,Parallel computing,Distributed memory,Distributed shared memory,Flat memory model,Instruction cycle,Operating system,Memory module
Conference
Volume
ISSN
ISBN
1470
0302-9743
3-540-64952-2
Citations 
PageRank 
References 
0
0.34
8
Authors
2
Name
Order
Citations
PageRank
Thomas Grün100.34
Mark A. Hillebrand220015.17