Title
Design and Evaluation of a Switch Cache Architecture for CC-NUMA Multiprocessors
Abstract
Cache coherent nonuniform memory access (CC-NUMA) multiprocessors provide a scalable design for shared memory. But, they continue to suffer from large remote memory access latencies due to comparatively slow memory technology and large data transfer latencies in the interconnection network. In this paper, we propose a novel hardware caching technique, called switch cache, to improve the remote memory access performance of CC-NUMA multiprocessors. The main idea is to implement small fast caches in crossbar switches of the interconnect medium to capture and store shared data as they flow from the memory module to the requesting processor. This stored data acts as a cache for subsequent requests, thus reducing the need for remote memory accesses tremendously. The implementation of a cache in a crossbar switch needs to be efficient and robust, yet flexible for changes in the caching protocol. The design and implementation details of a CAche Embedded Switch ARchitecture, CAESAR, using wormhole routing with virtual channels is presented. We explore the design space of switch caches by modeling CAESAR in a detailed execution driven simulator and analyze the performance benefits. Our results show that the CAESAR switch cache is capable of improving the performance of CC-NUMA multiprocessors by up to 45 percent reduction in remote memory accesses for some applications. By serving remote read requests at various stages in the interconnect, we observe improvements in execution time as high as 20 percent for these applications. We conclude that switch caches provide a cost-effective solution for designing high performance CC-NUMA multiprocessors.
Year
DOI
Venue
2000
10.1109/12.868025
IEEE Trans. Computers
Keywords
Field
DocType
remote memory access performance,remote memory access,memory module,switch cache,shared memory,cache coherent nonuniform memory,large remote memory access,crossbar switch,remote memory,cc-numa multiprocessors,switch cache architecture,network routing,routing,cost effectiveness,cache coherence,hardware,switches,space exploration,protocols,robustness
Uniform memory access,Cache pollution,CPU cache,Computer science,Cache,Parallel computing,Cache-only memory architecture,Real-time computing,Cache coloring,Non-uniform memory access,Bus sniffing
Journal
Volume
Issue
ISSN
49
8
0018-9340
Citations 
PageRank 
References 
5
0.48
19
Authors
2
Name
Order
Citations
PageRank
Ravishankar Iyer172035.52
Laxmi N. Bhuyan22393248.44