Title
InvisiFence: performance-transparent memory ordering in conventional multiprocessors
Abstract
A multiprocessor's memory consistency model imposes ordering constraints among loads, stores, atomic operations, and memory fences. Even for consistency models that relax ordering among loads and stores, ordering constraints still induce significant performance penalties due to atomic operations and memory ordering fences. Several prior proposals reduce the performance penalty of strongly ordered models using post-retirement speculation, but these designs either (1) maintain speculative state at a per-store granularity, causing storage requirements to grow proportionally to speculation depth, or (2) employ distributed global commit arbitration using unconventional chunk-based invalidation mechanisms. In this paper we propose InvisiFence, an approach for implementing memory ordering based on post-retirement speculation that avoids these concerns. InvisiFence leverages minimalistic mechanisms for post-retirement speculation proposed in other contexts to (1) track speculative state efficiently at block-granularity with dedicated storage requirements independent of speculation depth, (2) provide fast commit by avoiding explicit commit arbitration, and (3) operate under a conventional invalidation-based cache coherence protocol. InvisiFence supports both modes of operation found in prior work: speculating only when necessary to minimize the risk of rollback-inducing violations or speculating continuously to decouple consistency enforcement from the processor core. Overall, InvisiFence requires approximately one kilobyte of additional state to transform a conventional multiprocessor into one that provides performance-transparent memory ordering, fences, and atomic operations.
Year
DOI
Venue
2009
10.1145/1555754.1555785
Proceedings of the 40th Annual International Symposium on Computer Architecture
Keywords
Field
DocType
consistency model,parallel programming,languages,design
Speculation,Kilobyte,Computer science,Commit,Parallel computing,Memory ordering,Real-time computing,Multiprocessing,Consistency model,Multi-core processor,Distributed computing,Cache coherence
Conference
Volume
Issue
ISSN
37
3
1063-6897
Citations 
PageRank 
References 
60
1.53
24
Authors
3
Name
Order
Citations
PageRank
Colin Blundell143616.70
Milo M. K. Martin22677125.22
Thomas F. Wenisch32112105.25