Title
Odd-ECC: on-demand DRAM error correcting codes.
Abstract
An application may have different sensitivity to faults in different subsets of the data it uses. Some data regions may therefore be more critical than others. Capitalizing on this observation, Odd-ECC provides a mechanism to dynamically select the memory fault tolerance of each allocated page of a program on demand depending on the criticality of the respective data. Odd-ECC error correcting codes (ECCs) are stored in separate physical pages and hidden by the OS as pages unavailable to the user. Still, these ECCs are physically aligned with the data they protect so the memory controller can efficiently access them. Thereby, capacity, performance and energy overheads of memory fault tolerance are proportional to the criticality of the data stored. Odd-ECC is applied to memory systems that use conventional 2D DRAM DIMMs as well as to 3D-stacked DRAMs and evaluated using various applications. Compared to at memory protection schemes, Odd-ECC substantially reduces ECCs capacity overheads while achieving the same Mean Time to Failure (MTTF) and in addition it slightly improves performance and energy costs. Under the same capacity constraints, Odd-ECC achieves substantially higher MTTF, compared to a Yat memory protection. This comes at a performance and energy cost, which is however still a fraction of the cost introduced by a at equally strong scheme.
Year
DOI
Venue
2017
10.1145/3132402.3132443
MEMSYS 2017: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS
Keywords
DocType
Citations 
Main memory reliability,Applications reliability analysis,DRAM,3D-Stacked memory,Error correcting codes
Conference
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Alirad Malek100.34
Evangelos Vasilakis282.48
Vassilis Papaefstathiou39713.71
Pedro Trancoso437743.79
Ioannis Sourdis545644.17