Improving DRAM performance by parallelizing refreshes with accesses - Citegraph

Paper Info

Title
Improving DRAM performance by parallelizing refreshes with accesses

Abstract
Modern DRAM cells are periodically refreshed to prevent data loss due to leakage. Commodity DDR (double data rate) DRAM refreshes cells at the rank level. This degrades performance significantly because it prevents an entire DRAM rank from serving memory requests while being refreshed. DRAM designed for mobile platforms, LPDDR (low power DDR) DRAM, supports an enhanced mode, called per-bank refresh, that refreshes cells at the bank level. This enables a bank to be accessed while another in the same rank is being refreshed, alleviating part of the negative performance impact of refreshes. Unfortunately, there are two shortcomings of per-bank refresh employed in today's systems. First, we observe that the perbank refresh scheduling scheme does not exploit the full potential of overlapping refreshes with accesses across banks because it restricts the banks to be refreshed in a sequential round-robin order. Second, accesses to a bank that is being refreshed have to wait. To mitigate the negative performance impact of DRAM refresh, we propose two complementary mechanisms, DARP (Dynamic Access Refresh Parallelization) and SARP (Subarray Access Refresh Parallelization). The goal is to address the drawbacks of per-bank refresh by building more efficient techniques to parallelize refreshes and accesses within DRAM. First, instead of issuing per-bank refreshes in a round-robin order, as it is done today, DARP issues per-bank refreshes to idle banks in an out-of-order manner. Furthermore, DARP proactively schedules refreshes during intervals when a batch of writes are draining to DRAM. Second, SARP exploits the existence of mostly-independent subarrays within a bank. With minor modifications to DRAM organization, it allows a bank to serve memory accesses to an idle subarray while another subarray is being refreshed. Extensive evaluations on a wide variety of workloads and systems show that our mechanisms improve system performance (and energy efficiency) compared to three st- te-of-the-art refresh policies and the performance benefit increases as DRAM density increases.

Year	DOI	Venue
2014	10.1109/HPCA.2014.6835946	High Performance Computer Architecture
Keywords	Field	DocType
DRAM chips,parallel processing,storage management,DARP,DRAM cells,DRAM density,DRAM organization,DRAM performance improvement,DRAM rank,LPDDR DRAM,SARP,bank access,bank level cell refresh,commodity DDR DRAM,commodity double data rate DRAM,data loss,dynamic access refresh parallelization,energy efficiency,enhanced mode called per-bank refresh,low power DDR DRAM,memory access,memory request,mobile platform,mostly-independent subarrays,performance degradation,performance impact,proactive refresh scheduling,refresh policies,sequential round-robin order,subarray access refresh parallelization,write batch	Data loss,Scheduling (computing),Computer science,Real-time computing,Out-of-order execution,Dram,Efficient energy use,Parallel computing,Schedule,Memory rank,Operating system,Double data rate,Embedded system	Conference
Volume	ISSN	Citations
abs/1712.07754	1530-0897	40
PageRank	References	Authors
1.02	23	7

Authors (7 rows)

Cited by (40 rows)

References (23 rows)

Name	Order	Citations	PageRank
Kai-Wei Chang	1	4735	276.81
Dong-Hyuk Lee	2	1254	48.26
Zeshan Chishti	3	723	34.65
Alaa R. Alameldeen	4	1672	80.06
Chris Wilkerson	5	1575	61.73
Yoongu Kim	6	1396	44.25
Onur Mutlu	7	9446	357.40

1