Title
Custom-sized caches in application-specific memory hierarchies
Abstract
Developing FPGA implementations with an input specification in a high-level programming language such as C/C++ or OpenCL allows for a substantially shortened design cycle compared to a design entry at register transfer level. This work targets high-level synthesis (HLS) implementations that process large amounts of data and therefore require access to an off-chip memory. We leverage the customizability of the FPGA on-chip memory to automatically construct a multi-cache architecture in order to enhance the performance of the interface between parallel functional units of the HLS core and an external memory. Our focus is on automatic cache sizing. Firstly, our technique determines and uses up unused left-over block RAM resources for the construction of on-chip caches. Secondly, we devise a high-level cache performance estimation based on the memory access trace of the program. We use this memory trace to find a heterogeneous configuration of cache sizes, tailored to the application's memory access characteristic, that maximizes the performance of the multi-cache system subject to an on-chip memory resource constraint. We evaluate our technique with three benchmark implementations on an FPGA board and obtain a reduction in execution latency of up to 2× (1.5× on average) when compared to a one-size-fits-all cache sizing. We also quantify the impact of our automatically generated cache system on the overall energy consumption of the implementation.
Year
DOI
Venue
2015
10.1109/FPT.2015.7393141
2015 International Conference on Field Programmable Technology (FPT)
Keywords
Field
DocType
FPGA,high-level programming language,C/C++,OpenCL,register transfer level,high-level synthesis,off-chip memory,RAM,on-chip caches,on-chip memory resource constraint,energy consumption,application-specific memory hierarchies,custom-sized caches
Interleaved memory,Computer architecture,Uniform memory access,Shared memory,Cache pollution,Computer science,Parallel computing,Cache-only memory architecture,Cache coloring,Non-uniform memory access,Memory map,Embedded system
Conference
Citations 
PageRank 
References 
3
0.39
16
Authors
5
Name
Order
Citations
PageRank
Felix Winterstein1948.00
Kermin E. Fleming219015.12
Hsin-Jung Yang3504.91
John Wickerson414210.08
George A. Constantinides51391160.26