Title
PATer: A Hardware Prefetching Automatic Tuner on IBM POWER8 Processor
Abstract
Hardware prefetching on IBM’s latest POWER8 processor is able to improve performance of many applications significantly, but it can also cause performance loss for others. The IBM POWER8 processor provides one of the most sophisticated hardware prefetching designs which supports 225 different configurations. Obviously, it is a big challenge to find the optimal or near-optimal hardware prefetching configuration for a specific application. We present a dynamic prefetching tuning scheme in this paper, named Prefetch Automatic Tuner (PATer). PATer uses a prediction model based on machine learning to dynamically tune the prefetch configuration based on the values of hardware performance monitoring counters (PMCs). By developing a two-phase prefetching selection algorithm and a prediction accuracy optimization algorithm in this tool, we identify a set of selected key hardware prefetch configurations that matter mostly to performance as well as a set of PMCs that maximize the machine learning prediction accuracy. We show that PATer is able to accelerate the execution of diverse workloads up to 1.4x.
Year
DOI
Venue
2016
10.1109/LCA.2015.2442972
Computer Architecture Letters
Keywords
Field
DocType
computer architecture
IBM,Computer science,Real-time computing,Computer hardware,Benchmark (computing),Tuner,Computer architecture,Memory hierarchy,Performance monitoring,Selection algorithm,Parallel computing,POWER8,Instruction prefetch
Journal
Volume
Issue
ISSN
PP
99
1556-6056
Citations 
PageRank 
References 
6
0.41
3
Authors
6
Name
Order
Citations
PageRank
minghua li160.41
guancheng chen290.78
qijun wang360.41
yong hua lin490.78
Per Stenström53048234.09
Peter Hofstee6284.30