Title
Roofline Model Toolkit: A Practical Tool For Architectural And Program Analysis
Abstract
We present preliminary results of the Roofline Toolkit for multicore, manycore, and accelerated architectures. This paper focuses on the processor architecture characterization engine, a collection of portable instrumented micro benchmarks implemented with Message Passing Interface (MPI), and OpenMP used to express thread-level parallelism. These benchmarks are specialized to quantify the behavior of different architectural features. Compared to previous work on performance characterization, these microbenchmarks focus on capturing the performance of each level of the memory hierarchy, along with thread-level parallelism, instruction-level parallelism and explicit SIMD parallelism, measured in the context of the compilers and run-time environments. We also measure sustained PCIe throughput with four GPU memory managed mechanisms. By combining results from the architecture characterization with the Roofline model based solely on architectural specifications, this work offers insights for performance prediction of current and future architectures and their software systems. To that end, we instrument three applications and plot their resultant performance on the corresponding Roofline model when run on a Blue Gene/Q architecture.
Year
DOI
Venue
2014
10.1007/978-3-319-17248-4_7
HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION
Keywords
Field
DocType
Roofline, Memory bandwidth, CUDA unified memory
Memory hierarchy,Memory bandwidth,Computer science,Parallel computing,SIMD,Software system,Message Passing Interface,Program analysis,Multi-core processor,Microarchitecture
Conference
Volume
ISSN
Citations 
8966
0302-9743
23
PageRank 
References 
Authors
1.06
4
8
Name
Order
Citations
PageRank
Yu Jung Lo1231.06
Samuel Williams2128298.56
Brian Van Straalen31128.83
Terry J. Ligocki4774.64
Matthew J. Cordery5301.56
Nicholas J. Wright640827.79
Mary W. Hall72328263.34
leonid oliker81358145.15