Title
When Parallel Speedups Hit the Memory Wall
Abstract
After Amdah's trailblazing work, many other authors proposed analytical speedup models but none have considered the limiting effect of the memory wall. These models exploited aspects such as problem-size variation, memory size, communication overhead, and synchronization overhead, but data-access delays are assumed to be constant. Nevertheless, such delays can vary, for example, according to the number of cores used and the ratio between processor and memory frequencies. Given the large number of possible configurations of operating frequency and number of cores that current architectures can offer, suitable speedup models to describe such variations among these configurations are quite desirable for off-line or on-line scheduling decisions. This work proposes a new parallel speedup model that accounts for the variations on the average data-access delay to describe the limiting effect of the memory wall on parallel speedups in homogeneous shared-memory architectures. Analytical results indicate that the proposed modeling can capture the desired behavior while experimental hardware results validate the former. Additionally, we show that when accounting for parameters that reflect the intrinsic characteristics of the applications, such as the degree of parallelism and susceptibility to the memory wall, our proposal has significant advantages over machine-learning-based modeling. Moreover, our experiments show that conventional machine-learning modeling, besides being black-boxed, needs about one order of magnitude more measurements to reach the same level of accuracy achieved by the proposed model.
Year
DOI
Venue
2019
10.1109/ACCESS.2020.2990418
IEEE ACCESS
Keywords
DocType
Volume
Parallel systems,data access delay,performance modeling,speedup,memory wall
Journal
8
ISSN
Citations 
PageRank 
2169-3536
1
0.43
References 
Authors
0
4
Name
Order
Citations
PageRank
Alex F. A. Furtunato111.45
Kyriakos Georgiou212.46
Kerstin Eder323226.56
Samuel Xavier-de-Souza47910.07