Title
Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics
Abstract
A critical factor for developing robust shared-memory applications is the efficient use of the cache and the communication between threads. Inappropriate data structures, algorithm design, and inefficient thread affinity may result in superfluous communication between threads/cores and severe performance problems. For this reason, state-of-the-art profiling tools focus on thread communication and behavior to present different metrics that enable programmers to write cache-friendly programs. The data shared between a pair of threads should be reused with a reasonable distance to preserve data locality. However, existing tools do not take into account the locality of communication events and mainly focus on analyzing the amount of communication instead. In this paper, we introduce a new method to analyze performance and communication bottlenecks that arise from data-access patterns and thread interactions of each code region. We propose new hardware-independent metrics to characterize thread communication and provide suggestions for applying appropriate optimizations on a specific code region. We evaluated our approach on the SPLASH and Rodinia benchmark suites. Experimental results validate the effectiveness of our approach by finding communication locality issues due to inefficient data structures and/or poor algorithm implementations. By applying the suggested optimizations, we improved the performance in Rodinia benchmarks by up to 56%. Furthermore, by varying the input size we demonstrated the ability of our method to assess the cache usage and scalability of a given application in terms of its inherent communication.
Year
2018
DOI
10.1145/3225058.3225142
Venue
PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING
Keywords
Shared memory, multi-threading, data locality, profiling, communication
Field
Data structure, Multithreading, Locality, Shared memory, Cache, Computer science, Thread (computing), Processor affinity, Distributed computing, Scalability
DocType
Conference
ISSN
0190-3918
Citations
0
PageRank
0.34
References
29
Authors
3
Name
Order
Citations
PageRank
Arya Mazaheri111.02
Felix Wolf25712.00
Ali Jannesari314920.11