Title
Memory-Constrained Vectorization and Scheduling of Dataflow Graphs for Hybrid CPU-GPU Platforms.
Abstract
The increasing use of heterogeneous embedded systems with multi-core CPUs and Graphics Processing Units (GPUs) presents important challenges in effectively exploiting pipeline, task, and data-level parallelism to meet throughput requirements of digital signal processing applications. Moreover, in the presence of system-level memory constraints, hand optimization of code to satisfy these requirements is inefficient and error prone and can therefore, greatly slow down development time or result in highly underutilized processing resources. In this article, we present vectorization and scheduling methods to effectively exploit multiple forms of parallelism for throughput optimization on hybrid CPU-GPU platforms, while conforming to system-level memory constraints. The methods operate on synchronous dataflow representations, which are widely used in the design of embedded systems for signal and information processing. We show that our novel methods can significantly improve system throughput compared to previous vectorization and scheduling approaches under the same memory constraints. In addition, we present a practical case-study of applying our methods to significantly improve the throughput of an orthogonal frequency division multiplexing receiver system for wireless communications.
Year
DOI
Venue
2018
10.1145/3157669
ACM Transactions in Embedded Computing Systems
Keywords
Field
DocType
Dataflow models, design optimization, heterogeneous computing, signal processing systems, software synthesis
Graphics,Digital signal processing,Wireless,Computer science,Scheduling (computing),Parallel computing,Vectorization (mathematics),Dataflow,Throughput,Orthogonal frequency-division multiplexing
Journal
Volume
Issue
ISSN
17
2
1539-9087
Citations 
PageRank 
References 
1
0.41
16
Authors
3
Name
Order
Citations
PageRank
Shuoxin Lin191.61
jiahao wu2107.03
Shuvra S. Bhattacharyya31416162.67