Title
Evaluating Performance And Energy-Efficiency Of A Parallel Signal Correlation Algorithm On Current Multi And Manycore Architectures
Abstract
Increasing variety and affordability of multi- and many-core embedded architectures can pose both a challenge and opportunity to developers of high performance computing applications. In this paper we present a case study where we develop and evaluate a unified parallel approach to a signal-correlation algorithm,currently in-use in a commercial/industrial locating system. We utilize both HPX C++ and CUDA runtimes to achieve scalable code for current embedded multi- and many-core architectures (NVIDIA Tegra, Intel Broadwell M, Arm Cortex A-15). We also compare our approach onto traditional high-performance hardware as well as a native embedded many-core variant. To increase the accuracy of our performance analysis we introduce dedicated performance model. The results show that our approach is feasible and enables us to harness the advantages of modern micro-server architectures, but also indicates that there are limitations to some of the currently existing many-core embedded architectures, that can lead to traditional hardware being superior both in efficiency and absolute performance.
Year
DOI
Venue
2016
10.1016/j.procs.2016.05.484
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016)
Keywords
Field
DocType
Programming Models, Many Core, Runtimes, User Feedback, Micro Servers, Shared Memory
ARM architecture,Signal correlation,Shared memory,Programming paradigm,Supercomputer,CUDA,Computer science,Efficient energy use,Algorithm,Embedded system,Scalability
Conference
Volume
ISSN
Citations 
80
1877-0509
0
PageRank 
References 
Authors
0.34
4
5
Name
Order
Citations
PageRank
Arne Hendricks101.01
T. Heller2675.57
Andreas Schäfer3172.05
Maximilian Kasparek461.41
Dietmar Fey535570.76