Title
Exploiting Task and Data Parallelism in ILUPACK’s Preconditioned CG Solver on NUMA Architectures and Many-core Accelerators
Abstract
•Specialized implementations of ILUPACK’s iterative solver for NUMA platforms.•Specialized implementations of ILUPACK’s iterative solver for many-core accelerators.•Exploitation of task parallelism via OmpSs runtime (dynamic schedule).•Exploitation of task parallelism via MPI (static schedule).•Exploitation of data parallelism for GPUs.
Year
DOI
Venue
2016
10.1016/j.parco.2015.12.004
Parallel Computing
Keywords
Field
DocType
Sparse linear systems,Reconditioned Conjugate Gradient solver,Task and data parallelism,Multi-core processors,Intel Xeon Phi,Graphics processing units (GPUs)
Graphics,Instruction-level parallelism,x86,Computer science,Xeon Phi,Task parallelism,Parallel computing,Data parallelism,Solver,Multi-core processor
Journal
Volume
Issue
ISSN
54
C
0167-8191
Citations 
PageRank 
References 
5
0.46
7
Authors
7
Name
Order
Citations
PageRank
José I. Aliaga1314.71
Rosa M. Badia22234160.45
Maria Barreda3727.88
matthias bollhofer450.46
Ernesto Dufrechou52511.02
Pablo Ezzatti612428.24
Enrique S. Quintana-Orti740532.27