Design and Implementation of a Highly Efficient DGEMM for 64-Bit ARMv8 Multi-core Processors | 4 | 0.40 | 2015 |
OpenMC: Towards Simplifying Programming for TianHe Supercomputers. | 5 | 0.44 | 2014 |
A Fast Parallel Implementation of Molecular Dynamics with the Morse Potential on a Heterogeneous Petascale Supercomputer | 5 | 0.50 | 2012 |
Parallelizing SOR for GPGPUs using alternate loop tiling | 9 | 0.74 | 2012 |
Adaptive Optimization for Petascale Heterogeneous CPU/GPU Computing | 35 | 1.53 | 2010 |
Solving 2D Nonlinear Unsteady Convection-Diffusion Equations on Heterogenous Platforms with Multiple GPUs | 0 | 0.34 | 2009 |