Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures. | 0 | 0.34 | 2021 |
A Sparse Tensor Benchmark Suite for CPUs and GPUs | 1 | 0.37 | 2020 |
On The Feasibility Of Using Reduced-Precision Tensor Core Operations For Graph Analytics | 0 | 0.34 | 2020 |
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite | 5 | 0.47 | 2018 |
Designing Scalable Distributed Memory Models: A Case Study. | 0 | 0.34 | 2017 |
MIC-SVM: Designing a Highly Efficient Support Vector Machine for Advanced Modern Multi-core and Many-Core Architectures | 21 | 0.85 | 2014 |
Building Scalable PGAS Communication Subsystem on Blue Gene/Q | 1 | 0.35 | 2013 |
Designing energy efficient communication runtime systems: a view from PGAS models | 8 | 0.46 | 2013 |
Codesign Challenges for Exascale Systems: Performance, Power, and Reliability | 5 | 0.41 | 2011 |
Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models. | 10 | 0.54 | 2010 |