Understanding power variation and its implications on performance optimization on the Cori supercomputer | 0 | 0.34 | 2021 |
Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract | 1 | 0.36 | 2021 |
Uncovering Access, Reuse, And Sharing Characteristics Of I/O-Intensive Files On Large-Scale Production Hpc Systems | 1 | 0.35 | 2020 |
Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches | 0 | 0.34 | 2020 |
Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs | 2 | 0.40 | 2020 |
A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload | 1 | 0.37 | 2020 |
Quantifying the impact of network congestion on application performance and network metrics | 1 | 0.35 | 2020 |
Characterizing Scientific Workflows on HPC Systems using Logs | 1 | 0.35 | 2020 |
GPCNeT: designing a benchmark suite for inducing and measuring contention in HPC networks | 5 | 0.47 | 2019 |
Understanding Data Motion in the Modern HPC Data Center | 1 | 0.36 | 2019 |
A Zoom-in Analysis of I/O Logs to Detect Root Causes of I/O Performance Bottlenecks | 3 | 0.38 | 2019 |
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity | 0 | 0.34 | 2018 |
IOMiner: Large-Scale Analytics Framework for Gaining Knowledge from I/O Logs | 2 | 0.39 | 2018 |
A year in the life of a parallel file system. | 6 | 0.45 | 2018 |
UMAMI: a recipe for generating meaningful metrics through holistic I/O performance analysis. | 4 | 0.44 | 2017 |
Performance analysis of emerging data analytics and HPC workloads. | 0 | 0.34 | 2017 |
Performance characterization of scientific workflows for the optimal use of Burst Buffers | 3 | 0.38 | 2016 |
Modular HPC I/O Characterization with Darshan. | 4 | 0.42 | 2016 |
Achieving High Parallel Efficiency on Modern Processors for X-Ray Scattering Data Analysis. | 0 | 0.34 | 2016 |
Measurement and Interpretation of Micro-benchmark and Application Energy Use on the Cray XC30 | 6 | 0.49 | 2014 |
Roofline Model Toolkit: A Practical Tool For Architectural And Program Analysis | 23 | 1.06 | 2014 |
Abstract machine models and proxy architectures for exascale computing | 23 | 0.80 | 2014 |
Cori: A Pre-Exascale Supercomputer for Big Data and HPC Applications. | 3 | 0.40 | 2014 |
Analysis Of Cray Xc30 Performance Using Trinity-Nersc-8 Benchmarks And Comparison With Cray Xe6 And Ibm Bg/Q | 7 | 0.50 | 2013 |
Performance Tuning Of Fock Matrix And Two-Electron Integral Calculations For Nwchem On Leading Hpc Platforms | 5 | 0.47 | 2013 |
Extreme Data Science at the National Energy Research Scientific Computing (NERSC) Center. | 1 | 0.35 | 2013 |
Evaluating Interconnect and Virtualization Performance forHigh Performance Computing | 9 | 0.70 | 2012 |
A preliminary evaluation of the hardware acceleration of the cray gemini interconnect for PGAS languages and comparison with MPI | 12 | 0.81 | 2012 |
Comprehensive Performance Monitoring for GPU Cluster Systems | 3 | 0.39 | 2011 |
Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud | 238 | 10.81 | 2010 |
Effective Performance Measurement at Petascale Using IPM | 24 | 2.01 | 2010 |
A programming model performance study using the NAS parallel benchmarks | 14 | 0.99 | 2010 |
Effective Holistic Performance Measurement at Petascale Using IPM | 2 | 0.38 | 2010 |
Performance Analysis and Workload Characterization with IPM. | 3 | 0.46 | 2009 |