Sentinel: Efficient Tensor Migration and Allocation on Heterogeneous Memory Systems for Deep Learning | 1 | 0.35 | 2021 |
Sparta: high-performance, element-wise sparse tensor contraction on heterogeneous memory | 0 | 0.34 | 2021 |
Fast, flexible, and comprehensive bug detection for persistent memory programs | 0 | 0.34 | 2021 |
Optimizing large-scale plasma simulations on persistent memory-based heterogeneous memory with effective data placement across memory hierarchy | 0 | 0.34 | 2021 |
ArchTM: Architecture-Aware, High Performance Transaction For Persistent Memory | 0 | 0.34 | 2021 |
Unimem: Runtime Data Management On Non-Volatile Memory-Based Heterogeneous Main Memory For High Performance Computing | 0 | 0.34 | 2021 |
Enabling energy-efficient DNN training on hybrid GPU-FPGA accelerators | 1 | 0.35 | 2021 |
MD-HM: memoization-based molecular dynamics simulations on big memory system | 1 | 0.35 | 2021 |
ZeRO-Offload: Democratizing Billion-Scale Model Training | 0 | 0.34 | 2021 |
Tahoe: tree structure-aware high performance inference engine for decision tree ensemble on GPU | 1 | 0.35 | 2021 |
Athena: high-performance sparse tensor contraction sequence on heterogeneous memory | 1 | 0.35 | 2021 |
Efficient Buffer Overflow Detection on GPU | 0 | 0.34 | 2021 |
Exploration on Routing Configuration of HNoC With Intelligent On-Chip Resource Management | 0 | 0.34 | 2020 |
Ribbon: High Performance Cache Line Flushing for Persistent Memory | 0 | 0.34 | 2020 |
HM-ANN: Efficient Billion-Point Nearest Neighbor Search on Heterogeneous Memory | 0 | 0.34 | 2020 |
Workshop 6: HIPS High-level Parallel Programming Models and Supportive Environments | 0 | 0.34 | 2020 |
RIANN: Real-time Incremental Learning with Approximate Nearest Neighbor on Mobile Devices | 0 | 0.34 | 2020 |
MATCH: An MPI Fault Tolerance Benchmark Suite | 1 | 0.36 | 2020 |
Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems | 2 | 0.37 | 2020 |
Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures | 0 | 0.34 | 2020 |
Smart-PGSim: using neural network to accelerate AC-OPF power grid simulation | 0 | 0.34 | 2020 |
Adaptive neural network-based approximation to accelerate Eulerian fluid simulation | 2 | 0.39 | 2019 |
MOARD: Modeling Application Resilience to Transient Faults on Data Objects | 3 | 0.37 | 2019 |
EasyCrash: Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures | 0 | 0.34 | 2019 |
Architecture-Aware, High Performance Transaction for Persistent Memory | 0 | 0.34 | 2019 |
Performance Analysis and Characterization of Training Deep Learning Models on NVIDIA TX2 | 0 | 0.34 | 2019 |
UMap: Enabling Application-driven Optimizations for Page Management | 2 | 0.39 | 2019 |
Performance Analysis and Characterization of Training Deep Learning Models on Mobile Device | 1 | 0.35 | 2019 |
Multi-Parameter Performance Modeling Based on Machine Learning with Basic Block Features | 0 | 0.34 | 2019 |
PARIS: Predicting application resilience using machine learning | 1 | 0.35 | 2018 |
Processing-in-Memory for Energy-Efficient Neural Network Training - A Heterogeneous Approach | 12 | 0.46 | 2018 |
Characterization and Comparison of Application Resilience for Serial and Parallel Executions | 1 | 0.35 | 2018 |
Modeling Application Resilience In Large-Scale Parallel Execution | 0 | 0.34 | 2018 |
Understanding Application Recomputability without Crash Consistency in Non-Volatile Memory | 0 | 0.34 | 2018 |
FlipTracker: Understanding Natural Error Resilience in HPC Applications | 4 | 0.38 | 2018 |
Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training | 1 | 0.36 | 2018 |
GMOD: a dynamic GPU memory overflow detector | 2 | 0.36 | 2018 |
Runtime data management on non-volatile memory-based heterogeneous memory for task-parallel programs | 3 | 0.39 | 2018 |
Performance Modeling for Optimal Data Placement on GPU with Heterogeneous Memory Systems | 1 | 0.35 | 2017 |
Unimem: runtime data management on non-volatile memory-based heterogeneous main memory | 9 | 0.58 | 2017 |
Early Evaluation of Intel Optane Non-Volatile Memory with HPC I/O Workloads | 0 | 0.34 | 2017 |
Algorithm-Directed Crash Consistence in Non-volatile Memory for HPC | 3 | 0.41 | 2017 |
Performance Evaluation and Modeling of HPC I/O on Non-Volatile Memory | 3 | 0.42 | 2017 |
High Performance Data Persistence in Non-Volatile Memory for Resilient High Performance Computing | 0 | 0.34 | 2017 |
Optimizing Data Placement on GPU Memory: A Portable Approach | 4 | 0.43 | 2017 |
Application-Level Resilience Modeling for HPC Fault Tolerance | 0 | 0.34 | 2017 |
Exploring Synchronization in Cache Coherent Manycore Systems: A Case Study with Xeon Phi | 0 | 0.34 | 2017 |
Integrated Thermal Analysis for Processing In Die-Stacking Memory | 5 | 0.48 | 2016 |
Algorithm-Directed Data Placement in Explicitly Managed Non-Volatile Memory | 12 | 0.86 | 2016 |
Performance Implications of Processing-in-Memory Designs on Data-Intensive Applications | 0 | 0.34 | 2016 |