Extracting clean performance models from tainted programs | 0 | 0.34 | 2021 |
Skipping Non-essential Instructions Makes Data-Dependence Profiling Faster | 1 | 0.35 | 2020 |
Accelerating winograd convolutions using symbolic computation and meta-programming | 0 | 0.34 | 2020 |
The Art of Getting Deep Neural Networks in Shape. | 1 | 0.37 | 2019 |
Engineering Algorithms for Scalability through Continuous Validation of Performance Expectations | 2 | 0.37 | 2019 |
Dissecting sequential programs for parallelization - An approach based on computational units. | 0 | 0.34 | 2019 |
Automatic construct selection and variable classification in OpenMP | 2 | 0.38 | 2019 |
Efficient Job Scheduling for Clusters with Shared Tiered Storage | 0 | 0.34 | 2019 |
Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics | 0 | 0.34 | 2018 |
Efficient Fault Tolerance Through Dynamic Node Replacement | 0 | 0.34 | 2018 |
A scalable algorithm for simulating the structural plasticity of the brain | 1 | 0.36 | 2018 |
Lightweight Requirements Engineering for Exascale Co-design | 2 | 0.39 | 2018 |
Using Deep Learning For Automated Communication Pattern Characterization: Little Steps And Big Challenges | 1 | 0.39 | 2018 |
Exploring the Performance Envelope of the LLL Algorithm | 0 | 0.34 | 2018 |
Parallelizing Audio Analysis Applications - A Case Study. | 0 | 0.34 | 2017 |
Off-Road Performance Modeling - How to Deal with Segmented Data. | 2 | 0.38 | 2017 |
Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications. | 2 | 0.38 | 2017 |
Brief Announcement: Meeting the Challenges of Parallelizing Sequential Programs. | 0 | 0.34 | 2017 |
Editorial of special issue on Software Engineering for Parallel Systems. | 0 | 0.34 | 2017 |
Following the Blind Seer - Creating Better Performance Models Using Less Information. | 1 | 0.35 | 2017 |
Fast Multi-parameter Performance Modeling | 3 | 0.41 | 2016 |
Unveiling parallelization opportunities in sequential programs. | 3 | 0.38 | 2016 |
Automatic Parallel Pattern Detection in the Algorithm Structure Design Space | 2 | 0.38 | 2016 |
Automatic Generation of Unit Tests for Correlated Variables in Parallel Programs. | 3 | 0.37 | 2016 |
Fast Data-Dependence Profiling by Skipping Repeatedly Executed Memory Operations. | 1 | 0.35 | 2015 |
Beyond Data Parallelism: Identifying Parallel Tasks in Sequential Programs. | 0 | 0.34 | 2015 |
An Efficient Data-Dependence Profiler for Sequential and Parallel Programs | 11 | 0.55 | 2015 |
Dependence-Based Code Transformation for Coarse-Grained Parallelism | 1 | 0.36 | 2015 |
Characterizing Loop-Level Communication Patterns in Shared Memory. | 1 | 0.35 | 2015 |
Using Template Matching to Infer Parallel Design Patterns | 3 | 0.40 | 2014 |
Predicting Parallelization of Sequential Programs Using Supervised Learning | 0 | 0.34 | 2013 |
Discovery of Potential Parallelism in Sequential Programs | 14 | 0.69 | 2013 |