Exchanging Best Practices for Supporting Computational and Data-Intensive Research, The Xpert Network | 0 | 0.34 | 2022 |
On the automatic parallelization of subscripted subscript patterns using array property analysis | 0 | 0.34 | 2021 |
Optimizing GPU programs by register demotion - poster. | 0 | 0.34 | 2019 |
Comparative analysis of coprocessors. | 0 | 0.34 | 2019 |
Proceedings of the ACM International Conference on Supercomputing, ICS 2019, Phoenix, AZ, USA, June 26-28, 2019. | 0 | 0.34 | 2019 |
Pagoda: A GPU Runtime System for Narrow Tasks | 0 | 0.34 | 2019 |
HiPA: history-based piecewise approximation for functions. | 0 | 0.34 | 2017 |
POSTER: Pagoda: A Runtime System to Maximize GPU Utilization in Data Parallel Tasks with Limited Parallelism. | 0 | 0.34 | 2016 |
Formalizing Structured Control Flow Graphs. | 0 | 0.34 | 2016 |
PETRA: Performance Evaluation Tool for Modern Parallelizing Compilers | 0 | 0.34 | 2015 |
HYDRA : Extending Shared Address Programming for Accelerator Clusters. | 0 | 0.34 | 2015 |
HeteroDoop: A MapReduce Programming System for Accelerator Clusters | 6 | 0.55 | 2015 |
Reliable and Efficient Distributed Checkpointing System for Grid Environments | 2 | 0.38 | 2014 |
The Cetus Source-to-Source Compiler Infrastructure: Overview and Evaluation | 11 | 0.61 | 2013 |
Scaling large-data computations on multi-GPU accelerators | 5 | 0.45 | 2013 |
OpenMPC: extended OpenMP for efficient programming and tuning on GPUs | 7 | 0.75 | 2013 |
Effects of compiler optimizations in OpenMP to CUDA translation | 1 | 0.36 | 2012 |
A hybrid approach of OpenMP for clusters | 13 | 0.71 | 2012 |
Topic 11: Multicore and Manycore Programming. | 0 | 0.34 | 2012 |
Performance analysis and tuning of automatically parallelized OpenMP applications | 3 | 0.41 | 2011 |
A Study of the Usefulness of Producer/Consumer Synchronization. | 0 | 0.34 | 2011 |
Automatic Scaling of OpenMP Beyond Shared Memory. | 3 | 0.45 | 2011 |
OpenMPC: Extended OpenMP Programming and Tuning for GPUs | 132 | 6.49 | 2010 |
Cetus: A Source-to-Source Compiler Infrastructure for Multicores | 85 | 3.57 | 2009 |
OpenMP to GPGPU: a compiler framework for automatic translation and optimization | 215 | 14.11 | 2009 |
PEAK—a fast and effective performance tuning system via compiler optimization orchestration | 13 | 1.51 | 2008 |
Optimizing irregular shared-memory applications for clusters | 5 | 1.51 | 2008 |
OpenMP in a New Era of Parallelism, 4th International Workshop, IWOMP 2008, West Lafayette, IN, USA, May 12-14, 2008, Proceedings | 16 | 2.35 | 2008 |
Adaptive runtime tuning of parallel sparse matrix-vector multiplication on distributed memory systems | 12 | 1.06 | 2008 |
Measuring High-Performance Computing with Real Applications | 7 | 0.54 | 2008 |
Incorporation of OpenMP memory consistency into conventional dataflow analysis | 3 | 0.41 | 2008 |
Dynamic Resource Management in Energy Constrained Heterogeneous Computing Systems Using Voltage Scaling | 40 | 1.23 | 2008 |
Prediction of Resource Availability in Fine-Grained Cycle Sharing Systems Empirical Evaluation | 39 | 1.31 | 2007 |
Failure-aware checkpointing in fine-grained cycle sharing systems | 17 | 0.81 | 2007 |
Programming Distributed Memory Sytems Using OpenMP | 20 | 1.28 | 2007 |
Speculative thread decomposition through empirical optimization | 37 | 1.05 | 2007 |
Open Internet-based Sharing for Desktop Grids in iShare | 0 | 0.34 | 2007 |
Context-sensitive domain-independent algorithm composition and selection | 9 | 0.99 | 2006 |
Optimizing irregular shared-memory applications for distributed-memory systems | 32 | 1.59 | 2006 |
Can transactions enhance parallel programs? | 0 | 0.34 | 2006 |
Exploiting reference idempotency to reduce speculative storage overflow | 7 | 0.55 | 2006 |
Implementing tomorrow's programming languages | 0 | 0.34 | 2006 |
Empirical Studies on the Behavior of Resource Availability in Fine-Grained Cycle Sharing Systems | 14 | 0.77 | 2006 |
Executing MPI programs on virtual machines in an internet sharing system | 3 | 0.44 | 2006 |
Towards automatic translation of OpenMP to MPI | 27 | 1.99 | 2005 |
Dynamic Mapping in Energy Constrained Heterogeneous Computing Systems | 12 | 0.59 | 2005 |
Languages and Compilers for High Performance Computing, 17th International Workshop, LCPC 2004, West Lafayette, IN, USA, September 22-24, 2004, Revised Selected Papers | 33 | 2.20 | 2005 |
On the interaction of tiling and automatic parallelization | 5 | 0.43 | 2005 |
iShare – open internet sharing built on peer-to-peer and web | 9 | 0.50 | 2005 |
Combined compile-time and runtime-driven, pro-active data movement in software DSM systems | 5 | 0.52 | 2004 |