Parallel Sparse Flow-Sensitive Points-To Analysis | 3 | 0.38 | 2018 |
LLVM-based communication optimizations for PGAS programs | 2 | 0.37 | 2015 |
A Composable Deadlock-Free Approach to Object-Based Isolation | 2 | 0.38 | 2015 |
Parallelizing a discrete event simulation application using the Habanero-Java multicore library | 0 | 0.34 | 2015 |
A Transformation Framework for Optimizing Task-Parallel Programs | 15 | 0.61 | 2013 |
Compiler-Driven Data Layout Transformation for Heterogeneous Platforms | 8 | 0.70 | 2013 |
Isolation for nested task parallelism | 2 | 0.37 | 2013 |
Accelerating Habanero-Java programs with OpenCL generation | 10 | 0.58 | 2013 |
A decoupled non-SSA global register allocation using bipartite liveness graphs | 1 | 0.36 | 2013 |
Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs | 4 | 0.47 | 2013 |
Scalable and precise dynamic datarace detection for structured parallelism | 43 | 1.13 | 2012 |
Finish Accumulators: An Efficient Reduction Construct for Dynamic Task Parallelism | 2 | 0.43 | 2012 |
Efficient Data Race Detection for Async-Finish Parallelism | 36 | 1.23 | 2012 |
Practical permissions for race-free parallelism | 11 | 0.65 | 2012 |
Intermediate language extensions for parallelism | 8 | 0.58 | 2011 |
Communication Optimizations for Distributed-Memory X10 Programs | 14 | 0.81 | 2011 |
Habanero-Java: the new adventures of old X10 | 109 | 3.54 | 2011 |
Permission regions for race-free parallelism | 6 | 0.51 | 2011 |
The design and implementation of the Habanero-Java parallel programming language | 4 | 0.46 | 2011 |
Delegated isolation | 15 | 0.87 | 2011 |
Efficient Selection of Vector Instructions Using Dynamic Programming | 26 | 0.94 | 2010 |
SLAW: a scalable locality-aware adaptive work-stealing scheduler for multi-core systems | 26 | 0.91 | 2010 |
Automatic vector instruction selection for dynamic compilation | 0 | 0.34 | 2010 |
Reducing task creation and termination overhead in explicitly parallel programs | 17 | 1.05 | 2010 |
SLAW: A scalable locality-aware adaptive work-stealing scheduler | 46 | 1.94 | 2010 |
Hierarchical place trees: a portable abstraction for task parallelism and data movement | 53 | 2.41 | 2009 |
Constraint based optimization of stationary fields | 1 | 0.35 | 2008 |
Adaptive Loop Tiling for a Multi-cluster CMP | 4 | 0.43 | 2008 |
Optimizing chip multiprocessor work distribution using dynamic compilation | 4 | 0.44 | 2007 |
Loop Parallelisation for the Jikes RVM | 8 | 0.79 | 2005 |