Beyond Explicit Transfers: Shared And Managed Memory In Openmp | 0 | 0.34 | 2021 |
Multi-level spatial and temporal tiling for efficient HPC stencil computation on many-core processors with large shared caches. | 2 | 0.37 | 2019 |
Developments in memory management in OpenMP. | 0 | 0.34 | 2019 |
Supporting Function Variants in OpenMP. | 0 | 0.34 | 2018 |
Performance Optimization Of Fully Anisotropic Elastic Wave Propagation On 2nd Generation Intel (R) Xeon Phi (Tm) Processors | 1 | 0.35 | 2018 |
Assessing Task-to-Data Affinity in the LLVM OpenMP Runtime. | 0 | 0.34 | 2018 |
The Ongoing Evolution of OpenMP. | 1 | 0.41 | 2018 |
Double Buffering for MCDRAM on Second Generation $$\hbox {Intel}^{\circledR }$$ Xeon Phi $$^{\text {TM}}$$ Processors with OpenMP. | 1 | 0.37 | 2017 |
A Functional Safety OpenMP ^* for Critical Real-Time Embedded Systems. | 3 | 0.38 | 2017 |
Effective Use of Large High-Bandwidth Memory Caches in HPC Stencil Computation via Temporal Wave-Front Tiling. | 1 | 0.36 | 2016 |
YASK-yet another stencil kernel: a framework for HPC stencil code-generation and tuning | 7 | 0.44 | 2016 |
Approaches for Task Affinity in OpenMP. | 1 | 0.37 | 2016 |
Optimizing Overlapped Memory Accesses in User-directed Vectorization | 4 | 0.39 | 2015 |
An OpenMP* Barrier Using SIMD Instructions for Intel® Xeon PhiTM Coprocessor. | 2 | 0.37 | 2013 |
On the instrumentation of OpenMP and ompss tasking constructs | 6 | 0.53 | 2012 |
Compiler Automatic Discovery of OmpSs Task Dependencies. | 3 | 0.40 | 2012 |
The Intel® Many Integrated Core Architecture. | 38 | 1.99 | 2012 |
Extending OpenMP* with vector constructs for modern multicore SIMD architectures | 10 | 0.93 | 2012 |
Auto-scoping for OpenMP tasks | 5 | 0.45 | 2012 |
Trace-driven simulation of multithreaded applications | 31 | 1.22 | 2011 |
Onipss: A Proposal For Programming Heterogeneous Multi-Core Architectures | 239 | 9.46 | 2011 |
Productive cluster programming with OmpSs | 47 | 2.16 | 2011 |
Poster: programming clusters of GPUs with OMPSs | 0 | 0.34 | 2011 |
An extension to improve OpenMP tasking control | 2 | 0.40 | 2010 |
Extending OpenMP to Survive the Heterogeneous Multi-Core Era | 39 | 2.50 | 2010 |
Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL | 18 | 1.30 | 2010 |
Towards an error model for OpenMP | 8 | 0.62 | 2010 |
A proposal for user-defined reductions in OpenMP | 2 | 0.40 | 2010 |
A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures | 45 | 3.24 | 2009 |
Unrolling loops containing task parallelism | 10 | 0.68 | 2009 |
A proposal to extend the OpenMP tasking model with dependent tasks | 25 | 1.52 | 2009 |
The Design of OpenMP Tasks | 147 | 6.49 | 2009 |
An adaptive cut-off for task parallelism | 50 | 2.56 | 2008 |
Extending the OpenMP tasking model to allow dependent tasks | 40 | 3.70 | 2008 |
Support for OpenMP tasks in Nanos v4 | 23 | 2.19 | 2007 |
A Proposal for Task Parallelism in OpenMP | 33 | 4.20 | 2007 |
An Experimental Evaluation of the New OpenMP Tasking Model | 25 | 3.33 | 2007 |
Automatic thread distribution for nested parallelism in OpenMP | 18 | 1.01 | 2005 |
Experiences parallelizing a web server with OpenMP | 3 | 0.52 | 2005 |
Runtime adjustment of parallel nested loops | 10 | 0.87 | 2004 |
Dynamic load balancing of MPI+OpenMP applications | 24 | 1.42 | 2004 |
Is the schedule clause really necessary in OpenMP? | 19 | 1.82 | 2003 |