Super-Node SLP: optimized vectorization for code sequences containing operators and their inverse elements | 0 | 0.34 | 2019 |
VW-SLP: auto-vectorization with adaptive vector width | 0 | 0.34 | 2018 |
Look-ahead SLP: auto-vectorization in the presence of commutative operations | 2 | 0.37 | 2018 |
Lynx: Using OS and Hardware Support for Fast Fine-Grained Inter-Core Communication | 1 | 0.35 | 2016 |
COMET: communication-optimised multi-threaded error-detection technique | 3 | 0.40 | 2016 |
Throttling Automatic Vectorization: When Less is More | 9 | 0.51 | 2015 |
PSLP: Padded SLP automatic vectorization | 19 | 0.69 | 2015 |
DRIFT: Decoupled CompileR-Based Instruction-Level Fault-Tolerance | 7 | 0.50 | 2013 |
LUCAS: latency-adaptive unified cluster assignment and instruction scheduling | 2 | 0.36 | 2013 |
CAeSaR: unified cluster-assignment scheduling and communication reuse for clustered VLIW processors | 0 | 0.34 | 2013 |
CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores | 0 | 0.34 | 2013 |
Aligned Scheduling: Cache-Efficient Instruction Scheduling for VLIW Processors | 0 | 0.34 | 2013 |
UCIFF: Unified Cluster Assignment Instruction Scheduling and Fast Frequency Selection for Heterogeneous Clustered VLIW Cores | 0 | 0.34 | 2012 |
Cooperative partitioning: Energy-efficient cache partitioning for high-performance CMPs | 28 | 0.82 | 2012 |
Decoupled Processors Architecture for Accelerating Data Intensive Applications using Scratch-Pad Memory Hierarchy | 0 | 0.34 | 2010 |