Multi-Layer In-Memory Processing | 0 | 0.34 | 2022 |
Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design | 2 | 0.36 | 2021 |
Low-cost prediction-based fault protection strategy. | 0 | 0.34 | 2020 |
PolygraphMR: Enhancing the Reliability and Dependability of CNNs | 1 | 0.36 | 2020 |
POSTER: Pairing Up CNNs for High Throughput Deep Learning | 0 | 0.34 | 2019 |
TF-Net: Deploying Sub-Byte Deep Neural Networks on Microcontrollers | 1 | 0.37 | 2019 |
Iterative Modulo Scheduling. | 0 | 0.34 | 2018 |
DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission. | 18 | 0.65 | 2017 |
Dynamic Resource Management for Efficient Utilization of Multitasking GPUs. | 14 | 0.53 | 2017 |
Concise loads and stores: The case for an asymmetric compute-memory architecture for approximation. | 4 | 0.39 | 2016 |
Orchestrating Multiple Data-Parallel Kernels on Multiple Devices. | 6 | 0.46 | 2015 |
Fine Grain Cache Partitioning Using Per-Instruction Working Blocks. | 1 | 0.36 | 2015 |
Rumba: an online quality management system for approximate computing | 28 | 0.96 | 2015 |
Chimera: Collaborative Preemption for Multitasking on a Shared GPU | 49 | 1.24 | 2015 |
Mascar: Speeding up GPU warps by reducing memory pitstops | 31 | 0.90 | 2015 |
Tango: Accelerating Mobile Applications through Flip-Flop Replication. | 0 | 0.34 | 2015 |
VAST: the illusion of a large memory space for GPUs | 15 | 0.65 | 2014 |
EFetch: optimizing instruction fetch for event-driven webapplications | 10 | 0.51 | 2014 |
Embracing heterogeneity with dynamic core boosting | 2 | 0.36 | 2014 |
Heterogeneous microarchitectures trump voltage scaling for low-power cores | 17 | 0.64 | 2014 |
Optimal Liveness-Enforcing Control for a Class of Petri Nets Arising in Multithreaded Software. | 6 | 0.53 | 2013 |
Eliminating Concurrency Bugs in Multithreaded Software: A New Approach Based on Discrete-Event Control. | 13 | 0.66 | 2013 |
SAGE: self-tuning approximation for graphics engines | 97 | 2.36 | 2013 |
Efficient Execution Of Augmented Reality Applications On Mobile Programmable Accelerators | 1 | 0.35 | 2013 |
Instant profiling: Instrumentation sampling for profiling datacenter applications | 2 | 0.36 | 2013 |
Illusionist: Transforming lightweight cores into aggressive cores on demand | 4 | 0.38 | 2013 |
APOGEE: adaptive prefetching on GPUs for energy efficiency | 24 | 0.78 | 2013 |
Trace based phase prediction for tightly-coupled heterogeneous cores | 22 | 0.74 | 2013 |
Paragon: collaborative speculative loop execution on GPU and CPU | 10 | 0.64 | 2012 |
A Customized Processor for Energy Efficient Scientific Computing | 7 | 0.57 | 2012 |
Efficient soft error protection for commodity embedded microprocessors using profile information | 16 | 0.62 | 2012 |
Libra: Tailoring SIMD Execution Using Heterogeneous Hardware and Dynamic Configurability | 12 | 0.59 | 2012 |
COMET: code offload by migrating execution transparently | 149 | 5.53 | 2012 |
Composite Cores: Pushing Heterogeneity Into a Core | 71 | 2.27 | 2012 |
Encore: low-cost, fine-grained transient fault recovery | 33 | 1.02 | 2011 |
Dynamically accelerating client-side web applications through decoupled execution | 15 | 0.75 | 2011 |
Archipelago: A polymorphic cache design for enabling robust near-threshold operation | 40 | 1.17 | 2011 |
PEPSC: A Power-Efficient Processor for Scientific Computing | 10 | 0.61 | 2011 |
Sponge: portable stream programming on graphics engines | 58 | 2.12 | 2011 |
StageNet: A Reconfigurable Fabric for Constructing Dependable CMPs | 15 | 0.80 | 2011 |
Dynamic parallelization of JavaScript applications using an ultra-lightweight speculation mechanism | 17 | 0.78 | 2011 |
Deadlock-Avoidance Control Of Multithreaded Software: An Efficient Siphon-Based Algorithm For Gadara Petri Nets | 4 | 0.41 | 2011 |
Maestro: orchestrating lifetime reliability in chip multiprocessors | 21 | 0.80 | 2010 |
StageWeb: Interweaving pipeline stages into a wearout and variation tolerant CMP fabric | 9 | 0.46 | 2010 |
Putting Faulty Cores to Work | 1 | 0.38 | 2010 |
Diet SODA: A power-efficient processor for digital cameras | 11 | 0.67 | 2010 |
Supervisory control of software execution for failure avoidance: Experience from the Gadara project. | 6 | 0.49 | 2010 |
Flextream: Adaptive Compilation of Streaming Applications for Heterogeneous Architectures | 34 | 1.53 | 2009 |
Gadara nets: Modeling and analyzing lock allocation for deadlock avoidance in multithreaded software | 19 | 0.99 | 2009 |
ZerehCache: Armoring cache architectures in high defect density technologies | 47 | 1.51 | 2009 |