Brief Announcement: Spatial Locality and Granularity Change in Caching | 0 | 0.34 | 2022 |
RACOD: algorithm/hardware co-design for mobile robot path planning | 2 | 0.36 | 2022 |
RTRBench: A Benchmark Suite for Real-Time Robotics | 0 | 0.34 | 2022 |
MetaSys: A Practical Open-source Metadata Management System to Implement and Evaluate Cross-layer Optimizations | 0 | 0.34 | 2022 |
The Read-Only Semi-External Model | 1 | 0.36 | 2021 |
The Processing-in-Memory Model | 0 | 0.34 | 2021 |
DriftSurf: Stable-State / Reactive-State Learning under Concept Drift | 0 | 0.34 | 2021 |
TardisTM: incremental repair for transactional memory | 0 | 0.34 | 2020 |
Sage: Parallel Semi-Asymmetric Graph Algorithms for NVRAMs | 0 | 0.34 | 2020 |
The Non-IID Data Quagmire of Decentralized Machine Learning | 0 | 0.34 | 2020 |
Writeback-Aware Caching (Brief Announcement) | 0 | 0.34 | 2019 |
SysML: The New Frontier of Machine Learning Systems | 0 | 0.34 | 2019 |
MLtuner: System Support for Automatic Machine Learning Tuning | 0 | 0.34 | 2018 |
PipeDream: Fast and Efficient Pipeline Parallel DNN Training | 10 | 0.46 | 2018 |
The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality in GPUs | 7 | 0.39 | 2018 |
Implicit Decomposition for Write-Efficient Connectivity Algorithms | 4 | 0.38 | 2018 |
Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management | 0 | 0.34 | 2018 |
A Case for Richer Cross-Layer Abstractions: Bridging the Semantic Gap with Expressive Memory | 7 | 0.39 | 2018 |
Proteus: agile ML elasticity through tiered reliability in dynamic resource markets | 17 | 0.58 | 2017 |
Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds | 17 | 0.82 | 2017 |
Parallel Algorithms for Asymmetric Read-Write Costs | 9 | 0.51 | 2016 |
Addressing the straggler problem for iterative convergent parallel ML | 24 | 0.82 | 2016 |
How Emerging Memory Technologies Will Have You Rethinking Algorithm Design | 0 | 0.34 | 2016 |
Sorting with Asymmetric Read and Write Costs | 11 | 0.55 | 2016 |
Managed communication and consistency for fast data-parallel iterative analytics | 35 | 1.13 | 2015 |
Bandwidth-efficient distributed k-nearest-neighbor search with dynamic time warping | 1 | 0.35 | 2015 |
Exploiting compressed block size as an indicator of future reuse | 29 | 0.69 | 2015 |
Learning better while sending less: Communication-efficient online semi-supervised learning in client-server settings | 0 | 0.34 | 2015 |
Tracking and Reducing Uncertainty in Dataflow Analysis-Based Dynamic Parallel Monitoring | 0 | 0.34 | 2015 |
Sequential random permutation, list contraction and tree contraction are highly parallel | 8 | 0.50 | 2015 |
Gather-scatter DRAM: in-DRAM address translation to improve the spatial locality of non-unit strided accesses | 44 | 0.75 | 2015 |
Gleaner: mitigating the blocked-waiter wakeup problem for virtualized multicore applications | 3 | 0.49 | 2014 |
The dirty-block index | 31 | 0.62 | 2014 |
ACM transactions on parallel computing: An introduction | 0 | 0.34 | 2014 |
Guardrail: a high fidelity approach to protecting hardware devices from buggy drivers | 1 | 0.35 | 2014 |
Communication-efficient multi-view keyframe extraction in distributed video sensors | 3 | 0.37 | 2014 |
Internally deterministic parallel algorithms can be fast | 39 | 1.22 | 2012 |
MaSM: efficient online updates in data warehouses | 20 | 0.68 | 2011 |
Sustaining collaboration in multicast despite rational collusion | 2 | 0.37 | 2011 |
Scheduling irregular parallel computations on hierarchical caches | 28 | 0.92 | 2011 |
Rethinking Database Algorithms for Phase Change Memory | 104 | 3.37 | 2011 |
Log-based architectures: using multicore to help software behave correctly | 6 | 0.46 | 2011 |
PR-join: a non-blocking join achieving higher early result rate with statistical guarantees | 13 | 0.54 | 2010 |
Decoupled lifeguards: enabling path optimizations for dynamic correctness checking tools | 13 | 0.52 | 2010 |
Online maintenance of very large random samples on flash storage | 51 | 2.50 | 2010 |
Flash in a DBMS: Where and How? | 7 | 0.53 | 2010 |
Flexible Hardware Acceleration for Instruction-Grain Lifeguards | 5 | 0.46 | 2009 |
Beyond nested parallelism: tight bounds on work-stealing overheads for parallel futures | 15 | 0.77 | 2009 |
Optimal inter-object correlation when replicating for availability | 9 | 0.66 | 2009 |
Synopsis diffusion for robust aggregation in sensor networks | 239 | 10.77 | 2008 |