Symbiote Coprocessor Unit - A Streaming Coprocessor for Data Stream Acceleration | 0 | 0.34 | 2016 |
Runtime-driven shared last-level cache management for task-parallel programs | 11 | 0.50 | 2015 |
DyReCTape: a dynamically reconfigurable cache using domain wall memory tapes | 1 | 0.35 | 2015 |
Automatic sharing classification and timely push for cache-coherent systems | 1 | 0.35 | 2015 |
Bridging the Virtualization Performance Gap for HPC Using SR-IOV for InfiniBand | 9 | 0.56 | 2014 |
MorphStore: A local file system for Big Data with utility-driven replication and load-adaptive access scheduling | 0 | 0.34 | 2014 |
Accelerating MPI Collective Communications through Hierarchical Algorithms Without Sacrificing Inter-Node Communication Flexibility | 2 | 0.38 | 2014 |
Variation Aware Cache Partitioning for Multithreaded Programs | 4 | 0.38 | 2014 |
Imbalanced cache partitioning for balanced data-parallel programs | 12 | 0.48 | 2013 |
A mathematical hard disk timing model for full system simulation | 1 | 0.36 | 2013 |
Integrating High Performance File Systems in a Cloud Computing Environment | 6 | 0.41 | 2012 |
Managing cellular congestion using incentives | 16 | 0.83 | 2012 |
Accelerating multicore reuse distance analysis with sampling and parallelization | 64 | 1.68 | 2010 |
Storage optimization for a peer-to-peer video-on-demand network | 7 | 0.50 | 2010 |
Using data structure knowledge for efficient lock generation and strong atomicity | 4 | 0.44 | 2010 |
Peer-to-peer video on demand: challenges and solutions | 5 | 0.56 | 2009 |
Efficient high performance collective communication for the cell blade | 5 | 0.50 | 2009 |
Advanced collective communication in Aspen | 2 | 0.38 | 2008 |
Expressing and exploiting concurrency in networked applications with Aspen | 23 | 1.13 | 2007 |
Achieving Reliable Parallel Performance in a VoD Storage Server Using Randomization and Replication | 5 | 0.64 | 2007 |
A Model and Prototype of a Resource-Efficient Storage Server for High-Bitrate Video-on-Demand | 2 | 0.38 | 2007 |
Conservative vs. optimistic parallelization of stateful network intrusion detection | 18 | 1.10 | 2007 |
Improving VoD server efficiency with BitTorrent | 64 | 3.66 | 2007 |
Design Alternatives for a High-Performance Self-Securing Ethernet Network Interface | 12 | 0.86 | 2007 |
Seekable sockets: a mechanism to reduce copy overheads in TCP-based messaging | 0 | 0.34 | 2006 |
An efficient programmable 10 gigabit Ethernet network interface card | 20 | 1.19 | 2005 |
Network Interface Data Caching | 7 | 0.55 | 2005 |
Spinach: a liberty-based simulator for programmable network interface architectures | 9 | 0.83 | 2004 |
Isolating the performance impacts of network interface cards through microbenchmarks | 3 | 0.55 | 2004 |
Challenges in Computer Architecture Evaluation | 38 | 3.25 | 2003 |
A flexible and efficient application programming interface (API) for a customizable proxy cache | 4 | 0.61 | 2003 |
RSIM: Simulating Shared-Memory Multiprocessors with ILP Processors | 112 | 8.35 | 2002 |
Comparing and combining read miss clustering and software prefetching | 6 | 1.37 | 2001 |
Code transformations to improve memory parallelism | 35 | 2.91 | 2000 |
The impact of exploiting instruction-level parallelism on shared-memory multiprocessors | 8 | 0.84 | 1999 |
Improving the Accuracy vs. Speed Tradeoff for Simulating Shared-Memory Multiprocessors with ILP Processors | 27 | 4.15 | 1999 |
Recent advances in memory consistency models for hardware shared memory systems | 16 | 1.13 | 1999 |
Analytic evaluation of shared-memory systems with ILP processors | 60 | 3.23 | 1998 |
RSIM: Rice simulator for ILP multiprocessors | 11 | 0.68 | 1997 |
Using speculative retirement and larger instruction windows to narrow the performance gap between memory consistency models | 73 | 5.72 | 1997 |
An evaluation of memory consistency models for shared-memory systems with ILP processors | 30 | 4.28 | 1996 |