UPR: deadlock-free dynamic network reconfiguration by exploiting channel dependency graph compatibility | 0 | 0.34 | 2021 |
Feasible enhancements to congestion control in InfiniBand-based networks | 1 | 0.35 | 2018 |
Efficient and Cost-Effective Hybrid Congestion Control for HPC Interconnection Networks | 13 | 0.60 | 2015 |
Efficient Routing in Heterogeneous SoC Designs with Small Implementation Overhead | 1 | 0.35 | 2014 |
A new proposal to deal with congestion in InfiniBand-based fat-trees | 18 | 0.70 | 2014 |
BBQ: a straightforward queuing scheme to reduce HoL-blocking in high-performance hybrid networks | 13 | 0.60 | 2013 |
An Effective and Feasible Congestion Management Technique for High-Performance MINs with Tag-Based Distributed Routing | 10 | 0.55 | 2013 |
Increasing the Effectiveness of Directory Caches by Avoiding the Tracking of Noncoherent Memory Blocks | 15 | 0.57 | 2013 |
On the Impact of Within-Die Process Variation in GALS-Based NoC Performance | 3 | 0.41 | 2012 |
Enabling High-Performance Crossbars through a Floorplan-Aware Design | 6 | 0.53 | 2012 |
Understanding Cache Hierarchy Contention in CMPs to Improve Job Scheduling | 4 | 0.41 | 2012 |
A Survey and Evaluation of Topology-Agnostic Deterministic Routing Algorithms | 56 | 2.13 | 2012 |
Exploiting SIMD instructions in current processors to improve classical string algorithms | 8 | 0.57 | 2012 |
Progressive Congestion Management Based on Packet Marking and Validation Techniques | 1 | 0.39 | 2012 |
Extending Magny-Cours Cache Coherence | 1 | 0.35 | 2012 |
A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters | 0 | 0.34 | 2012 |
Cache Miss Characterization in Hierarchical Large-Scale Cache-Coherent Systems | 2 | 0.40 | 2012 |
Optimal Configuration of High-Radix Combined Switches | 1 | 0.36 | 2012 |
Page-Based Memory Allocation Policies of Local and Remote Memory in Cluster Computers | 0 | 0.34 | 2012 |
A New Family of Hybrid Topologies for Large-Scale Interconnection Networks | 8 | 0.58 | 2012 |
Efficient and Scalable Starvation Prevention Mechanism for Token Coherence | 1 | 0.35 | 2011 |
Combining Congested-Flow Isolation and Injection Throttling in HPC Interconnection Networks | 5 | 0.46 | 2011 |
MEMSCALE™: A Scalable Environment for Databases | 1 | 0.37 | 2011 |
Evaluation of an Alternative for Increasing Switch Radix | 3 | 0.40 | 2011 |
Improving Last-Level Cache Performance by Exploiting the Concept of MRU-Tour | 0 | 0.34 | 2011 |
OBQA: Smart and cost-efficient queue scheme for Head-of-Line blocking elimination in fat-trees | 12 | 0.57 | 2011 |
Unleash Your Memory-Constrained Applications: A 32-Node Non-coherent Distributed-Memory Prototype Cluster | 0 | 0.34 | 2011 |
C-Switches: Increasing Switch Radix with Current Integration Scale | 2 | 0.39 | 2011 |
Dynamic Fault Tolerance in Fat Trees | 10 | 0.72 | 2011 |
Highly scalable barriers for future high-performance computing clusters | 0 | 0.34 | 2011 |
Towards an Efficient NoC Topology through Multiple Injection Ports | 3 | 0.41 | 2011 |
MRU-Tour-based Replacement Algorithms for Last-Level Caches | 1 | 0.37 | 2011 |
Performance of CUDA Virtualized Remote GPUs in High Performance Clusters | 21 | 1.21 | 2011 |
Fault-Tolerant Vertical Link Design for Effective 3D Stacking | 5 | 0.48 | 2011 |
Enabling CUDA acceleration within virtual machines using rCUDA | 33 | 1.42 | 2011 |
Dealing with Transient Faults in the Interconnection Network of CMPs at the Cache Coherence Level | 1 | 0.36 | 2010 |
Exploiting subtrace-level parallelism in clustered processors | 0 | 0.34 | 2010 |
A Latency-Efficient Router Architecture for CMP Systems | 4 | 0.44 | 2010 |
An efficient strategy for reducing head-of-line blocking in fat-trees | 7 | 0.52 | 2010 |
A Scheduling Heuristic to Handle Local and Remote Memory in Cluster Computers | 2 | 0.39 | 2010 |
Scalable hardware support for conditional parallelization | 2 | 0.36 | 2010 |
Getting Rid of Coherency Overhead for Memory-Hungry Applications | 2 | 0.46 | 2010 |
Efficient and Scalable Hardware-Based Multicast in Fat-Tree Networks | 5 | 0.41 | 2009 |
A new mechanism to deal with process variability in NoC links | 9 | 0.52 | 2009 |
An Efficient Low-Complexity Alternative to the ROB for Out-of-Order Retirement of Instructions | 1 | 0.35 | 2009 |
A Complexity-Effective Out-of-Order Retirement Microarchitecture | 9 | 0.56 | 2009 |
Efficient Deadline-Based QoS Algorithms for High-Performance Networks | 4 | 0.45 | 2008 |
Efficient unicast and multicast support for CMPs | 49 | 1.64 | 2008 |
On the Potentials of Segment-Based Routing for NoCs | 22 | 0.85 | 2008 |
FBICM: efficient congestion management for high-performance networks using distributed deterministic routing | 15 | 0.77 | 2008 |