Title
An Effective and Feasible Congestion Management Technique for High-Performance MINs with Tag-Based Distributed Routing
Abstract
As parallel computing systems increase in size, the interconnection network is becoming a critical subsystem. The current trend in network design is to use as few components as possible to interconnect the end nodes, thereby reducing cost and power consumption. However, this increases the probability of congestion appearing in the network. As congestion may severely degrade network performance, the use of a congestion management mechanism is becoming mandatory in modern interconnects. One of the most cost-effective proposals to deal with the problems derived from congestion situations is the Regional Explicit Congestion Notification (RECN) strategy, based on using special queues to totally isolate the packet flows which contribute to congestion, thereby preventing the Head-of-Line (HoL) blocking effect that these flows may cause to others. Unfortunately, RECN requires the use of source-based routing, thus not being suitable for interconnects with distributed routing, like InfiniBand. Although some RECN-like mechanisms have been proposed for distributed-routing networks, they are not scalable due to the huge amount of control memory that they require in medium-size or large networks. In this paper, we propose Distributed-Routing-Based Congestion Management (DRBCM), a new scalable technique which, following the RECN principles, totally prevents congestion from producing HoL-blocking in multistage interconnection networks (MINs) using tag-based distributed routing. Simulation results indicate that, regardless of network size, DRBCM presents small resource requirements to keep network performance at maximum level even in scenarios of heavy congestion, where it utterly outperforms (with a gain up to 70 percent) current solutions for distributed-routing networks, like the InfiniBand congestion-control mechanism based on injection throttling. Thus, DRBCM is an efficient, cost-effective, and scalable solution for congestion management.
Year
DOI
Venue
2013
10.1109/TPDS.2012.303
IEEE Trans. Parallel Distrib. Syst.
Keywords
Field
DocType
computer network performance evaluation,multiprocessor interconnection networks,parallel processing,telecommunication network routing,DRBCM,InfiniBand congestion control mechanism,RECN principles,congestion management mechanism,congestion management technique,critical subsystem,distributed routing based congestion management,distributed routing networks,high performance MIN,injection throttling,multistage interconnection networks,network design,network performance,packet flows,parallel computing systems,power consumption,regional explicit congestion notification strategy,source based routing,tag based distributed routing,High-performance interconnection networks,congestion management,distributed routing,head-of-line blocking
Network planning and design,InfiniBand,Computer science,Computer network,Real-time computing,Multistage interconnection networks,Flow control (data),Network congestion,Network traffic control,Bandwidth management,Distributed computing,Explicit Congestion Notification
Journal
Volume
Issue
ISSN
24
10
1045-9219
Citations 
PageRank 
References 
10
0.55
16
Authors
5
Name
Order
Citations
PageRank
Jesus Escudero-Sahuquillo1794.12
Pedro J. Garcia21276.17
Francisco J. Quiles343841.67
J. Flich477552.09
Jose Duato589354.65