Title
Partition-Aware Routing to Improve Network Isolation in Infiniband Based Multi-tenant Clusters
Abstract
InfiniBand (IB) is a widely used network interconnect for modern high-performance computing systems. In large IB fabrics, isolation of nodes is provided through partitioning. The routing algorithm, however, is unaware of these partitions in the network, Traffic flows belonging to different partitions might share links inside the network fabric. This sharing of intermediate links creates interference, which is particularly critical to avoid in multi-tenant environments like a cloud. In such systems, each tenant should experience predictable network performance, unaffected by the workload of other tenants. In addition, using current routing schemes, routes crossing partition boundaries are considered when distributing routes onto links in the network, despite the fact that these routes will never be used. The result is degraded load-balancing. In this paper, we present a novel partition-aware fat-tree routing algorithm, pFTree. The pFTree algorithm utilizes several mechanisms to provide network-wide isolation of partitions belonging to different tenant groups. Given the available network resources, pFTree starts by isolating partitions at the physical link level, and then moves on to utilize virtual lanes, if needed. Our experiments and simulations show that pFTree is able to significantly reduce the affect of inter-partition interference without any additional functional overhead. Furthermore, pFTree also provides improved load-balancing over the de facto standard IB fat-tree routing algorithm.
Year
DOI
Venue
2015
10.1109/CCGrid.2015.96
2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
Keywords
Field
DocType
routing,interconnection networks,performance isolation,InfiniBand,virtual channels
Multipath routing,Link-state routing protocol,Static routing,Policy-based routing,Hierarchical routing,Computer science,Computer network,Routing domain,Routing table,Routing protocol,Distributed computing
Conference
ISSN
Citations 
PageRank 
2376-4414
3
0.38
References 
Authors
16
5
Name
Order
Citations
PageRank
Feroz Zahid1144.60
Ernst Gunnar Gran2979.60
Bartosz Bogdanski3414.53
Bjorn Dag Johnsen4295.92
Tor Skeie5110374.67