Abstract | ||
---|---|---|
BXI (Bull eXascale Interconnect) is the new interconnection network developed by Atos for high-performance computing. It has been designed to meet the requirements of exascale supercomputers. At such a scale, faults must be expected and handled transparently so that applications remain unaffected. BXI features various mechanisms for this purpose, one of which is based on a clear separation between two modes of routing-table computation: an offline mode used during bring-up and an online mode used to handle link failures and recoveries. This new architecture is presented along with several offline and online routing algorithms and their actual performance: the full routing tables for a 64k-node fat-tree can be computed in a few minutes in offline mode, and the online mode can withstand numerous inter-router link failures without any noticeable impact on running applications. |
Year | DOI | Venue |
---|---|---|
2016 | https://doi.org/10.1007/s11227-016-1755-2 | The Journal of Supercomputing |
Keywords | Field | DocType |
Fabric management, Routing, Fault-tolerant routing, BXI, Interconnect management, High-performance computing | Routing architecture, Architecture, Supercomputer, Computer science, Static routing, Parallel computing, Computer network, Routing table, Interconnection, Computation, Routing algorithm, Distributed computing | Journal |
Volume | Issue | ISSN |
72 | 12 | 0920-8542 |
Citations | PageRank | References |
1 | 0.38 | 22 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
P. Vignéras | 1 | 13 | 2.54 |
Jean-Noël Quintin | 2 | 28 | 4.11 |