Title
A Multipath Fault-Tolerant Routing Method for High-Speed Interconnection Networks
Abstract
The intensive and continuous use of high-performance computers for executing computationally intensive applications, coupled with the large number of elements that make them up, dramatically increase the likelihood of failures during their operation. The interconnection network is a critical part of high-performance computer systems that communicates and links together the processing units. Network faults have an extremely high impact because the occurrence of a single fault may prevent the correct finalization of applications. This work focuses on the problem of fault tolerance for high-speed interconnection networks by designing a fault tolerant routing method. The goal is to solve a certain number of link and node failures, considering its impact, and occurrence probability. To accomplish this task we take advantage of communication path redundancy, by means of adaptive multipath routing approaches that fulfill the four phases of fault tolerance: error detection, damage confinement, error recovery, fault treatment and continuous service. Experiments show that our method allows applications to successfully finalize their execution in the presence of several number of faults, with an average performance value of 97% with respect to the fault-free scenarios.
Year
DOI
Venue
2009
10.1007/978-3-642-03869-3_99
Euro-Par
Keywords
Field
DocType
continuous use,multipath fault-tolerant routing method,certain number,continuous service,large number,network fault,computationally intensive application,fault treatment,high-speed interconnection networks,fault tolerant,fault tolerance,single fault,error detection,multipath routing
Multipath propagation,Stuck-at fault,Multipath routing,Computer science,Parallel computing,Computer network,Software fault tolerance,Error detection and correction,Fault tolerance,Redundancy (engineering),Interconnection,Distributed computing
Conference
Volume
ISSN
Citations 
5704
0302-9743
3
PageRank 
References 
Authors
0.42
17
4
Name
Order
Citations
PageRank
Gonzalo Zarza1142.53
Diego Lugones2359.77
daniel franco3246.18
Emilio Luque41097176.18