Title
Fault prediction in distributed systems gone wild
Abstract
We consider the problem of predicting faults in deployed, large-scale distributed systems that are heterogeneous and federated. Motivated by the importance of ensuring reliability of the services these systems provide, we argue that the key step in making these systems reliable is the need to automatically predict faults. For example, doing so is vital for avoiding Internet-wide outages that occur due to programming errors or misconfigurations.
Year
DOI
Venue
2010
10.1145/1859184.1859187
Proceedings of the 4th International Workshop on Large Scale Distributed Systems and Middleware
Keywords
DocType
Citations 
shadow snapshot,key step,internet-wide outages,federated systems,bgp,fault prediction,programming error,spatial and temporal awareness,heterogeneous systems,computer science,distributed system
Conference
3
PageRank 
References 
Authors
0.44
12
4
Name
Order
Citations
PageRank
Marco Canini185760.21
Dejan Novaković22287.25
Vojin Jovanovic31035.03
Dejan Kostic41707119.11