Abstract | ||
---|---|---|
Fault tolerance poses a major challenge for future large-scale systems. Current research on fault tolerance has been principally focused on mitigating the impact of uncorrectable errors: errors that corrupt the state of the machine and require a restart from a known good state. However, correctable errors occur much more frequently than uncorrectable errors and may be even more common on future sy... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/Cluster48925.2021.00060 | 2021 IEEE International Conference on Cluster Computing (CLUSTER) |
Keywords | DocType | ISSN |
Fault tolerance,Conferences,Fault tolerant systems,Random access memory,Cluster computing,Hardware,Large-scale systems | Conference | 1552-5244 |
ISBN | Citations | PageRank |
978-1-7281-9666-4 | 0 | 0.34 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kurt Ferreira | 1 | 639 | 40.78 |
Scott Levy | 2 | 29 | 7.36 |
Victor Kuhns | 3 | 0 | 0.34 |
Nathan DeBardeleben | 4 | 490 | 31.71 |
Sean Blanchard | 5 | 190 | 13.20 |