Abstract | ||
---|---|---|
Users pay for running their applications on cloud infrastructure, and in return they expect high availability, and minimal data loss in case of failure. From a cloud provider perspective, any hardware or software failure must be detected and recovered as quickly as possible to maintain users' trust and avoid financial losses. From a user's perspective, failures must be transparent and should not impact application performance. In order to recover a failed application, cloud providers must perform checkpoints, and periodically save application data, which can then be recovered following a failover. Currently, a checkpoint service can be implemented in many ways, each presenting different performance results. The main research question to be answered is: what is the best checkpoint strategy to use given some users' requirements? In this paper, we performed experiments with different checkpoint service strategies to understand how these are affected by the computing resources. We also provide a discussion about the relationship between service availability and the checkpoint service. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1504/IJGUC.2019.102735 | INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING |
Keywords | DocType | Volume |
checkpoint, failover, performance evaluation, SAF standard | Journal | 10 |
Issue | ISSN | Citations |
6 | 1741-847X | 0 |
PageRank | References | Authors |
0.34 | 0 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Demis Gomes | 1 | 2 | 1.71 |
Glauco Estácio Gonçalves | 2 | 24 | 9.62 |
Patricia Endo | 3 | 121 | 25.32 |
Moisés Rodrigues | 4 | 0 | 0.34 |
Judith Kelner | 5 | 62 | 16.65 |
Djamel Sadok | 6 | 371 | 57.81 |
Calin Curescu | 7 | 74 | 7.77 |