Title
An Experimental Evaluation of Coordinated Checkpointing in a Parallel Machine
Abstract
Coordinated checkpointing represents a very effective solution to assure the continuity of distributed and parallel applications in the occurrence of failures. In previous studies it has been proved that this approach achieved better results than independent checkpointing and message logging. However, we need to know more about the real overhead of coordinated checkpointing and get sustained insights about the best way to implement this technique of fault-tolerance. This paper presents an experimental evaluation of coordinated checkpointing in a parallel machine. It describes some optimization techniques and presents some performance results.
Year
DOI
Venue
1999
10.1007/3-540-48254-7_10
EDCC
Keywords
Field
DocType
optimization technique,coordinated checkpointing,independent checkpointing,effective solution,parallel application,experimental evaluation,parallel machine,performance result,previous study,better result
Message logging,Computer science,Real-time computing,Fault tolerance,Need to know,Distributed memory systems,Distributed computing,Stable storage,Embedded system
Conference
Volume
ISSN
ISBN
1667
0302-9743
3-540-66483-1
Citations 
PageRank 
References 
1
0.39
11
Authors
2
Name
Order
Citations
PageRank
Luís Moura Silva131236.22
João Gabriel Silva261863.55