Title
A Recoverable Distributed Shared Memory Integrating Coherence and Recoverability
Abstract
Large-scale distributed systems are very attractive for executing parallel applications that require huge computing power. However, their high probability of site failure is unacceptable, especially for long-running applications. In this paper, we address this problem and propose a checkpointing mechanism relying on a recoverable distributed shared memory (DSM) in order to tolerate a single node failure. Although most recoverable DSMs require specific hardware to store recovery data, our scheme uses standard memories to store both current and recovery data. Moreover, the management of recovery data is merged with the management of current data by extending the DSM's coherence protocol. This approach limits hardware development and takes advantage of the data replication provided by a DSM in order to limit the number of pages transferred during checkpointing. The paper also presents an implementation and a preliminary performance evaluation of our recoverable DSM on an Intel Paragon with 56 nodes.
Year: 1995
DOI: 10.1109/FTCS.1995.466970
Venue: Pasadena, CA, USA
Keywords: recoverable dsms, intel paragon, standard dsm, site failure, performance degradation, recoverable dsm, checkpointing mechanism, specific hardware, single node failure, recovery data, coherence protocol, shared memory, standard memory, hardware development, current data, preliminary performance evaluation, fault tolerance, hardware, data replication, workstations, stability, scalability, protocols, concurrent computing, distributed shared memory, coherence, distributed computing
Field: Intel Paragon, Replication (computing), Computer science, Workstation, Coherence (physics), Fault tolerance, Concurrent computing, Distributed shared memory, Distributed computing, Scalability
DocType: Conference
ISSN: 0731-3071
ISBN: 0-8186-7079-7
Citations: 35
PageRank: 5.71
References: 17
Authors: 5
Name                  Order  Citations  PageRank
Anne-Marie Kermarrec  1      6649       453.63
Gilbert Cabillic      2      87         12.88
Alain Gefflaut        3      176        24.33
Christine Morin       4      226        26.78
Isabelle Puaut        5      1708       89.84