Title
Software-Based Detecting and Recovering from ECC-Memory Faults
Abstract
According to the problem that the ECC cannot correct the multibit error in ECC memory, this paper proposes a memory error processing method on software level. On the foundation of revising the Linux kernel code, the method can discover this area of influence area of memory error according to seek the process information mapping to the mistaken address. This way can avoid wastage to the user due to the system halting caused by memory error. The experimental results show that the method can have a certain degree of memory error repair and do not affect the normal work of the system.
Year
DOI
Venue
2011
10.1109/INCoS.2011.148
INCoS
Keywords
Field
DocType
influence area,error handling,memory error,mistaken address,software-based detecting,system halting,certain degree,memory error processing method,storage management,information mapping process,ecc memory,ecc-memory fault,linux,fault diagnosis,software-based recovery,linux kernel code,software-based detection,ecc-memory faults,ecc,reverse mapping,memory error repair,multibit error,error correction code,reliability,servers,kernel
Kernel (linear algebra),ECC memory,Computer science,Server,Real-time computing,Software,Flat memory model,Computer engineering,Redundant array of independent memory,Memory errors,Distributed computing,Linux kernel
Conference
ISBN
Citations 
PageRank 
978-1-4577-1908-0
0
0.34
References 
Authors
3
6
Name
Order
Citations
PageRank
Xingjun Zhang18134.06
Endong Wang275.62
Dong Zhang312517.08
Yu Wang410347.77
Weiguo Wu514834.44
Xiaoshe Dong617251.44