Title
A Scalable Fault Management Architecture for ccNUMA Server
Abstract
Linux servers with heterogeneous architectures present a new challenge for fault management. With the significant increase in the numbers and types of hardware components, separate fault management becomes more complex and inefficient. It is clear that centralized management, automatic recovering and scalable design must be incorporated in the modern fault management system. Based on the ccNUMA architecture, the paper proposes a scalable fault management architecture, and studies the implementation technologies. It aims to enable computers to automatically detect error, diagnose error and handle fault. The architecture uses modular design and supports distributed environment with good extensibility and scalability. In practice, the architecture is effective and can raise the reliability of servers.
Year
DOI
Venue
2011
10.1109/INCoS.2011.35
INCoS
Keywords
Field
DocType
heterogeneous architecture,scalable fault management architecture,ccnuma,scalable fault management,fault management,fault tolerance,linux server,modern fault management system,linux,scalable design,automatic recovering,diagnose error,ccnuma architecture,memory architecture,centralized management,ccnuma server,separate fault management,modular design,distributed environment,hardware,kernel,fault tolerant,servers,computer architecture
Space-based architecture,General protection fault,Computer science,Server,Fault management,Fault tolerance,Modular design,Memory architecture,Embedded system,Scalability,Distributed computing
Conference
ISBN
Citations 
PageRank 
978-1-4577-1908-0
0
0.34
References 
Authors
3
5
Name
Order
Citations
PageRank
Yan Yang100.68
Xingjun Zhang28134.06
Endong Wang375.62
Nan Wu400.34
Xiaoshe Dong517251.44