Abstract | ||
---|---|---|
Linux servers with heterogeneous architectures present a new challenge for fault management. With the significant increase in the numbers and types of hardware components, separate fault management becomes more complex and inefficient. It is clear that centralized management, automatic recovering and scalable design must be incorporated in the modern fault management system. Based on the ccNUMA architecture, the paper proposes a scalable fault management architecture, and studies the implementation technologies. It aims to enable computers to automatically detect error, diagnose error and handle fault. The architecture uses modular design and supports distributed environment with good extensibility and scalability. In practice, the architecture is effective and can raise the reliability of servers. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1109/INCoS.2011.35 | INCoS |
Keywords | Field | DocType |
heterogeneous architecture,scalable fault management architecture,ccnuma,scalable fault management,fault management,fault tolerance,linux server,modern fault management system,linux,scalable design,automatic recovering,diagnose error,ccnuma architecture,memory architecture,centralized management,ccnuma server,separate fault management,modular design,distributed environment,hardware,kernel,fault tolerant,servers,computer architecture | Space-based architecture,General protection fault,Computer science,Server,Fault management,Fault tolerance,Modular design,Memory architecture,Embedded system,Scalability,Distributed computing | Conference |
ISBN | Citations | PageRank |
978-1-4577-1908-0 | 0 | 0.34 |
References | Authors | |
3 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yan Yang | 1 | 0 | 0.68 |
Xingjun Zhang | 2 | 81 | 34.06 |
Endong Wang | 3 | 7 | 5.62 |
Nan Wu | 4 | 0 | 0.34 |
Xiaoshe Dong | 5 | 172 | 51.44 |