Title
NT-SwiFT: software implemented fault tolerance on Windows NT
Abstract
Today, there are increasing demands to make application software more tolerant to failures. Fault-tolerant applications detect and recover from failures that are not handled by the application's underlying hardware or operating system, In recent years, an increasing number of highly available applications are being implemented on Windows NT. However, the current version of Windows (NT4.0, 2000) and its utilities, such as Microsoft Cluster Server (MSCS), do not provide some facilities (such as transparent checkpointing, and message logging) that are needed to implement fault-tolerant applications. In this paper, we describe a set of reusable software components collectively named software implemented fault tolerance (NT-SwiFT) that facilitates building fault-tolerant and highly available applications on Windows NT, 2000. NT-SwiFT provides components for automatic error detection and recovery, checkpointing, event logging and replay, and communication error recovery, and incremental data replication. Using NT-SwiFT , we conducted fault injection experiments on three commercial server applications--Apache web server, Microsoft IIS web server, and Microsoft SQL to study the failure coverage and the overhead of NT-SwiFT components. Preliminary results show that NT-SwiFT can detect and recover more application failures than MSCS does in all three applications.
Year
DOI
Venue
2004
10.1016/S0164-1212(02)00154-1
Journal of Systems and Software
Keywords
Field
DocType
application software,microsoft cluster server,automatic error detection and recovery,microsoft iis web server,checkpointing,apache web server,application failure,windows nt,software implemented fault tolerance,nt-swift component,and incremental data replications,fault-tolerant application,available application,communication error recovery,fault tolerance,microsoft sql,event logging and replay,data replication,software component,fault tolerant,high availability,operating system
Server Message Block,LAN Manager,Windows NT,Computer science,Internet Authentication Service,Microsoft Transaction Server,Real-time computing,Next-Generation Secure Computing Base,Group Policy,Operating system,Windows Vista,Embedded system
Journal
Volume
Issue
ISSN
71
1-2
The Journal of Systems & Software
Citations 
PageRank 
References 
22
2.81
7
Authors
7
Name
Order
Citations
PageRank
Deron Liang136997.60
P. Emerald Chung215419.69
Yennun Huang3738106.38
Chandra Kintala419629.54
Woei-Jyh Lee514717.08
Timothy K. Tsai664756.27
Chung-Yih Wang7344.31