Title
CORBA Based Runtime Support for Load Distribution and Fault Tolerance
Abstract
Parallel scientific computing in a distributed computing environment based on CORBA requires additional services not (yet) included in the CORBA specification: load distribution and fault tolerance. Both of them are essential for long running applications with high computational demands as in the case of computational engineering applications. The proposed approach for providing these services is based on integrating load distribution into the CORBA naming service which in turn relies on information provided by the underlying WINNER resource management system developed for typical networked Unix workstation environments. The support of fault tolerance is based on error detection and backward reco very by introducing proxy objects which manage checkpointing and restart of services in case of failures. A protoytpical implementation of the complete system is presented, and performance results obtained for the parallel optimization of a mathematical benchmark function are discussed.
Year
Venue
Keywords
2000
IPDPS Workshops
corba naming service,runtime support,complete system,high computational demand,fault tolerance,corba specification,parallel optimization,load distribution,computational engineering application,parallel scientific computing,additional service,distributed computing,fault tolerant
Field
DocType
Volume
Computational Science and Engineering,Object-oriented programming,Distributed Computing Environment,Computer science,Common Object Request Broker Architecture,Error detection and correction,Resource Management System,Fault tolerance,Systems architecture,Operating system,Distributed computing
Conference
1800
ISSN
ISBN
Citations 
0302-9743
3-540-67442-X
1
PageRank 
References 
Authors
0.37
4
5
Name
Order
Citations
PageRank
Thomas Barth1223.55
Gerd Flender210.37
Bernd Freisleben3137.35
Manfred Grauer410420.44
Frank Thilo5212.52