Title
Intelligent Agents for Fault Tolerance: From Multi-agent Simulation to Cluster-Based Implementation
Abstract
Recent research in multi-agent systems incorporate fault tolerance concepts, but does not explore the extension and implementation of such ideas for large scale parallel computing systems. The work reported in this paper investigates a swarm array computing approach, namely 'Intelligent Agents'. A task to be executed on a parallel computing system is decomposed to sub-tasks and mapped onto agents that traverse an abstracted hardware layer. The agents intercommunicate across processors to share information during the event of a predicted core/processor failure and for successfully completing the task. The feasibility of the approach is validated by simulations on an FPGA using a multi-agent simulator, and implementation of a parallel reduction algorithm on a computer cluster using the Message Passing Interface.
Year
DOI
Venue
2010
10.1109/WAINA.2010.21
Advanced Information Networking and Applications Workshops
Keywords
Field
DocType
fault tolerance,multi-agent system,cluster-based implementation,parallel computing system,parallel reduction algorithm,abstracted hardware layer,message passing interface,swarm array computing approach,large scale parallel computing,multi-agent simulation,intelligent agents,agents intercommunicate,multi-agent simulator,computational modeling,parallel processing,fault tolerant,fpga,intelligent agent,message passing,computer simulation,software fault tolerance,parallel algorithms,parallel computer,multi agent systems,hardware,multiagent systems,multi agent system,field programmable gate arrays
Intelligent agent,Parallel algorithm,Computer science,Software fault tolerance,Multi-agent system,Message Passing Interface,Fault tolerance,Message passing,Computer cluster,Distributed computing
Conference
ISBN
Citations 
PageRank 
978-1-4244-6701-3
2
0.38
References 
Authors
10
3
Name
Order
Citations
PageRank
Blesson Varghese135235.03
Gerard McKee2143.84
Vassil Alexandrov36111.49