Title
Using Cloud Constructs and Predictive Analysis to Enable Pre-Failure Process Migration in HPC Systems
Abstract
Accurate failure prediction, combined with efficient process migration facilities that include some Cloud constructs, can enable failure avoidance in large-scale high performance computing (HPC) platforms. In this work we demonstrate a prototype system that combines our probabilistic failure prediction system with virtualization mechanisms and techniques to provide a whole-system approach to failure avoidance. The demonstration uses a failure scenario based on a real-world HPC case study.
Year
2010
DOI
10.1109/CCGRID.2010.31
Venue
Cluster, Cloud and Grid Computing
Keywords
failure scenario,failure avoidance,hpc systems,enable pre-failure process migration,probabilistic failure prediction system,prototype system,whole system approach,efficient process migration facility,large-scale high performance computing,cloud constructs,accurate failure prediction,real-world hpc case study,cloud construct,predictive analysis,grid computing,fault tolerance,fault tolerant,hpc,meteorology,investments,process control,process migration,application software,high performance computing,prototypes,migration,virtualization,cloud computing
Field
Virtualization,Grid computing,Supercomputer,Computer science,Process migration,Real-time computing,Fault tolerance,Process control,Probabilistic logic,Distributed computing,Cloud computing
DocType
Conference
ISBN
978-1-4244-6987-1
Citations
1
PageRank
0.38
References
12
Authors
9
Name | Order | Citations | PageRank
James Brandt | 1 | 1 | 0.38
Frank Chen | 2 | 1 | 0.38
Vincent De Sapio | 3 | 55 | 6.78
Ann C. Gentile | 4 | 37 | 7.91
Jackson Mayo | 5 | 43 | 7.97
Philippe P. Pébay | 6 | 273 | 27.36
Diana Roe | 7 | 134 | 8.01
David C. Thompson | 8 | 308 | 18.14
Matthew Wong | 9 | 1 | 0.38