Title
Personal adaptive clusters as containers for scientific jobs
Abstract
We describe a system for creating personal clusters in user-space to support the submission and management of thousands of compute-intensive serial jobs to the network-connected compute resources on the NSF TeraGrid. The system implements a robust infrastructure that submits and manages job proxies across a distributed computing environment. These job proxies contribute resources to personal clusters created dynamically for a user on-demand. The personal clusters then adapt to the prevailing job load conditions at the distributed sites by migrating job proxies to sites expected to provide resources more quickly. Furthermore, the system allows multiple instances of these personal clusters to be created as containers for individual scientific experiments, allowing the submission environment to be customized for each instance. The version of the system described in this paper allows users to build large personal Condor and Sun Grid Engine clusters on the TeraGrid. Users then manage their scientific jobs, within each personal cluster, with a single uniform interface using the feature-rich functionality found in these job management environments.
Year
DOI
Venue
2007
10.1007/s10586-007-0028-5
Cluster Computing
Keywords
Field
DocType
Cooperative systems,Distributed computing,Resource management
Resource management,TeraGrid,Cluster (physics),Job management,Distributed Computing Environment,Computer science,Database,Grid,Distributed computing
Journal
Volume
Issue
ISSN
10
3
1386-7857
Citations 
PageRank 
References 
16
1.24
7
Authors
4
Name
Order
Citations
PageRank
Edward Walker11049.61
Jeffrey P. Gardner21219.54
Vladimir Litvin3161.24
Evan L. Turner4161.24