Title
Scalable Cluster Administration " Chiba City I Approach and Lessons Learned
Abstract
Systems administrators of large clusters often need to perform the same administrative task hundreds or thousands of times. Administrators have traditionally performed some time-consuming tasks, such as operating system installation, configuration, and maintenance, manually. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control and scripted installation, configuration, and maintenance techniques, cluster administrators can automate these administrative tasks. Scalable cluster administration addresses this challenge: What hardware and software design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linux cluster; and we analyze the scalability, flexibility, performance and reliability benefits and limitations from that approach.
Year
DOI
Venue
2002
10.1109/CLUSTR.2002.1137749
CLUSTER
Keywords
Field
DocType
administrative task,314-node linux cluster,cluster administration,remote hardware control,administrative task hundred,large cluster,chiba city i approach,scalable cluster administration,cluster builder,lessons learned,maintenance technique,cluster administrator,computer science,mathematics,operating systems,system design,software design,hardware,linux cluster,memory management,linux,automatic control
File Transfer Protocol,Trivial File Transfer Protocol,Software design,Computer science,Parallel computing,Dynamic Host Configuration Protocol,Automation,Memory management,Operating system,Computer cluster,Scalability
Conference
ISBN
Citations 
PageRank 
0-7695-1745-5
7
1.40
References 
Authors
8
4
Name
Order
Citations
PageRank
John-paul Navarro111518.37
Rémy Evard22915.11
Dan Nurmi31015.66
Narayan Desai431929.73