Title
One Phase Commit: A Low Overhead Atomic Commitment Protocol for Scalable Metadata Services
Abstract
As the number of client machines in high end computing clusters increases, the file system cannot keep up with the resulting volume of requests, using a centralized metadata server. This problem will be even more prominent with the advent of the exascale computing age. In this context, the centralized metadata server represents a bottleneck for the scaling of the file system performance as well as a single point of failure. To overcome this problem, file systems are evolving from centralized metadata services to distributed metadata services. The metadata distribution raises a number of additional problems that must be taken into account. In this paper we will focus on the problem of managing distributed namespace operations such as CREATE, DELETE and RENAME. Distributed namespace operations are a side effect of metadata distribution across the cluster of metadata servers. Available protocols for handling distributed namespace operations such as the two phase commitment protocol are expensive since they require the exchange of a large number of messages between metadata servers as well as synchronous writes to stable storage to log vital information. Moreover, such protocols adopt locking schemes to protect the resource during the operation, which force multiple operations on the same directory to be serialized. This severely impacts the performance of high performance computing applications in typical scenarios such as high rate of file create operations. We propose a one phase commit protocol that is tailored to the use for typical inter-metadata messages. We rely on a fast, highly available shared storage for metadata in order to minimize writes, messages, coordination overhead and recovery time in case of failing metadata servers. We present a formal description of the new protocol, a theoretical analysis of its capabilities, a proof of correctness and the evaluation of the protocol in a simulated environment that renders the protocol to be fast and reliable. In simulations the protocol achieved more than 50% better performance compared with the two phase commitment protocol.
Year
DOI
Venue
2012
10.1109/ClusterW.2012.16
Cluster Computing Workshops
Keywords
Field
DocType
available protocol,metadata distribution,new protocol,centralized metadata service,phase commitment protocol,phase commit,atomic commitment protocol,low overhead,metadata service,metadata server,better performance,centralized metadata server,namespace operation,scalable metadata services,meta data
Metadata,Metadata repository,File system,Meta Data Services,Computer science,Server,Parallel computing,Namespace,Storage Resource Broker,Scalability,Distributed computing
Conference
ISBN
Citations 
PageRank 
978-1-4673-2893-7
0
0.34
References 
Authors
7
4
Name
Order
Citations
PageRank
Giuseppe Congiu100.34
Matthias Grawinkel2324.43
Sai Narasimhamurthy3142.43
Andre Brinkmann420227.48