Title
DeltaFS: a scalable no-ground-truth filesystem for massively-parallel computing
Abstract
ABSTRACTHigh-Performance Computing (HPC) is known for its use of massive concurrency. But it can be challenging for a parallel filesystem's control plane to utilize cores when every client process must globally synchronize and serialize its metadata mutations with those of other clients. We present DeltaFS, a new paradigm for distributed filesystem metadata. DeltaFS allows jobs to self-commit their namespace changes to logs, avoiding the cost of global synchronization. Followup jobs selectively merge logs produced by previous jobs as needed, a principle we term No Ground Truth which allows for efficient data sharing. By avoiding unnecessary synchronization of metadata operations, DeltaFS improves metadata operation throughput up to 98X leveraging parallelism on the nodes where job processes run. This speedup grows as job size increases. DeltaFS enables efficient inter-job communication, reducing overall workflow runtime by significantly improving client metadata operation latency up to 49X and resource usage up to 52X.
Year
DOI
Venue
2021
10.1145/3458817.3476148
SC
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
7
Name
Order
Citations
PageRank
Qing Zheng1915.40
Charles D. Cranor258252.19
Gregory R. Ganger34560383.16
Garth A. Gibson484961.69
George Amvrosiadis511110.40
Bradley W. Settlemyer600.34
Gary A. Grider700.34