Title
File Systems Fated for Senescence? Nonsense, Says Science!
Abstract
File systems must allocate space for files without knowing what will be added or removed in the future. Over the life of a file system, this may cause suboptimal file placement decisions which eventually lead to slower performance, or aging. Traditional file systems employ heuristics, such as collocating related files and data blocks, to avoid aging, and many file system implementors treat aging as a solved problem. However, this paper describes realistic as well as synthetic workloads that can cause these heuristics to fail, inducing large performance declines due to aging. For example, on ext4 and ZFS, a few hundred git pull operations can reduce read performance by a factor of 2; performing a thousand pulls can reduce performance by up to a factor of 30. We further present microbenchmarks demonstrating that common placement strategies are extremely sensitive to file-creation order; varying the creation order of a few thousand small files in a real-world directory structure can slow down reads by 15-175×, depending on the file system. We argue that these slowdowns are caused by poor layout. We demonstrate a correlation between read performance of a directory scan and the locality within a file system's access patterns, using a dynamic layout score. In short, many file systems are exquisitely prone to read aging for a variety of write workloads. We show, however, that aging is not inevitable. BetrFS, a file system based on write-optimized dictionaries, exhibits almost no aging in our experiments. BetrFS typically outperforms the other file systems in our benchmarks; aged BetrFS even outperforms the unaged versions of these file systems, excepting Btrfs. We present a framework for understanding and predicting aging, and identify the key features of BetrFS that avoid aging.
Year
Venue
Field
2017
FAST
Locality,File system,Computer science,Directory,ext4,Real-time computing,Journaling file system,Heuristics,File system fragmentation,Operating system,Directory structure
DocType
Citations 
PageRank 
Conference
8
0.47
References 
Authors
13
11
Name
Order
Citations
PageRank
Alexander Conway1112.22
Ainesh Bakshi2102.88
Yizheng Jiao3624.94
William Jannen4647.48
Yang Zhan5656.65
Jun Yuan6526.08
Michael A. Bender72144138.24
Rob Johnson856239.43
Bradley C. Kuszmaul91563146.28
Donald E. Porter10242.11
Martin Farach-Colton112402178.67