Title
MBFS: a parallel metadata search method based on Bloomfilters using MapReduce for large-scale file systems
Abstract
Metadata search is an important way to access and manage file systems, and many solutions have been proposed to improve its performance. Existing solutions either build a separate metadata index inside or outside the file system using a dedicated data structure or database, use semantics and event notification to construct the index structure, or use sampling to search metadata directly on the namespace. These approaches suffer from high I/O overhead for keeping the metadata indexes consistent with the metadata, large space overhead for storing the indexes, and low result accuracy, among other problems. To address these problems, this paper presents MBFS, a fast, accurate and lightweight metadata search method based on multi-dimensional Bloomfilters. A multi-dimensional Bloomfilter structure is built on top of each directory entry so that sub-trees can be pruned, narrowing the portion of the namespace that must be searched. MBFS can produce fast and accurate answers for a class of complex searches over a file system using only a small number of disk accesses. Because MBFS resides in the file system, it needs no additional I/O to maintain consistency. Because it consists of Bloomfilters, which are arrays of bits, it is lightweight and adds only marginal space overhead. Moreover, MBFS employs MapReduce to speed up search in environments with multiple metadata servers. Extensive experiments are conducted to evaluate the effectiveness of MBFS. The experimental results show that MBFS achieves low search latency and high result accuracy with low space and time overhead.
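The pruning idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `BloomFilter` and `Directory` classes, the `search` function, the filter size, the hash scheme and the example attributes (`ext`, `owner`) are all assumptions made for illustration, and the MapReduce distribution across metadata servers is not modeled. Each directory keeps one Bloom filter per attribute (one "dimension"), summarizing every file in its sub-tree; a search skips any sub-tree whose filter reports that a queried value is definitely absent.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter; a Python int serves as the bit array."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        # False means "definitely absent"; True may be a false positive.
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


class Directory:
    """Hypothetical namespace node with one Bloom filter per attribute."""

    def __init__(self, name):
        self.name = name
        self.files = []      # list of (filename, {attribute: value})
        self.children = []   # sub-directories
        self.filters = {}    # attribute name -> BloomFilter over the sub-tree

    def build_filters(self):
        """Summarize all files in this sub-tree, bottom-up."""
        subtree_attrs = [attrs for _, attrs in self.files]
        for child in self.children:
            subtree_attrs.extend(child.build_filters())
        self.filters = {}
        for attrs in subtree_attrs:
            for key, value in attrs.items():
                self.filters.setdefault(key, BloomFilter()).add(str(value))
        return subtree_attrs


def search(directory, query, path=""):
    """Return paths of files matching every attribute/value pair in query."""
    for key, value in query.items():
        bf = directory.filters.get(key)
        if bf is None or not bf.might_contain(str(value)):
            return []  # prune: no file below this directory can match
    here = f"{path}/{directory.name}"
    # Verify against real metadata, so filter false positives never leak out.
    hits = [
        f"{here}/{filename}"
        for filename, attrs in directory.files
        if all(attrs.get(k) == v for k, v in query.items())
    ]
    for child in directory.children:
        hits.extend(search(child, query, here))
    return hits


# Small example namespace (made-up files and attributes).
root = Directory("root")
home = Directory("home")
var = Directory("var")
root.children = [home, var]
home.files.append(("notes.txt", {"ext": "txt", "owner": "alice"}))
var.files.append(("syslog.log", {"ext": "log", "owner": "root"}))
root.build_filters()

print(search(root, {"ext": "log", "owner": "root"}))  # ['/root/var/syslog.log']
```

Because a Bloom filter never yields false negatives, pruning cannot drop a real match; a false positive only costs an unnecessary directory scan, and the final check against the stored attributes keeps the result exact in this sketch.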
Year: 2016
DOI: 10.1007/s11227-015-1464-2
Venue: The Journal of Supercomputing
Keywords: Large-scale file systems, Parallel metadata search, Fast and accurate, Lightweight, Multi-dimensional Bloomfilters
Field: Data structure, Metadata, Metadata repository, File system, Directory, Computer science, Parallel computing, Server, Namespace, Semantics, Database, Distributed computing
DocType: Journal
Volume: 72
Issue: 8
ISSN: 0920-8542
Citations: 4
PageRank: 0.54
References: 32
Authors (8)
Name | Order | Citations | PageRank
zhisheng huo | 1 | 11 | 7.61
Limin Xiao | 2 | 107 | 28.51
qiaoling zhong | 3 | 4 | 0.54
shupan li | 4 | 8 | 3.71
ang li | 5 | 4 | 0.54
Ruan Li | 6 | 26 | 6.75
shouxin wang | 7 | 4 | 0.54
lihong fu | 8 | 4 | 0.54