Title
A Migratory Heterogeneity-Aware Data Layout Scheme For Parallel File Systems
Abstract
Parallel file systems (PFSs) are widely deployed to improve the I/O performance of high-performance computing (HPC) applications. In recent years, hybrid PFSs that consist of both HDD and SSD servers have attracted much attention in the HPC community. However, existing data layout schemes do not adequately consider the characteristics of heterogeneous servers and heterogeneous access patterns, and may therefore suffer considerable inefficiencies. In this study, we propose MHA, a migratory heterogeneity-aware data layout scheme that improves the data distribution of hybrid PFSs. More specifically, to accommodate heterogeneous access patterns, MHA first migrates file data into several regions, each with similar access patterns. Then, by leveraging a data access cost model, MHA determines the appropriate stripe sizes on heterogeneous servers to get the best performance in each region. We have implemented MHA within the MPI-IO library on top of the OrangeFS file system. Experimental results show that MHA can significantly improve the I/O performance of hybrid PFSs compared to existing data layout schemes.
Year
2018
DOI
10.1109/IPDPS.2018.00122
Venue
2018 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Keywords
Parallel I/O System, Parallel File System, Solid State Drive, Data Layout
Field
Data modeling, File system, Data layout, Computer science, Server, Solid-state drive, Data access, Speedup, Distributed computing
DocType
Conference
ISSN
1530-2075
Citations
0
PageRank
0.34
References
0
Authors
4
Name          Order  Citations  PageRank
Shuibing He   1      109        20.45
Xian-he Sun   2      1987       182.64
Yang Wang     3      83         9.61
Z. Chen       4      3443       271.62