Title
Performance-Aware Data Placement in Hybrid Parallel File Systems
Abstract
Hybrid parallel file systems (PFSs), which combine HDD and SSD servers, are a promising solution for data-intensive applications. In this study, we propose a performance-aware data placement (PADP) strategy that enables efficient data layout in hybrid PFSs. The basic idea of PADP is to distribute data across file servers using adaptive, varied-size file stripes chosen according to each server's storage performance. A data access cost model and a linear programming optimization method determine the appropriate stripe size for each file server. We have implemented PADP within OrangeFS, a parallel file system widely used in the HPC domain. Experimental results with a representative benchmark show that PADP can significantly improve the I/O performance of hybrid PFSs.
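The abstract does not spell out the cost model, so the following is only a minimal sketch of the general idea of performance-aware striping, not the authors' PADP formulation. It assumes a hypothetical per-server cost of `startup_latency + stripe_bytes / bandwidth` and picks stripe sizes so that all servers that receive data finish at the same time, which minimizes the slowest server's completion time. That min-max problem can be written as a linear program; here it is solved with a simple closed-form pass instead of an LP solver. The function name `balanced_stripes` and all parameters are illustrative assumptions.

```python
def balanced_stripes(request_bytes, servers):
    """Split one request across servers so that all used servers finish together.

    servers: list of (startup_latency_s, bandwidth_bytes_per_s) tuples.
    Assumed (hypothetical) cost of server i: latency_i + stripe_i / bandwidth_i.
    Returns (stripes, finish_time), with stripes in the input order.
    """
    # Visit servers from lowest to highest startup latency.
    order = sorted(range(len(servers)), key=lambda i: servers[i][0])
    active = []
    finish = None
    for i in order:
        latency = servers[i][0]
        # A server whose startup latency alone reaches the current finish
        # time cannot help, and neither can any later (slower-starting) one.
        if finish is not None and latency >= finish:
            break
        active.append(i)
        total_bw = sum(servers[j][1] for j in active)
        weighted = sum(servers[j][0] * servers[j][1] for j in active)
        # Solve sum((finish - latency_j) * bw_j) == request_bytes for finish.
        finish = (request_bytes + weighted) / total_bw

    stripes = [0.0] * len(servers)
    for j in active:
        latency, bandwidth = servers[j]
        stripes[j] = (finish - latency) * bandwidth
    return stripes, finish


if __name__ == "__main__":
    # Hypothetical example: one slow server and one server three times faster.
    stripes, finish = balanced_stripes(1000, [(0.0, 100.0), (0.0, 300.0)])
    print(stripes, finish)
```

Under this assumed model, a faster server receives a proportionally larger stripe, and a server whose startup latency is too high for the request size receives none. A real hybrid PFS would also need to round stripe sizes to the file system's allowed granularity and account for access patterns, which this sketch ignores.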
Year
2014
Venue
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT I
Keywords
Parallel I/O System, Parallel File System, Solid State Drive
Field
File system, File server, Data layout, Computer science, Parallel computing, Server, Linear programming, Solid-state drive, Data access, Distributed computing
DocType
Conference
Volume
8630
ISSN
0302-9743
Citations
9
PageRank
0.48
References
13
Authors
4
Name          Order  Citations  PageRank
Shuibing He   1      109        20.45
Xian-he Sun   2      1987       182.64
Bo Feng       3      51         2.96
Kun Feng      4      34         4.32