aHDFS: An Erasure-Coded Data Archival System for Hadoop Clusters. - Citegraph

Paper Info

Title
aHDFS: An Erasure-Coded Data Archival System for Hadoop Clusters.

Abstract
In this paper, we propose an erasure-coded data archival system called aHDFS for Hadoop clusters, where RS(k + r; k) codes are employed to archive data replicas in the Hadoop distributed file system or HDFS. We develop two archival strategies (i.e., aHDFS-Grouping and aHDFS-Pipeline) in aHDFSto speed up the data archival process. aHDFS-Groupinga MapReduce-based data archiving scheme - keeps each m...

Year	DOI	Venue
2017	10.1109/TPDS.2017.2706686	IEEE Transactions on Parallel and Distributed Systems
Keywords	Field	DocType
Mathematical model,Distributed databases,Redundancy,Encoding,Programming,Pipelines,Data models	Block size,Distributed File System,Data modeling,Computer science,Parallel computing,Real-time computing,Shuffling,Distributed database,Erasure code,Encoding (memory),Distributed computing,Speedup	Journal
Volume	Issue	ISSN
28	11	1045-9219
Citations	PageRank	References
3	0.50	21
Authors
5

Authors (5 rows)

Cited by (3 rows)

References (21 rows)

Name	Order	Citations	PageRank
Yuanqi Chen	1	6	3.92
Yi Zhou	2	230	32.97
Shubbhi Taneja	3	5	1.87
Xiao Qin	4	1836	125.69
Jianzhong Huang	5	87	19.32

1