Title
Resilient error-bounded lossy compressor for data transfer
Abstract
ABSTRACTToday's exa-scale scientific applications or advanced instruments are producing vast volumes of data, which need to be shared/transferred through the network/devices with relatively low bandwidth (e.g., data sharing on WAN or transferring from edge devices to supercomputers). Lossy compression is one of the candidate strategies to address the big data issue. However, little work was done to make it resilient against silent errors, which may happen during the stage of compression or data transferring. In this paper, we propose a resilient error-bounded lossy compressor based on the SZ compression framework. Specifically, we design a new independent-block-wise model that decomposes the entire dataset into many independent sub-blocks to compress. Then, we design and implement a series of error detection/correction strategies elaboratively for each stage of SZ. Our method is arguably the first algorithm-based fault tolerance (ABFT) solution for lossy compression. Our proposed solution incurs negligible execution overhead in the fault-free situation. Upon soft errors happening, it ensures decompressed data strictly bounded within user's requirement with a very limited degradation of compression ratio and low overhead.
Year
DOI
Venue
2021
10.1145/3458817.3476195
The International Conference for High Performance Computing, Networking, Storage, and Analysis
Keywords
DocType
ISSN
Lossy compression,Algorithm Based Fault Tolerance,data transfer
Conference
2167-4329
ISBN
Citations 
PageRank 
978-1-6654-8390-2
2
0.36
References 
Authors
27
6
Name
Order
Citations
PageRank
Sihuan Li1656.01
Sheng Di273755.88
Kai Zhao3194.32
Xin Liang410712.74
Zizhong Chen592469.93
Franck Cappello63775251.47