Title | ||
---|---|---|
Cop-Flash: Utilizing hybrid storage to construct a large, efficient, and durable computational storage for DNN training |
Abstract | ||
---|---|---|
Traditional computing architectures that separate computing from storage face severe limitations when processing the data that is continuously produced in the cloud and at the edge. Recently, the computational storage device (CSD) is becoming one of the critical cloud infrastructures which can overcome these limitations. Many studies utilize CSD for DNN training to extract useful information and knowledge from the data quickly and efficiently. However, all previous work has used homogeneous storage, which is not fully considered the requirements of DNN training on CSD. Thus, we exploit the leverage of hybrid NAND flash memory to optimize this problem. Nevertheless, typical hybrid storage architectures have limitations when used for DNN training. Moreover, their management strategies can not fully exploit the heterogeneity of hybrid flash memory. To address this issue, we propose a novel SLC-TLC flash memory called Co-Partitioning Flash (Cop-Flash), which utilizes two different hybrid flash memory partitioning methods to divide storage into three different properties of flash memory. Meanwhile, two key technologies are included in Cop-Flash: 1) lifetime-based I/O identifier is proposed to identify data hotness according to data lifetime to maximize the benefits of heterogeneity and minimize the impact of garbage collection. 2) Erase-aware Adaptive Dual-zone Management is proposed to increase bandwidth utilization and guarantee system reliability. We compared Cop-Flash with two related state-of-the-art hybrid storage using hard partitioning and soft partitioning as well as TLC-only flash memory under real DNN training workloads. Experimental results show that Cop-Flash improves the performance by 29.1%, 38.8%, 56.6% and outperforms them by 2.3x, 1.29x, and 8.3x in terms of lifespan. |
Year | DOI | Venue |
---|---|---|
2022 | 10.1109/CLOUD55607.2022.00041 | 2022 IEEE 15th International Conference on Cloud Computing (CLOUD) |
Keywords | DocType | ISSN |
In-storage computing,DNN training,Hybrid Storage,SLC-TLC | Conference | 2159-6182 |
ISBN | Citations | PageRank |
978-1-6654-8138-0 | 0 | 0.34 |
References | Authors | |
17 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chunhua Xiao | 1 | 0 | 8.45 |
Shi Qiu | 2 | 0 | 1.01 |
Dandan Xu | 3 | 0 | 1.35 |