Title
When Cloud Storage Meets Rdma
Abstract
A production-level cloud storage system must be high performing and readily available. It should also meet a Service-Level Agreement (SLA). The rapid advancement in storage media has left networking lagging behind, resulting in a major performance bottleneck for new cloud storage generations. Remote Direct Memory Access (RDMA) running on lossless fabrics can potentially overcome this bottleneck. In this paper, we present our experience in introducing RDMA into the storage networks of Pangu, a cloud storage system developed by Alibaba. Since its introduction in 2009, it has proven to be crucial for Alibaba's core businesses. In addition to the performance, availability, and SLA requirements, the deployment planning of Pangu at the production scale should consider storage volume and hardware costs. We present an RDMA-enabled Pangu system that exhibits superior performance, with the availability and SLA standards matching those of traditional TCP-backed versions. RDMA-enabled Pangu has been demonstrated to successfully serve numerous online mission-critical services across four years, including several important shopping festivals.
Year
Venue
DocType
2021
PROCEEDINGS OF THE 18TH USENIX SYMPOSIUM ON NETWORKED SYSTEM DESIGN AND IMPLEMENTATION
Conference
Citations 
PageRank 
References 
0
0.34
0
Authors
24
Name
Order
Citations
PageRank
Yixiao Gao131.41
Qiang Li259954.40
Lingbo Tang3261.85
Yongqing Xi450.80
Pengcheng Zhang550.80
Wenwen Peng600.34
Bo Li7121.22
Yaohui Wu800.34
Shaozong Liu900.34
Lei Yan1000.34
Fei Feng11261.85
Yan Zhuang12231.07
Fan Liu1300.34
Pan Liu1400.34
Xingkui Liu1500.34
Zhongjie Wu1641.07
Junping Wu1700.34
Zheng Cao18231.07
Chen Tian19378.36
Jinbo Wu2032.76
Jiaji Zhu2100.68
Haiyong Wang2200.34
Dennis Cai2300.34
Jiesheng Wu2411.03