Title
Scalable RDMA RPC on Reliable Connection with Efficient Resource Sharing
Abstract
RDMA provides extremely low latency and high bandwidth to distributed systems. Unfortunately, it fails to scale and suffers from performance degradation when transferring data to an increasing number of targets on Reliable Connection (RC). We observe that the above scalability issue has its root in the resource contention in the NIC cache, the CPU cache and the memory of each server. In this paper, we propose ScaleRPC, an efficient RPC primitive using one-sided RDMA verbs on reliable connection to provide scalable performance. To effectively alleviate the resource contention, ScaleRPC introduces 1) connection grouping to organize the network connections into groups, so as to balance the saturation and thrashing of the NIC cache; 2) virtualized mapping to enable a single message pool to be shared by different groups of connections, which reduces CPU cache misses and improve memory utilization. Such scalable connection management provides substantial performance benefits: By deploying ScaleRPC both in a distributed file system and a distributed transactional system, we observe that it achieves high scalability and respectively improves performance by up to 90% and 160% for metadata accessing and SmallBank transaction processing.
Year
DOI
Venue
2019
10.1145/3302424.3303968
Proceedings of the Fourteenth EuroSys Conference 2019
Keywords
Field
DocType
RDMA, Resource Sharing, Scalability
Distributed File System,Transaction processing,CPU cache,Computer science,Cache,Thrashing,Remote direct memory access,Shared resource,Distributed computing,Scalability
Conference
ISBN
Citations 
PageRank 
978-1-4503-6281-8
4
0.41
References 
Authors
0
3
Name
Order
Citations
PageRank
Youmin Chen1284.24
Youyou Lu235630.81
Jiwu Shu370972.71