Title
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
Abstract
Personalized recommendation systems leverage deep learning models and account for the majority of data center AI cycles. Their performance is dominated by memory-bound sparse embedding operations with unique irregular memory access patterns that pose a fundamental challenge to accelerate. This paper proposes a lightweight, commodity DRAM compliant, near-memory processing solution to accelerate personalized recommendation inference. The in-depth characterization of production-grade recommendation models shows that embedding operations with high model-, operator- and data-level parallelism lead to memory bandwidth saturation, limiting recommendation inference performance. We propose RecNMP which provides a scalable solution to improve system throughput, supporting a broad range of sparse embedding models. RecNMP is specifically tailored to production environments with heavy co-location of operators on a single server. Several hardware/software co-optimization techniques such as memory-side caching, table-aware packet scheduling, and hot entry profiling are studied, providing up to $9.8 \times$ memory latency speedup over a highly-optimized baseline. Overall, RecNMP offers $4.2 \times$ throughput improvement and 45.8% memory energy savings.
Year
DOI
Venue
2020
10.1109/ISCA45697.2020.00070
2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA)
DocType
ISSN
ISBN
Conference
0884-7495
978-1-7281-4661-4
Citations 
PageRank 
References 
19
0.83
60
Authors
21
Name
Order
Citations
PageRank
Liu Ke1293.20
Udit Gupta2746.27
Benjamin Youngjae Cho3190.83
David Brooks45518422.08
Vikas Chandra569159.76
Utku Diril6190.83
Amin Firoozshahian7190.83
Kim M. Hazelwood82465110.46
Bill Jia91265.90
Hsien-Hsin Sean Lee101657102.66
Meng Li11191.84
Bert Maher12190.83
Dheevatsa Mudigere1328919.84
Maxim Naumov146810.29
Martin Schatz15190.83
Mikhail Smelyanskiy16116065.96
Xiaodong Wang171266.24
Brandon Reagen1821013.90
Carole-Jean Wu1943223.81
Mark Hempstead2098081.39
xuan zhang219325.30