Title
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Abstract
Tremendous success of machine learning (ML) and the unabated growth in model complexity motivated many ML-specific designs in hardware architectures to speed up the model inference. While these architectures are diverse, highly optimized low-precision arithmetic is a component shared by most. Nevertheless, recommender systems important to Facebook’s personalization services are demanding and compl...
Year
DOI
Venue
2021
10.1109/MM.2021.3081981
IEEE Micro
Keywords
DocType
Volume
Quantization (signal),Computational modeling,Production,Computer architecture,Adaptation models,Predictive models,Training
Journal
41
Issue
ISSN
Citations 
5
0272-1732
0
PageRank 
References 
Authors
0.34
0
20
Name
Order
Citations
PageRank
Zhaoxia100.34
Deng200.34
Jongsoo Park31039.49
Ping Tak Peter Tang422912.50
Haixin Liu501.35
Jie600.34
Yang7418.06
Hector Yuen861.10
Jianyu Huang992.83
Daya Khudia1000.34
Xiaohan Wei1101.69
Ellie Wen1200.34
Dhruv Choudhary13616.60
Raghuraman Krishnamoorthi14261.85
Carole-Jean Wu1501.35
Nadathur Satish16202099.88
Changkyu Kim17184898.89
Maxim Naumov1800.34
Sam Naghshineh1900.34
Mikhail Smelyanskiy2041.06