Title
Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning
Abstract
The increased use of deep neural networks has stimulated the growing demand for cloud-based model serving platforms. Serverless computing offers a simplified solution: users deploy models as serverless functions and let the platform handle provisioning and scaling. However, serverless functions have constrained resources in CPU and memory, making them inefficient or infeasible to serve large neura...
Year
DOI
Venue
2021
10.1109/ICDCS51616.2021.00022
2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS)
Keywords
DocType
ISSN
Deep learning,Costs,Machine learning algorithms,Computational modeling,Conferences,Neural networks,Inference algorithms
Conference
1063-6927
ISBN
Citations 
PageRank 
978-1-6654-4513-9
3
0.41
References 
Authors
0
6
Name
Order
Citations
PageRank
Minchen Yu130.75
Zhifeng Jiang230.41
Hok Chun Ng330.75
Wei Wang421817.82
Ruichuan Chen520518.95
Baochun Li69416614.20