Title |
---|
Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning |
Abstract |
---|
The increased use of deep neural networks has stimulated the growing demand for cloud-based model serving platforms. Serverless computing offers a simplified solution: users deploy models as serverless functions and let the platform handle provisioning and scaling. However, serverless functions have constrained resources in CPU and memory, making them inefficient or infeasible to serve large neura... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/ICDCS51616.2021.00022 | 2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS) |
Keywords | DocType | ISSN |
---|---|---|
Deep learning, Costs, Machine learning algorithms, Computational modeling, Conferences, Neural networks, Inference algorithms | Conference | 1063-6927 |
ISBN | Citations | PageRank |
---|---|---|
978-1-6654-4513-9 | 3 | 0.41 |
References | Authors |
---|---|
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Minchen Yu | 1 | 3 | 0.75 |
Zhifeng Jiang | 2 | 3 | 0.41 |
Hok Chun Ng | 3 | 3 | 0.75 |
Wei Wang | 4 | 218 | 17.82 |
Ruichuan Chen | 5 | 205 | 18.95 |
Baochun Li | 6 | 9416 | 614.20 |