A Convergence Theory for SVGD in the Population Limit under Talagrand's Inequality T1. | 0 | 0.34 | 2022 |
Scaling Distributed Machine Learning With In-Network Aggregation | 0 | 0.34 | 2021 |
FL_PyTorch: optimization research simulator for federated learning | 0 | 0.34 | 2021 |
Hyperparameter Transfer Learning With Adaptive Complexity | 0 | 0.34 | 2021 |
Page: A Simple And Optimal Probabilistic Gradient Estimator For Nonconvex Optimization | 0 | 0.34 | 2021 |
Linearly Converging Error Compensated SGD | 0 | 0.34 | 2020 |
A Stochastic Derivative Free Optimization Method with Momentum | 1 | 0.35 | 2020 |
SGD with Arbitrary Sampling: General Analysis and Improved Rates | 0 | 0.34 | 2019 |
Stochastic Spectral and Conjugate Descent Methods. | 0 | 0.34 | 2018 |
Coordinate Descent Faceoff: Primal or Dual? | 0 | 0.34 | 2018 |
Randomized Block Cubic Newton Method. | 1 | 0.35 | 2018 |
Stochastic Dual Coordinate Ascent with Adaptive Probabilities. | 24 | 0.86 | 2015 |
Adding vs. Averaging in Distributed Primal-Dual Optimization. | 26 | 1.06 | 2015 |