Direction Matters: On the Implicit Bias of Stochastic Gradient Descent with Moderate Learning Rate. | 0 | 0.34 | 2021 |
How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks? | 0 | 0.34 | 2021 |
Neural Thompson Sampling. | 0 | 0.34 | 2021 |
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation. | 0 | 0.34 | 2021 |
On the Global Convergence of Training Deep Linear ResNets. | 0 | 0.34 | 2020 |
Stochastic Nested Variance Reduction for Nonconvex Optimization. | 0 | 0.34 | 2020 |
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction. | 0 | 0.34 | 2020 |
Improving Neural Language Generation with Spectrum Control. | 0 | 0.34 | 2020 |
Improving Adversarial Robustness Requires Revisiting Misclassified Examples. | 0 | 0.34 | 2020 |