The Pareto Frontier of Instance-Dependent Guarantees in Multi-Player Multi-Armed Bandits with no Communication. | 0 | 0.34 | 2022 |
Cooperative and Stochastic Multi-Player Multi-Armed Bandit - Optimal Regret With Neither Communication Nor Collisions. | 0 | 0.34 | 2021 |
First-Order Regret Analysis of Thompson Sampling. | 0 | 0.34 | 2019 |
Stabilizing a system with an unbounded random gain using only a finite number of bits. | 1 | 0.37 | 2018 |
Approximating Continuous Functions by ReLU Nets of Minimal Width. | 13 | 0.72 | 2017 |