Matrix games with bandit feedback. | 0 | 0.34 | 2021 |
Sample Efficient Reinforcement Learning With Reinforce | 0 | 0.34 | 2021 |
Making Sense of Reinforcement Learning and Probabilistic Inference | 0 | 0.34 | 2020 |
Hamiltonian descent for composite objectives | 0 | 0.34 | 2019 |
Verification of Non-Linear Specifications for Neural Networks. | 2 | 0.36 | 2019 |
Verification of Non-Linear Specifications for Neural Networks. | 0 | 0.34 | 2019 |
Adversarial Risk and the Dangers of Evaluating Against Weak Attacks. | 18 | 0.74 | 2018 |
Training verified learners with learned verifiers. | 15 | 0.50 | 2018 |
Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles. | 0 | 0.34 | 2018 |
Variational Bayesian Reinforcement Learning with Regret Bounds. | 1 | 0.35 | 2018 |
The Uncertainty Bellman Equation and Exploration. | 12 | 0.53 | 2018 |
Conic Optimization via Operator Splitting and Homogeneous Self-Dual Embedding. | 57 | 1.98 | 2016 |
Performance Bounds and Suboptimal Policies for Multi–Period Investment | 16 | 0.94 | 2014 |
A Splitting Method for Optimal Control | 48 | 2.35 | 2013 |
Min-max approximate dynamic programming. | 3 | 0.42 | 2011 |