Developing, evaluating and scaling learning agents in multi-agent environments | 0 | 0.34 | 2022 |
From Poincare Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization | 0 | 0.34 | 2021 |
Learning to Play No-Press Diplomacy with Best Response Policy Iteration | 0 | 0.34 | 2020 |
Smooth markets: A basic mechanism for organizing gradient-based learners | 0 | 0.34 | 2020 |
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games | 1 | 0.34 | 2020 |
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees. | 1 | 0.35 | 2019 |
Thinking Fast and Slow with Deep Learning and Tree Search. | 21 | 0.99 | 2017 |