Learning to Play No-Press Diplomacy with Best Response Policy Iteration | 0 | 0.34 | 2020 |
The Imitation Game: Learned Reciprocity in Markov games | 0 | 0.34 | 2019 |
Should I tear down this wall? Optimizing social metrics by evaluating novel actions | 0 | 0.34 | 2017 |
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations | 55 | 2.29 | 2017 |
The AGI Containment Problem | 4 | 0.45 | 2016 |
A generalized-zero-preserving method for compact encoding of concept lattices | 4 | 0.50 | 2010 |