Crossmodal attentive skill learner: learning in Atari and beyond with audio-video inputs | 0 | 0.34 | 2020 |
Crossmodal Attentive Skill Learner | 0 | 0.34 | 2018 |
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability | 27 | 0.83 | 2017 |
Efficient PAC-Optimal Exploration in Concurrent, Continuous State MDPs with Delayed Updates | 6 | 0.51 | 2016 |
Improving PAC Exploration Using the Median of Means | 0 | 0.34 | 2016 |
PAC Optimal Exploration in Continuous Space Markov Decision Processes | 19 | 0.83 | 2013 |
Sample Complexity and Performance Bounds for Non-Parametric Approximate Linear Programming | 1 | 0.36 | 2013 |
Generalized Value Functions for Large Action Sets | 11 | 0.66 | 2011 |
Reinforcement learning in multidimensional continuous action spaces | 13 | 0.77 | 2011 |
Non-Parametric Approximate Linear Programming for MDPs | 10 | 0.69 | 2011 |
Learning continuous-action control policies | 4 | 0.54 | 2009 |
Binary action search for learning continuous-action control policies | 13 | 0.77 | 2009 |