Evaluating Model-Based Planning and Planner Amortization for Continuous Control | 0 | 0.34 | 2022 |
From motor control to team play in simulated humanoid football. | 0 | 0.34 | 2022 |
Deep neuroethology of a virtual rodent | 0 | 0.34 | 2020 |
Catch & Carry: reusable neural controllers for vision-guided whole-body tasks | 1 | 0.35 | 2020 |
Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning. | 1 | 0.35 | 2019 |
Learning Awareness Models. | 0 | 0.34 | 2018 |
Relative Entropy Regularized Policy Iteration. | 2 | 0.36 | 2018 |
Safe Exploration in Continuous Action Spaces. | 4 | 0.42 | 2018 |
Maximum a Posteriori Policy Optimisation. | 16 | 0.57 | 2018 |
DeepMind Control Suite. | 0 | 0.34 | 2018 |
Learning human behaviors from motion capture by adversarial imitation. | 17 | 0.66 | 2017 |
Emergence of Locomotion Behaviours in Rich Environments. | 52 | 1.67 | 2017 |
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation. | 15 | 0.75 | 2017 |
Learning and Transfer of Modulated Locomotor Controllers. | 21 | 0.94 | 2016 |
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models | 0 | 0.34 | 2016 |
Continuous control with deep reinforcement learning | 0 | 0.34 | 2016 |
Learning Continuous Control Policies by Stochastic Value Gradients. | 0 | 0.34 | 2015 |
Continuous control with deep reinforcement learning | 418 | 14.63 | 2015 |
Real-time behaviour synthesis for dynamic hand-manipulation | 6 | 0.49 | 2014 |
Physically-consistent sensor fusion in contact-rich behaviors | 1 | 0.41 | 2014 |
Control-limited differential dynamic programming | 61 | 2.10 | 2014 |
Value Function Approximation And Model Predictive Control | 6 | 0.70 | 2013 |
STAC: Simultaneous tracking and calibration | 0 | 0.34 | 2013 |
Synthesis and stabilization of complex behaviors through online trajectory optimization. | 116 | 6.87 | 2012 |
MuJoCo: A physics engine for model-based control | 298 | 13.13 | 2012 |
High-order local dynamic programming. | 0 | 0.34 | 2011 |
Iterative local dynamic programming. | 5 | 0.75 | 2009 |
Least Squares Solutions of the HJB Equation With Neural Network Value-Function Approximators | 17 | 1.42 | 2007 |
Receding Horizon Differential Dynamic Programming | 40 | 2.40 | 2007 |