Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots. | 0 | 0.34 | 2022 |
SOPE: Spectrum of Off-Policy Estimators. | 0 | 0.34 | 2021 |
SCAPE - Learning Stiffness Control from Augmented Position Control Experiences. | 0 | 0.34 | 2021 |
Value Alignment Verification | 0 | 0.34 | 2021 |
Universal Off-Policy Evaluation. | 0 | 0.34 | 2021 |
Efficiently Guiding Imitation Learning Agents with Human Gaze. | 0 | 0.34 | 2021 |
Understanding the Relationship between Interactions and Outcomes in Human-in-the-Loop Machine Learning. | 0 | 0.34 | 2021 |
Distributional Depth-Based Estimation of Object Articulation Models. | 0 | 0.34 | 2021 |
Demonstration Of The Empathic Framework For Task Learning From Implicit Human Feedback | 0 | 0.34 | 2021 |
A Review Of Robot Learning For Manipulation: Challenges, Representations, And Algorithms | 0 | 0.34 | 2021 |
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory | 0 | 0.34 | 2021 |
Adversarial Intrinsic Motivation for Reinforcement Learning. | 0 | 0.34 | 2021 |
You Only Evaluate Once - a Simple Baseline Algorithm for Offline RL. | 0 | 0.34 | 2021 |
Importance Sampling In Reinforcement Learning With An Estimated Behavior Policy | 0 | 0.34 | 2021 |
Self-Supervised Online Reward Shaping in Sparse-Reward Environments | 0 | 0.34 | 2021 |
Bayesian Robust Optimization for Imitation Learning | 0 | 0.34 | 2020 |
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences | 0 | 0.34 | 2020 |
PixL2R - Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards. | 0 | 0.34 | 2020 |
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning. | 0 | 0.34 | 2020 |
Human Gaze Assisted Artificial Intelligence - A Review. | 0 | 0.34 | 2020 |
Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty | 0 | 0.34 | 2020 |
The EMPATHIC Framework for Task Learning from Implicit Human Feedback. | 0 | 0.34 | 2020 |
Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrations. | 0 | 0.34 | 2019 |
Learning from Corrective Demonstrations. | 0 | 0.34 | 2019 |
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations. | 1 | 0.35 | 2019 |
Uncertainty-Aware Data Aggregation for Deep Imitation Learning | 2 | 0.36 | 2019 |
Using Natural Language for Reward Shaping in Reinforcement Learning. | 1 | 0.36 | 2019 |
Enhancing Robot Learning with Human Social Cues. | 0 | 0.34 | 2019 |
Understanding Teacher Gaze Patterns for Robot Learning | 0 | 0.34 | 2019 |
Risk-Aware Active Inverse Reinforcement Learning. | 2 | 0.37 | 2019 |
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning | 2 | 0.38 | 2018 |
Asking for Help Effectively via Modeling of Human Beliefs. | 0 | 0.34 | 2018 |
Efficient Hierarchical Robot Motion Planning Under Uncertainty and Hybrid Dynamics. | 0 | 0.34 | 2018 |
Learning Multi-Step Robotic Tasks from Observation. | 0 | 0.34 | 2018 |
Towards Online Learning from Corrective Demonstrations. | 0 | 0.34 | 2018 |
Safe Reinforcement Learning via Shielding | 5 | 0.45 | 2018 |
LAAIR: A Layered Architecture for Autonomous Interactive Robots. | 0 | 0.34 | 2018 |
Importance Sampling Policy Evaluation with an Estimated Behavior Policy. | 2 | 0.37 | 2018 |
Human Gaze Following For Human-Robot Interaction | 1 | 0.35 | 2018 |
Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | 1 | 0.36 | 2018 |
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation. | 3 | 0.47 | 2017 |
Toward Probabilistic Safety Bounds for Robot Learning from Demonstration. | 2 | 0.38 | 2017 |
Data-Efficient Policy Evaluation Through Behavior Policy Search. | 0 | 0.34 | 2017 |
High Confidence Off-Policy Evaluation with Models. | 0 | 0.34 | 2016 |
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search. | 4 | 0.40 | 2016 |
Policy Evaluation Using the Ω-Return | 1 | 0.39 | 2015 |
Learning grounded finite-state representations from unstructured demonstrations | 30 | 1.06 | 2015 |
Active articulation model estimation through interactive perception | 12 | 0.58 | 2015 |
Online Bayesian changepoint detection for articulated motion models | 5 | 0.47 | 2015 |
Learning pouring skills from demonstration and practice | 0 | 0.34 | 2014 |