Reinforcement learning in many-agent settings under partial observability | 0 | 0.34 | 2022 |
Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards | 0 | 0.34 | 2021 |
Min-Max Entropy Inverse RL of Multiple Tasks | 0 | 0.34 | 2021 |
I2RL: Online Inverse Reinforcement Learning Under Occlusion | 0 | 0.34 | 2021 |
PALO bounds for reinforcement learning in partially observable stochastic games | 0 | 0.34 | 2021 |
Online Inverse Reinforcement Learning Under Occlusion | 0 | 0.34 | 2019 |
Model-Free IRL using Maximum Likelihood Estimation | 0 | 0.34 | 2019 |
A Framework and Method for Online Inverse Reinforcement Learning | 0 | 0.34 | 2018 |
Autonomous Acquisition Of Behavior Trees For Robot Control | 4 | 0.43 | 2018 |
Multirobot Systems | 0 | 0.34 | 2017 |
Multiagent Path Finding With Persistence Conflicts | 0 | 0.34 | 2017 |
Exact and Heuristic Algorithms for Risk-Aware Stochastic Physical Search | 0 | 0.34 | 2017 |
Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds | 2 | 0.37 | 2016 |
Multi-agent reinforcement learning as a rehearsal for decentralized planning | 19 | 0.87 | 2016 |
Detection of Plan Deviation in Multi-Agent Systems | 1 | 0.35 | 2016 |
The complexity of multi-agent plan recognition | 3 | 0.38 | 2015 |
Stackelberg Surveillance | 0 | 0.34 | 2015 |
Reinforcement Learning of Informed Initial Policies for Decentralized Planning | 1 | 0.34 | 2014 |
Concurrent reinforcement learning as a rehearsal for decentralized planning under uncertainty | 1 | 0.37 | 2013 |
Pruning for Monte Carlo Distributed Reinforcement Learning in Decentralized POMDPs | 1 | 0.35 | 2013 |
Informed Initial Policies for Learning in Dec-POMDPs | 3 | 0.45 | 2012 |
Efficient context free parsing of multi-agent activities for team and plan recognition | 0 | 0.34 | 2012 |
Sample Bounded Distributed Reinforcement Learning for Decentralized POMDPs | 9 | 0.55 | 2012 |
Strategic best-response learning in multiagent systems | 1 | 0.40 | 2012 |
Branch and Price for Multi-Agent Plan Recognition | 0 | 0.34 | 2011 |
Adaptive multi-robot team reconfiguration using a policy-reuse reinforcement learning approach | 2 | 0.38 | 2011 |
Action Discovery for Single and Multi-Agent Reinforcement Learning | 1 | 0.35 | 2011 |
Search Performance of Multi-Agent Plan Recognition in a General Model | 0 | 0.34 | 2010 |
Fast A* With Iterative Resolution For Navigation | 3 | 0.41 | 2010 |
Evaluation and comparison of multi-agent based crowd simulation systems | 5 | 0.46 | 2010 |
Validation of agent based crowd egress simulation | 3 | 0.44 | 2010 |
Coalition structure generation in multi-agent systems with mixed externalities | 15 | 0.78 | 2010 |
Action discovery for reinforcement learning | 1 | 0.35 | 2010 |
Layered Intelligence for Agent-based Crowd Simulation | 10 | 0.89 | 2009 |
Congestion Avoidance in Multi-Agent-based Egress Simulation | 2 | 0.43 | 2008 |
Advancing the Layered Approach to Agent-Based Crowd Simulation | 18 | 2.11 | 2008 |
General game learning using knowledge transfer | 45 | 2.95 | 2007 |
Generalized multiagent learning with performance bound | 11 | 0.56 | 2007 |
Reactivity and Safe Learning in Multi-Agent Systems | 0 | 0.34 | 2006 |
RVσ(t): a unifying approach to performance and convergence in online multiagent learning | 3 | 0.47 | 2006 |
Efficient no-regret multiagent learning | 11 | 0.91 | 2005 |
On the performance of on-line concurrent reinforcement learners | 0 | 0.34 | 2005 |
Efficient learning of multi-step best response | 14 | 1.10 | 2005 |
Unifying convergence and no-regret in multiagent learning | 0 | 0.34 | 2005 |
Performance bounded reinforcement learning in strategic interactions | 21 | 1.16 | 2004 |
The Role of Reactivity in Multiagent Learning | 3 | 0.39 | 2004 |
On-policy concurrent reinforcement learning | 3 | 0.39 | 2004 |
Adaptive policy gradient in multiagent learning | 14 | 0.95 | 2003 |
Kernel Index for Relevance feedback Retrieval | 0 | 0.34 | 2002 |