Self-Organized Polynomial-Time Coordination Graphs | 0 | 0.34 | 2022 |
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | 0 | 0.34 | 2021 |
QPLEX: Duplex Dueling Multi-Agent Q-Learning | 0 | 0.34 | 2021 |
Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration | 0 | 0.34 | 2021 |
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration | 0 | 0.34 | 2021 |
Learning Subgoal Representations with Slow Dynamics | 0 | 0.34 | 2021 |
Offline Reinforcement Learning with Reverse Model-based Imagination | 0 | 0.34 | 2021 |
Influence-Based Multi-Agent Exploration | 0 | 0.34 | 2020 |
Object-Oriented Dynamics Learning Through Multi-Level Abstraction | 0 | 0.34 | 2020 |
Learning Nearly Decomposable Value Functions Via Communication Minimization | 0 | 0.34 | 2020 |