Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions. - Citegraph

Paper Info

Title
Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions.

Abstract
In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP (MMDP) setting such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than available alternatives and finds solutions to previously unsolvable problems.

Year	Venue	Field
2016	THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE	Graph,Mathematical optimization,Search algorithm,Computer science,Markov decision process,Exploit,Bellman equation,Artificial intelligence,Solver,Machine learning
DocType	Citations	PageRank
Conference	1	0.34
References	Authors
19	5

Authors (5 rows)

Cited by (1 rows)

References (19 rows)

Name	Order	Citations	PageRank
Joris Scharpff	1	15	2.67
Diederik M. Roijers	2	198	24.72
Frans A. Oliehoek	3	397	40.32
Matthijs T.J. Spaan	4	863	63.84
Mathijs Weerdt	5	411	41.00

1