Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs - Citegraph

Paper Info

Title
Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

Abstract
Multiagent Partially Observable Markov Decision Processes (Multiagent POMDPs) are a popular model of multiagent systems with uncertainty. Since the computational cost of finding an optimal joint policy is prohibitive, Joint Equilibrium-based Search for Policies with Nash Equilibrium (JESP-NE) has been proposed; it finds a locally optimal joint policy in which each policy is a best response to the other policies, i.e., the joint policy is a Nash equilibrium. One limitation of JESP-NE is that the quality of the obtained joint policy depends on the predefined default policy. More specifically, when finding a best response, if some observations have zero probability, JESP-NE uses this default policy. If the default policy is poor, JESP-NE tends to converge to a sub-optimal joint policy. In this paper, we propose a method (JESP-TPE) that finds a locally optimal joint policy based on a concept called Trembling-hand Perfect Equilibrium (TPE). In finding a TPE, we assume that an agent might make a mistake in selecting its action with small probability. Thus, an observation with zero probability in JESP-NE will have non-zero probability, and we no longer use the default policy. As a result, JESP-TPE can converge to a better joint policy than JESP-NE, which we confirm through experimental evaluations.
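
The trembling-hand idea in the abstract can be illustrated with a toy sketch. This is not the paper's algorithm: the two-action, two-observation model, the uniform mixing rule, and the names `tremble` and `observation_probs` are assumptions made only for illustration. It shows that once an agent's deterministic action choice is mixed with a small mistake probability, an observation that previously had zero probability becomes reachable.

```python
def tremble(action_probs, epsilon):
    """Mix an action distribution with uniform mistakes (hypothetical rule).

    Every action ends up with probability at least epsilon / n.
    """
    n = len(action_probs)
    return [(1 - epsilon) * p + epsilon / n for p in action_probs]


def observation_probs(action_probs, obs_model):
    """Marginal probability of each observation.

    obs_model[a][o] is an assumed toy table P(observation o | action a).
    """
    n_obs = len(obs_model[0])
    return [
        sum(p * obs_model[a][o] for a, p in enumerate(action_probs))
        for o in range(n_obs)
    ]


# Assumed toy model: action 0 always yields observation 0,
# action 1 always yields observation 1.
obs_model = [[1.0, 0.0], [0.0, 1.0]]

# Deterministic choice of action 0: observation 1 has zero probability.
pure = [1.0, 0.0]
pure_obs = observation_probs(pure, obs_model)

# With a small trembling probability, observation 1 becomes possible.
perturbed = tremble(pure, epsilon=0.1)
perturbed_obs = observation_probs(perturbed, obs_model)

print(pure_obs, perturbed_obs)
```

In this toy setting, a best response computed against the perturbed distribution has to specify what to do after every observation, which is the reason the abstract gives for no longer needing a default policy.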

Year	DOI	Venue
2007	10.1007/978-3-642-01639-4_2	Agent Computing and Multi-Agent Systems
Keywords	Field	DocType
nash equilibrium,multiagent systems,trembling-hand perfect equilibrium,predefined default policy,multiagent planning,best response,zero probability,partially observable markov decision process,multiagent pomdps,non-zero probability,joint policy,default policy,sub-optimal joint policy,optimal joint policy,small probability	Mathematical economics,Observable,Mistake,Partially observable Markov decision process,Computer science,Best response,Markov decision process,Multi-agent system,Artificial intelligence,Nash equilibrium,Distributed computing,Trembling hand perfect equilibrium	Conference
Volume	ISSN	Citations
5044	0302-9743	0
PageRank	References	Authors
0.34	8	3

Authors (3 rows)

Name	Order	Citations	PageRank
Yuichi Yabu	1	28	2.09
Makoto Yokoo	2	3632	421.99
Atsushi Iwasaki	3	292	31.81

Cited by (0 rows)

References (8 rows)
