Abstract | ||
---|---|---|
Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint policy is prohibitive, a Joint Equilibrium-based Search for Policies with Nash Equilibrium (JESP-NE) is proposed that finds a locally optimal joint policy in which each policy is a best response to other policies; i.e., the joint policy is a Nash equilibrium. One limitation of JESP-NE is that the quality of the obtained joint policy depends on the predefined default policy . More specifically, when finding a best response, if some observation have zero probabilities, JESP-NE uses this default policy. If the default policy is quite bad, JESP-NE tends to converge to a sub-optimal joint policy. In this paper, we propose a method that finds a locally optimal joint policy based on a concept called Trembling-hand Perfect Equilibrium (TPE). In finding a TPE, we assume that an agent might make a mistake in selecting its action with small probability. Thus, an observation with zero probability in JESP-NE will have non-zero probability. We no longer use the default policy. As a result, JESP-TPE can converge to a better joint policy than the JESP-NE, which we confirm this fact by experimental evaluations. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1007/978-3-642-01639-4_2 | Agent Computing and Multi-Agent Systems |
Keywords | Field | DocType |
nash equilibrium,multiagent systems,trembling-hand perfect equilibrium,predefined default policy,multiagent planning,best response,zero probability,partially observable markov decision process,multiagent pomdps,non-zero probability,joint policy,default policy,sub-optimal joint policy,optimal joint policy,small probability | Mathematical economics,Observable,Mistake,Partially observable Markov decision process,Computer science,Best response,Markov decision process,Multi-agent system,Artificial intelligence,Nash equilibrium,Distributed computing,Trembling hand perfect equilibrium | Conference |
Volume | ISSN | Citations |
5044 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yuichi Yabu | 1 | 28 | 2.09 |
Makoto Yokoo | 2 | 3632 | 421.99 |
Atsushi Iwasaki | 3 | 292 | 31.81 |