Title
A pumping algorithm for ergodic stochastic mean payoff games with perfect information
Abstract
In this paper, we consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G=(V=VB∪VW∪VR, E), with local rewards $r: E \to {\mathbb R}$, and three types of vertices: black VB, white VW, and random VR. The game is played by two players, White and Black: When the play is at a white (black) vertex v, White (Black) selects an outgoing arc (v,u). When the play is at a random vertex v, a vertex u is picked with the given probability p(v,u). In all cases, Black pays White the value r(v,u). The play continues forever, and White aims to maximize (Black aims to minimize) the limiting mean (that is, average) payoff. It was recently shown in [7] that BWR-games are polynomially equivalent with the classical Gillette games, which include many well-known subclasses, such as cyclic games, simple stochastic games (SSG′s), stochastic parity games, and Markov decision processes. In this paper, we give a new algorithm for solving BWR-games in the ergodic case, that is when the optimal values do not depend on the initial position. Our algorithm solves a BWR-game by reducing it, using a potential transformation, to a canonical form in which the optimal strategies of both players and the value for every initial position are obvious, since a locally optimal move in it is optimal in the whole game. We show that this algorithm is pseudo-polynomial when the number of random nodes is constant. We also provide an almost matching lower bound on its running time, and show that this bound holds for a wider class of algorithms. Let us add that the general (non-ergodic) case is at least as hard as SSG′s, for which no pseudo-polynomial algorithm is known.
Year
DOI
Venue
2010
10.1007/978-3-642-13036-6_26
IPCO
Keywords
Field
DocType
white vw,optimal value,random vr,pseudo-polynomial algorithm,perfect information,new algorithm,black aim,initial position,optimal move,ergodic stochastic mean payoff,optimal strategy,black vb,lower bound,potential,markov decision process,canonical form
Upper and lower bounds,Markov decision process,Perfect information,Digraph,Discrete mathematics,Combinatorics,Mathematical optimization,Vertex (geometry),Ergodic theory,Algorithm,Canonical form,Mathematics,Stochastic game
Conference
Volume
ISSN
ISBN
6080
0302-9743
3-642-13035-6
Citations 
PageRank 
References 
13
1.27
18
Authors
4
Name
Order
Citations
PageRank
Endre Boros11779155.63
khaled elbassioni247335.78
Vladimir Gurvich368868.89
Kazuhisa Makino41088102.74