Efficient solution algorithms for factored MDPs - Citegraph

Paper Info

Title
Efficient solution algorithms for factored MDPs

Abstract
This paper addresses the problem of planning under uncertainty in large Markov Decision Processes (MDPs). Factored MDPs represent a complex state space using state variables and the transition model using a dynamic Bayesian network. This representation often allows an exponential reduction in the representation size of structured MDPs, but the complexity of exact solution algorithms for such MDPs can grow exponentially in the representation size. In this paper, we present two approximate solution algorithms that exploit structure in factored MDPs. Both use an approximate value function represented as a linear combination of basis functions, where each basis function involves only a small subset of the domain variables. A key contribution of this paper is that it shows how the basic operations of both algorithms can be performed efficiently in closed form, by exploiting both additive and context-specific structure in a factored MDP. A central element of our algorithms is a novel linear program decomposition technique, analogous to variable elimination in Bayesian networks, which reduces an exponentially large LP to a provably equivalent, polynomial-sized one. One algorithm uses approximate linear programming, and the second approximate dynamic programming. Our dynamic programming algorithm is novel in that it uses an approximation based on max-norm, a technique that more directly minimizes the terms that appear in error bounds for approximate MDP algorithms. We provide experimental results on problems with over 1040 states, demonstrating a promising indication of the scalability of our approach, and compare our algorithm to an existing state-of-the-art approach, showing, in some problems, exponential gains in computation time.

Year	DOI	Venue
2011	10.1613/jair.1000	Journal of Artificial Intelligence Research
Keywords	DocType	Volume
factored mdps,efficient solution algorithm,approximate mdp algorithm,approximate solution algorithm,approximate linear programming,structured mdps,basis function,approximate value function,representation size,dynamic bayesian network,approximate dynamic programming,exact solution,markov decision process,linear program,artificial intelligent,state space,variable elimination,value function,bayesian network,dynamic programming algorithm	Journal	abs/1106.1822
Issue	ISSN	Citations
1	Journal Of Artificial Intelligence Research, Volume 19, pages 399-468, 2003	151
PageRank	References	Authors
7.21	26	4

Search Limit

100151

Authors (4 rows)

Cited by (100 rows)

References (26 rows)

Name	Order	Citations	PageRank
Carlos Guestrin	1	9220	488.92
Daphne Koller	2	14396	1561.19
Ronald Parr	3	2428	186.85
Shobha Venkataraman	4	1027	51.93

1