Title | ||
---|---|---|
Sensitivity-based nested partitions for solving finite-horizon Markov decision processes. |
Abstract | ||
---|---|---|
In this paper, we propose a heuristic for solving finite-horizon Markov decision processes. The heuristic uses the nested partitions (NP) framework to guide an iterative search for the optimal policy. NP focuses the search on certain promising subregions, flexibly determined by the sampling weight of each action branch. Within each subregion, an effective local policy optimization is developed using sensitivity-based approach, which optimizes the sampling weights based on estimated gradient information. Numerical results show the effectiveness of the proposed heuristic. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1016/j.orl.2017.07.006 | Operations Research Letters |
Keywords | Field | DocType |
Approximate dynamic programming,Markov decision processes,Nested partitions,Sensitivity-based approach | Mathematical optimization,Heuristic,Combinatorics,Iterative search,Markov decision process,Sampling (statistics),Finite horizon,Mathematics | Journal |
Volume | Issue | ISSN |
45 | 5 | 0167-6377 |
Citations | PageRank | References |
0 | 0.34 | 7 |
Authors | ||
1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Weiwei Chen | 1 | 125 | 12.21 |