Title
A Hybrid Heuristic Value Iteration Algorithm for POMDP
Abstract
Point-based value iteration methods are a class of effective algorithms for solving POMDP model. However, most of these algorithms explore the belief point set by single heuristic criterion, thus limit the effectiveness. A value iteration algorithm (HHVI) based on hybrid heuristic criteria for exploring belief points set is presented in the paper. HHVI maintains the upper and lower bounds on the value function, filters the belief points whose difference between upper and lower bounds on value function is less than the threshold, and explores the farthest belief point away from the explored point set. HHVI can improve the effect and efficiency of convergence by guaranteeing that the explored point set is effectively and fully distributed in the reachable belief space. Experiment results of four benchmarks show that HHVI can obtain better global optimal solution.
Year
DOI
Venue
2016
10.1109/ICTAI.2016.0054
2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI)
Keywords
Field
DocType
POMDP,HHVI,value function standard,density standard
Convergence (routing),Upper and lower bounds,Computer science,Markov decision process,Artificial intelligence,Approximation algorithm,Mathematical optimization,Heuristic,Algorithm design,Partially observable Markov decision process,Algorithm,Bellman equation,Machine learning
Conference
ISSN
ISBN
Citations 
1082-3409
978-1-5090-4460-3
1
PageRank 
References 
Authors
0.40
3
3
Name
Order
Citations
PageRank
Feng Liu123.51
Hua Xia265.24
Xin Jin320.81