Title
Policy iteration approximate dynamic programming using Volterra series based actor
Abstract
There is an extensive literature on value function approximation for approximate dynamic programming (ADP). Multilayer perceptrons (MLPs) and radial basis functions (RBFs), among others, are typical approximators for value functions in ADP. Similar approaches have been taken for policy approximation. In this paper, we propose a new Volterra series based structure for actor approximation in ADP. The Volterra approx-imator is linear in parameters with global optima attainable. Given the proposed approximator structures, we further develop a policy iteration framework under which a gradient descent training algorithm for obtaining the optimal Volterra kernels can be obtained. Associated with this ADP design, we provide a sufficient condition based on actor approximation error to guarantee convergence of the value function iterations. A finite bound of the final convergent value function is also given. Finally, by using a simulation example we illustrate the effectiveness of the proposed Volterra actor for optimal control of a nonlinear system.
Year
DOI
Venue
2014
10.1109/IJCNN.2014.6889865
IJCNN
Keywords
Field
DocType
radial basis function networks,policy iteration approximate dynamic programming,optimal control,volterra series based structure,radial basis functions,volterra approximator,volterra series based actor,policy approximation,adp design,volterra actor,multilayer perceptrons,rbf,gradient descent training algorithm,nonlinear system,value function iterations,mlp,optimal volterra kernels,finite bound,dynamic programming,actor approximation error,volterra series,value function approximation,approximator structures,policy iteration framework,kernel,approximation algorithms,approximation error,function approximation,convergence
Convergence (routing),Dynamic programming,Gradient descent,Mathematical optimization,Radial basis function,Optimal control,Computer science,Bellman equation,Volterra series,Artificial intelligence,Machine learning,Approximation error
Conference
ISSN
Citations 
PageRank 
2161-4393
3
0.38
References 
Authors
20
4
Name
Order
Citations
PageRank
Wentao Guo1544.60
Jennie Si274670.23
Feng Liu326928.35
Shengwei Mei419634.27