Policy iteration approximate dynamic programming using Volterra series based actor - Citegraph

Paper Info

Title
Policy iteration approximate dynamic programming using Volterra series based actor

Abstract
There is an extensive literature on value function approximation for approximate dynamic programming (ADP). Multilayer perceptrons (MLPs) and radial basis functions (RBFs), among others, are typical approximators for value functions in ADP. Similar approaches have been taken for policy approximation. In this paper, we propose a new Volterra series based structure for actor approximation in ADP. The Volterra approx-imator is linear in parameters with global optima attainable. Given the proposed approximator structures, we further develop a policy iteration framework under which a gradient descent training algorithm for obtaining the optimal Volterra kernels can be obtained. Associated with this ADP design, we provide a sufficient condition based on actor approximation error to guarantee convergence of the value function iterations. A finite bound of the final convergent value function is also given. Finally, by using a simulation example we illustrate the effectiveness of the proposed Volterra actor for optimal control of a nonlinear system.

Year	DOI	Venue
2014	10.1109/IJCNN.2014.6889865	IJCNN
Keywords	Field	DocType
radial basis function networks,policy iteration approximate dynamic programming,optimal control,volterra series based structure,radial basis functions,volterra approximator,volterra series based actor,policy approximation,adp design,volterra actor,multilayer perceptrons,rbf,gradient descent training algorithm,nonlinear system,value function iterations,mlp,optimal volterra kernels,finite bound,dynamic programming,actor approximation error,volterra series,value function approximation,approximator structures,policy iteration framework,kernel,approximation algorithms,approximation error,function approximation,convergence	Convergence (routing),Dynamic programming,Gradient descent,Mathematical optimization,Radial basis function,Optimal control,Computer science,Bellman equation,Volterra series,Artificial intelligence,Machine learning,Approximation error	Conference
ISSN	Citations	PageRank
2161-4393	3	0.38
References	Authors
20	4

Authors (4 rows)

Cited by (3 rows)

References (20 rows)

Name	Order	Citations	PageRank
Wentao Guo	1	54	4.60
Jennie Si	2	746	70.23
Feng Liu	3	269	28.35
Shengwei Mei	4	196	34.27

1