Title
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control.
Abstract
The model-free optimal control problem of general discrete-time nonlinear systems is considered in this paper, and a data-based policy gradient adaptive dynamic programming (PGADP) algorithm is developed to design an adaptive optimal controller method. By using offline and online data rather than the mathematical system model, the PGADP algorithm improves control policy with a gradient descent sch...
Year
DOI
Venue
2017
10.1109/TCYB.2016.2623859
IEEE Transactions on Cybernetics
Keywords
Field
DocType
Optimal control,Mathematical model,Autoregressive processes,Heuristic algorithms,Nonlinear systems,Cost function,Algorithm design and analysis
Convergence (routing),Dynamic programming,Gradient descent,Control theory,Mathematical optimization,Algorithm design,Nonlinear system,Optimal control,Computer science,Adaptive control
Journal
Volume
Issue
ISSN
47
10
2168-2267
Citations 
PageRank 
References 
37
0.90
47
Authors
5
Name
Order
Citations
PageRank
Biao Luo155423.80
Derong Liu25457286.88
Huai-Ning Wu3210498.52
Ding Wang4187068.16
FRANK L. LEWIS55782402.68