Title | ||
---|---|---|
Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems. |
Abstract | ||
---|---|---|
In this paper, the finite-horizon optimal control design for nonlinear discrete-time systems in affine form is presented. In contrast with the traditional approximate dynamic programming methodology, which requires at least partial knowledge of the system dynamics, in this paper, the complete system dynamics are relaxed utilizing a neural network (NN)-based identifier to learn the control coefficient matrix. The identifier is then used together with the actor-critic-based scheme to learn the time-varying solution, referred to as the value function, of the Hamilton-Jacobi-Bellman (HJB) equation in an online and forward-in-time manner. Since the solution of HJB is time-varying, NNs with constant weights and time-varying activation functions are considered. To properly satisfy the terminal constraint, an additional error term is incorporated in the novel update law such that the terminal constraint error is also minimized over time. Policy and/or value iterations are not needed and the NN weights are updated once a sampling instant. The uniform ultimate boundedness of the closed-loop system is verified by standard Lyapunov stability theory under nonautonomous analysis. Numerical examples are provided to illustrate the effectiveness of the proposed method. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/TNNLS.2014.2315646 | IEEE Trans. Neural Netw. Learning Syst. |
Keywords | Field | DocType |
uncertain systems,optimal control,actor-critic-based scheme,neurocontrollers,sampling instant,control system synthesis,matrix algebra,hamilton???jacobi???bellman (hjb) equation,time-varying solution,neural network (nn),hamilton-jacobi-bellman (hjb) equation,closed-loop system,finite-horizon optimal control design,neural network-based identifier,nonlinear control systems,finite-horizon,value function,optimal control.,terminal constraint error,discrete time systems,uniform ultimate boundedness,uncertain affine nonlinear discrete-time systems,nn,sampling methods,hjb,dynamic programming,stability,neural network-based finite-horizon optimal control,approximate dynamic programming methodology,nonautonomous analysis,hamilton-jacobi-bellman equation,closed loop systems,standard lyapunov stability theory,time-varying activation functions,lyapunov methods,control coefficient matrix,artificial neural networks,hamilton jacobi bellman equation | Affine transformation,Hamilton–Jacobi–Bellman equation,Mathematical optimization,Optimal control,Nonlinear system,Computer science,Control theory,Lyapunov stability,Bellman equation,System dynamics,Discrete time and continuous time | Journal |
Volume | Issue | ISSN |
26 | 3 | 2162-2388 |
Citations | PageRank | References |
19 | 0.68 | 7 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Qiming Zhao | 1 | 31 | 3.26 |
Hao Xu | 2 | 214 | 14.63 |
Sarangapani Jagannathan | 3 | 1136 | 94.89 |