Title
Finite horizon stochastic optimal control of nonlinear two-player zero-sum games under communication constraint
Abstract
In this paper, the finite horizon stochastic optimal control of nonlinear two-player zero-sum games, referred to as Nonlinear Networked Control Systems (NNCS) two-player zero-sum game, between control and disturbance input players in the presence of unknown system dynamics and a communication network with delays and packet losses is addressed by using neuro dynamic programming (NDP). The overall objective being to find the optimal control input while maximizing the disturbance attenuation. First, a novel online neural network (NN) identifier is introduced to estimate the unknown control and disturbance coefficient matrices which are needed in the generation of optimal control input. Then, the critic and two actor NNs have been introduced to learn the time-varying solution to the Hamilton-Jacobi-Isaacs (HJI) equation and determine the stochastic optimal control and disturbance policies in a forward-in-time manner. Eventually, with the proposed novel NN weight update laws, Lyapunov theory is utilized to demonstrate that all closed-loop signals and NN weights are uniformly ultimately bounded (ÜUB) during the finite horizon with ultimate bounds being a function of initial conditions and final time. Further, the approximated control input and disturbance signals tend close to the saddle-point equilibrium within finite-time. Simulation results are included.
Year
DOI
Venue
2014
10.1109/IJCNN.2014.6889617
IJCNN
Keywords
Field
DocType
lyapunov theory,ndp,disturbance coefficient matrices,optimal control,hamilton-jacobi-isaacs equation,saddle-point equilibrium,stochastic systems,communication constraint,time-varying systems,unknown control matrix,time-varying solution,nonlinear networked control system,online neural network identifier,uniformly ultimately bounded,nonlinear two-player zero-sum games,stochastic optimal control,game theory,nonlinear control systems,dynamic programming,networked control systems,neuro dynamic programming,finite horizon stochastic optimal control,finite horizon
Dynamic programming,Lyapunov function,Mathematical optimization,Nonlinear system,Optimal control,Computer science,Control theory,Automatic control,Zero-sum game,Control system,Stochastic control
Conference
ISSN
Citations 
PageRank 
2161-4393
0
0.34
References 
Authors
5
2
Name
Order
Citations
PageRank
Hao Xu121414.63
Sarangapani Jagannathan2113694.89