Simultaneous policy update algorithms for learning the solution of linear continuous-time H∞ state feedback control - Citegraph

Paper Info

Title
Simultaneous policy update algorithms for learning the solution of linear continuous-time H∞ state feedback control

Abstract
It is well known that the H"~ state feedback control problem can be viewed as a two-player zero-sum game and reduced to find a solution of the algebra Riccati equation (ARE). In this paper, we propose a simultaneous policy update algorithm (SPUA) for solving the ARE, and develop offline and online versions. The offline SPUA is a model-based approach, which obtains the solution of the ARE by solving a sequence of Lyapunov equations (LEs). Its convergence is established rigorously by constructing a Newton's sequence for the fixed point equation. The online SPUA is a partially model-free approach, which takes advantage of the thought of reinforcement learning (RL) to learn the solution of the ARE online without requiring the internal system dynamics, wherein both players update their action policies simultaneously. The convergence of the online SPUA is proved by demonstrating that it is mathematically equivalent to the offline SPUA. Finally, by conducting comparative simulation studies on an F-16 aircraft plant and a power system, the results show that both the offline SPUA and the online SPUA can find the solution of the ARE, and achieve much better convergence than the existing methods.

Year	DOI	Venue
2013	10.1016/j.ins.2012.08.012	Inf. Sci.
Keywords	Field	DocType
online spua,offline spua,better convergence,algebra riccati equation,internal system dynamic,model-free approach,state feedback control,lyapunov equation,linear continuous-time h,model-based approach,online version,fixed point equation,simultaneous policy update algorithm	Convergence (routing),Lyapunov equation,Computer science,Control theory,System dynamics,Artificial intelligence,Reinforcement learning,Lyapunov function,Mathematical optimization,Algorithm,Electric power system,Riccati equation,Fixed point equation,Machine learning	Journal
Volume	ISSN	Citations
222,	0020-0255	19
PageRank	References	Authors
0.77	15	2

Authors (2 rows)

Cited by (19 rows)

References (15 rows)

Name	Order	Citations	PageRank
Huai-Ning Wu	1	2104	98.52
Biao Luo	2	554	23.80

1