Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems. - Citegraph

Paper Info

Title
Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems.

Abstract
In this article, the data-based two-player zero-sum game problem is considered for linear discrete-time systems. This problem theoretically depends on solving the discrete-time game algebraic Riccati equation (DTGARE), while it requires complete system dynamics. To avoid solving the DTGARE, the $Q$ -function is introduced and a data-based policy iteration $Q$ -learning (PIQL) algorithm is develo...

Year	DOI	Venue
2021	10.1109/TCYB.2020.2970969	IEEE Transactions on Cybernetics
Keywords	DocType	Volume
Games,Game theory,Gold,Heuristic algorithms,Discrete-time systems,Convergence,Dynamic programming	Journal	51
Issue	ISSN	Citations
7	2168-2267	7
PageRank	References	Authors
0.45	26	3

Authors (3 rows)

Cited by (7 rows)

References (26 rows)

Name	Order	Citations	PageRank
Biao Luo	1	554	23.80
Yin Yang	2	1003	52.10
Derong Liu	3	5457	286.88

1