Convergence of Recurrent Neuro-Fuzzy Value-Gradient Learning With and Without an Actor. - Citegraph

Paper Info

Title
Convergence of Recurrent Neuro-Fuzzy Value-Gradient Learning With and Without an Actor.

Abstract
In recent years, a gradient of the n-step temporal-difference [TD(λ)] learning has been developed to present an advanced adaptive dynamic programming (ADP) algorithm, called value-gradient learning [VGL(λ)]. In this paper, we improve the VGL(λ) architecture, which is called the “single adaptive actor network [SNVGL(λ)]” because it has only a single approximator function network (critic) instead of...

Year	DOI	Venue
2020	10.1109/TFUZZ.2019.2912349	IEEE Transactions on Fuzzy Systems
Keywords	DocType	Volume
Noise measurement,Convergence,Optimal control,Adaptive systems,Computer architecture,Dynamic programming,Mobile robots	Journal	28
Issue	ISSN	Citations
4	1063-6706	1
PageRank	References	Authors
0.36	9	2

Authors (2 rows)

Cited by (1 rows)

References (9 rows)

Name	Order	Citations	PageRank
Seaar Al-Dabooni	1	10	1.82
Wunsch II Donald C.	2	1354	91.73

1