Title
Convergence of Recurrent Neuro-Fuzzy Value-Gradient Learning With and Without an Actor.
Abstract
In recent years, a gradient of the n-step temporal-difference [TD(λ)] learning has been developed to present an advanced adaptive dynamic programming (ADP) algorithm, called value-gradient learning [VGL(λ)]. In this paper, we improve the VGL(λ) architecture, which is called the “single adaptive actor network [SNVGL(λ)]” because it has only a single approximator function network (critic) instead of...
Year
DOI
Venue
2020
10.1109/TFUZZ.2019.2912349
IEEE Transactions on Fuzzy Systems
Keywords
DocType
Volume
Noise measurement,Convergence,Optimal control,Adaptive systems,Computer architecture,Dynamic programming,Mobile robots
Journal
28
Issue
ISSN
Citations 
4
1063-6706
1
PageRank 
References 
Authors
0.36
9
2
Name
Order
Citations
PageRank
Seaar Al-Dabooni1101.82
Wunsch II Donald C.2135491.73