Title |
---|
Convergence of Recurrent Neuro-Fuzzy Value-Gradient Learning With and Without an Actor. |
Abstract |
---|
In recent years, a gradient-based form of n-step temporal-difference learning [TD(λ)] has been developed into an advanced adaptive dynamic programming (ADP) algorithm called value-gradient learning [VGL(λ)]. In this paper, we improve the VGL(λ) architecture; the improved version is called the “single adaptive actor network [SNVGL(λ)]” because it has only a single approximator function network (critic) instead of... |
Year | DOI | Venue |
---|---|---|
2020 | 10.1109/TFUZZ.2019.2912349 | IEEE Transactions on Fuzzy Systems |
Keywords | DocType | Volume |
---|---|---|
Noise measurement, Convergence, Optimal control, Adaptive systems, Computer architecture, Dynamic programming, Mobile robots | Journal | 28 |
Issue | ISSN | Citations |
---|---|---|
4 | 1063-6706 | 1 |
PageRank | References | Authors |
---|---|---|
0.36 | 9 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Seaar Al-Dabooni | 1 | 10 | 1.82 |
Donald C. Wunsch II | 2 | 1354 | 91.73 |