Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis
Abstract Although deep reinforcement learning has achieved notable practical achievements, its theoretical foundations have been scarcely explored until recent times. Nonetheless, the rate of convergence for current neural temporal-difference (TD) learning algorithms is constrained, largely due to t...
Saved in:
Main Authors: | Guoyong Wang, Tiange Fu, Ruijuan Zheng, Xuhui Zhao, Junlong Zhu, Mingchuan Zhang |
---|---|
Format: | Article |
Language: | English |
Published: |
Springer
2025-01-01
|
Series: | Complex & Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1007/s40747-024-01757-w |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Boundedness and asymptotic behavior of solutions of a forced difference equation
by: John R. Graef, et al.
Published: (1994-01-01) -
Enhancing navigation performance in unknown environments using spiking neural networks and reinforcement learning with asymptotic gradient method
by: Xiaode Liu, et al.
Published: (2025-01-01) -
Enhanced deep deterministic policy gradient algorithm
by: Jianping CHEN, et al.
Published: (2018-11-01) -
Approximations of Apostol-Tangent Polynomials of Complex Order with Parameters a, b, and c
by: Cristina B. Corcino, et al.
Published: (2024-12-01) -
An embedding of Schwartz distributions in the algebra of asymptotic functions
by: Michael Oberguggenberger, et al.
Published: (1998-01-01)