TD algorithm based on double-layer fuzzy partitioning
When dealing with the continuous space problems,the traditional Q-iteration algorithms based on lookup-table or function approximation converge slowly and are diff lt to get a continuous policy.To overcome the above weak-nesses,an on-policy TD algorithm named DFP-OPTD was proposed based on double-la...
Saved in:
| Main Authors: | Xiang MU, Quan LIU, Qi-ming FU, Hong-kun SUN, Xin ZHOU |
|---|---|
| Format: | Article |
| Language: | zho |
| Published: |
Editorial Department of Journal on Communications
2013-10-01
|
| Series: | Tongxin xuebao |
| Subjects: | |
| Online Access: | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.10.011/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Optimization and Application of Fuzzy Neural Network
by: LI Hao-nan, et al.
Published: (2020-12-01) -
Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism
by: Fei XIAO, et al.
Published: (2013-01-01) -
Tracking Control of CSTRs Based on Improved OU Noise and the TD3 Algorithm
by: Hongyan Shi, et al.
Published: (2025-01-01) -
Fuzzy clustering based on Forest optimization algorithm
by: Arash Chaghari, et al.
Published: (2018-01-01) -
Double Critics and Double Actors Deep Deterministic Policy Gradient for Mobile Robot Navigation Using Adaptive Parameter Space Noise and Parallel Experience Replay
by: Wenjie Hu, et al.
Published: (2024-01-01)