Bayesian Q learning method with Dyna architecture and prioritized sweeping
To balance this trade-off, the Bayesian Q-learning method uses a probability distribution to describe the uncertainty of the Q value and selects actions according to that distribution. Slow convergence, however, is a major problem for Bayesian Q-learning. To address these problems, a novel B...
| Main Authors: | Jun YU, Quan LIU, Qi-ming FU, Hong-kun SUN, Gui-xing CHEN |
|---|---|
| Format: | Article |
| Language: | Chinese (zho) |
| Published: | Editorial Department of Journal on Communications, 2013-11-01 |
| Series: | Tongxin xuebao |
| Subjects: | |
| Online Access: | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.11.015/ |
Similar Items
- Bayesian Q-learning in multi-objective reward model for homophobic and transphobic text classification in low-resource languages: A hypothesis testing framework in multi-objective setting
  by: Vivek Suresh Raj, et al.
  Published: (2025-06-01)
- Advanced Cooperative Formation Control in Variable-Sweep Wing UAVs via the MADDPG–VSC Algorithm
  by: Zhengyang Cao, et al.
  Published: (2024-10-01)
- Evidence for sweep signatures in antibiotic-resistant strains in three species of bacteria
  by: Anjani Pradhananga, et al.
  Published: (2024-10-01)
- Microscopic Experiments to Assess the Macroscopic Sweep Characteristics of Carbon Dioxide Flooding
  by: Rujun Wang, et al.
  Published: (2024-10-01)
- Attention Transfer Reinforcement Learning for Test Case Prioritization in Continuous Integration
  by: Qingran Su, et al.
  Published: (2025-02-01)