Bayesian Q learning method with Dyna architecture and prioritized sweeping

Bayesian Q learning method with Dyna architecture and prioritized sweeping

In order to balance this trade-off, a probability distribution was used in Bayesian Q learning method to de-scribe the uncertainty of the Q value and choose actions with this distribution. But the slow convergence is a big problem for Bayesian Q-Learning. In allusion to the above problems, a novel B...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jun YU, Quan LIU, Qi-ming FU, Hong-kun SUN, Gui-xing CHEN
Format:	Article
Language:	zho
Published:	Editorial Department of Journal on Communications 2013-11-01
Series:	Tongxin xuebao
Subjects:	reinforcement learning Markov decision process prioritized sweeping Dyna architecture Bayesian Q learning
Online Access:	http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.11.015/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Bayesian Q-learning in multi-objective reward model for homophobic and transphobic text classification in low-resource languages: A hypothesis testing framework in multi-objective setting
by: Vivek Suresh Raj, et al.
Published: (2025-06-01)

Advanced Cooperative Formation Control in Variable-Sweep Wing UAVs via the MADDPG–VSC Algorithm
by: Zhengyang Cao, et al.
Published: (2024-10-01)

Evidence for sweep signatures in antibiotic-resistant strains in three species of bacteria
by: Anjani Pradhananga, et al.
Published: (2024-10-01)

Microscopic Experiments to Assess the Macroscopic Sweep Characteristics of Carbon Dioxide Flooding
by: Rujun Wang, et al.
Published: (2024-10-01)

Attention Transfer Reinforcement Learning for Test Case Prioritization in Continuous Integration
by: Qingran Su, et al.
Published: (2025-02-01)

Formability Studies on Magnesium Based AZ31B Alloy Sheet in LS Dyna Program Code
by: B. Viswanadhapalli, et al.
Published: (2025-03-01)

Optimization of kinematic parameters of continuous miner based on LS-DYNA simulation analysis and NSGA-II algorithm
by: Xunan Liu, et al.
Published: (2025-06-01)

Making virtual learning environment more intelligent: application of Markov decision process
by: Dalia Baziukaitė
Published: (2004-12-01)

Intelligent Robot in Unknown Environments: Walk Path Using Q-Learning and Deep Q-Learning
by: Mouna El Wafi, et al.
Published: (2025-03-01)

Bayesian Reinforcement Learning for Adaptive Balancing in an Assembly Line With Human-Robot Collaboration
by: Hyun-Rok Lee, et al.
Published: (2024-01-01)

XSQ-Learning: Adaptive Similarity Thresholds for Accelerated and Stable Q-Learning
by: Ansel Y. Rodríguez González, et al.
Published: (2025-06-01)

Effects of Leading Edge Sweep on the Cavitating Characteristics of Inducer Pumps
by: Allan J. Acosta, et al.
Published: (2001-01-01)

Selective sweep and GWAS provide insights into adaptive variation of Populus cathayana leaves
by: Xinglu Zhou, et al.
Published: (2024-01-01)

Bayesian curriculum generation in sparse reward reinforcement learning environments
by: Onur Akgün, et al.
Published: (2025-06-01)

Transmission scheduling scheme based on deep Q learning in wireless network
by: Jiang ZHU, et al.
Published: (2018-04-01)

اثر تدريبات بشدة سرعة المنافسة باستخدام جهاز(DYNA FOOT) لتطوير تحمل السرعة وبعض المتغيرات البيوكينماتيكية وانجاز 400 متر حواجز
by: Israa Kamil Hasan, et al.
Published: (2023-09-01)

q-Rung orthopair fuzzy 2-tuple linguistic WASPAS algorithm for patients’ prioritization based on prioritized Maclaurin symmetric mean aggregation operators
by: Fatima Abbas, et al.
Published: (2024-05-01)

Design of an iterative method for disease prediction in finger millet leaves using graph networks, dyna networks, autoencoders, and recurrent neural networks
by: Shailendra Tiwari, et al.
Published: (2024-12-01)

Research of the Fast Modeling for Spiral Bevel Gear with Spherical Involute based on the SWEEP
by: Li Tongzhong, et al.
Published: (2015-01-01)

Reinforcement Learning-Based Augmentation of Data Collection for Bayesian Optimization Towards Radiation Survey and Source Localization
by: Jeremy Marquardt, et al.
Published: (2025-04-01)

Detection of Transformer Faults: AI-Supported Machine Learning Application in Sweep Frequency Response Analysis
by: Hakan Çuhadaroğlu, et al.
Published: (2025-05-01)

Preliminary study on the connotation of flexibility in dynamically reconfigurable networks
by: Dong-nian CHENG, et al.
Published: (2012-08-01)

Research of the Characteristic of the Working Mode of Hydraulic Machinery Compound Transmission Sweeping Vehicle
by: Fuyi Cao, et al.
Published: (2019-02-01)

Genetic Diversity Estimation and Genome-Wide Selective Sweep Analysis of the Bazhou Yak
by: Baigao Yang, et al.
Published: (2025-03-01)

SASDL and RBATQ: Sparse Autoencoder With Swarm Based Deep Learning and Reinforcement Based Q-Learning for EEG Classification
by: Sunil Kumar Prabhakar, et al.
Published: (2022-01-01)

Q-learning global path planning for UAV navigation with pondered priorities
by: Kevin B. de Carvalho, et al.
Published: (2025-03-01)

A survey of neural architecture search
by: Mingjie HE, et al.
Published: (2019-05-01)

Fuzzy Inference System for Modeling the Contribution of the Sweep Mechanism in the Coagulation Process for Water Treatment
by: D. G. Marques, et al.
Published: (2025-07-01)

A deep q-networks model for optimising decision-making process in the context of energy transition modelling
by: Ana Tănăsescu, et al.
Published: (2025-01-01)

Introducing B-Sweep: An Innovative Bird-Repelling Device Powered by Solar Cells and Sound Waves, Efficiently Protecting Against Bird Strikes in Airport Airsides
by: Muhammad Rafli Fazal, et al.
Published: (2025-06-01)

An Intelligent Method for C++ Test Case Synthesis Based on a Q-Learning Agent
by: Serhii Semenov, et al.
Published: (2025-08-01)

Analysis of anomalous behaviour in network systems using deep reinforcement learning with convolutional neural network architecture
by: Mohammad Hossein Modirrousta, et al.
Published: (2024-12-01)

Numerical Study of the Effect of Winglets with Multiple Sweep Angles on Wind Turbine Blade Performance
by: Bayu K. Wardhana, et al.
Published: (2025-03-01)

Design and Analysis of an MPC-PID-Based Double-Loop Trajectory Tracking Algorithm for Intelligent Sweeping Vehicles
by: Zhijun Guo, et al.
Published: (2025-04-01)

Computational Fluid Dynamics Modeling of Sweep Gas Flow Rate-Dependent Carbon Dioxide Removal in Oxygenators
by: Keira Askew, et al.
Published: (2025-06-01)

CDBR: A semi-automated collaborative execute-before-after dependency-based requirement prioritization approach
by: Ankita Gupta, et al.
Published: (2022-02-01)

Using reinforcement learning in genome assembly: in-depth analysis of a Q-learning assembler
by: Kleber Padovani, et al.
Published: (2025-08-01)

Analysis of the Effect of Cut Sweep Ratio of Lily Impeller on the Distribution of Dissolved Oxygen
by: Mohammad Tauviqirrahman, et al.
Published: (2024-12-01)

Stationary Bayesian–Markov Equilibria in Bayesian Stochastic Games with Periodic Revelation
by: Eunmi Ko
Published: (2024-09-01)

AI-Enabled Condition Monitoring Framework for Autonomous Pavement-Sweeping Robots
by: Sathian Pookkuttath, et al.
Published: (2025-07-01)