TD algorithm based on double-layer fuzzy partitioning

TD algorithm based on double-layer fuzzy partitioning

When dealing with the continuous space problems,the traditional Q-iteration algorithms based on lookup-table or function approximation converge slowly and are diff lt to get a continuous policy.To overcome the above weak-nesses,an on-policy TD algorithm named DFP-OPTD was proposed based on double-la...

Full description

Saved in:

Bibliographic Details
Main Authors:	Xiang MU, Quan LIU, Qi-ming FU, Hong-kun SUN, Xin ZHOU
Format:	Article
Language:	zho
Published:	Editorial Department of Journal on Communications 2013-10-01
Series:	Tongxin xuebao
Subjects:	reinforcement learning on-policy gradient descent double layer fuzzy partitioning continuous action policy
Online Access:	http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.10.011/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Optimization and Application of Fuzzy Neural Network
by: LI Hao-nan, et al.
Published: (2020-12-01)

Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism
by: Fei XIAO, et al.
Published: (2013-01-01)

Tracking Control of CSTRs Based on Improved OU Noise and the TD3 Algorithm
by: Hongyan Shi, et al.
Published: (2025-01-01)

Fuzzy clustering based on Forest optimization algorithm
by: Arash Chaghari, et al.
Published: (2018-01-01)

Double Critics and Double Actors Deep Deterministic Policy Gradient for Mobile Robot Navigation Using Adaptive Parameter Space Noise and Parallel Experience Replay
by: Wenjie Hu, et al.
Published: (2024-01-01)

SOME TYPES OF CONTIONUOUS FUNCTION VIA (r0, s1)-FUZZY αm- CLOSED SETS
by: Fatimah M. Mohammed, et al.
Published: (2018-08-01)

Generalized Inverse of Quadri-Partitioned Neutrosophic Fuzzy Matrices and its Application to Decision-Making Problems
by: R. Jaya, et al.
Published: (2025-07-01)

Partitioned Maclaurin symmetric mean operators in bipolar complex fuzzy sets for multiattribute decision making
by: Ubaid ur Rehman, et al.
Published: (2025-04-01)

Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization
by: Shengxiang Li, et al.
Published: (2021-01-01)

A Hierarchical Inverse Lithography Method Considering the Optimization and Manufacturability Limit by Gradient Descent
by: Haifeng Sun, et al.
Published: (2025-07-01)

Research of the Adaptive Fuzzy PID Control System of New Type Double Conical Continuously Variable Transmission
by: Li Jingkui, et al.
Published: (2016-01-01)

A Novel Approach for Differential Privacy-Preserving Federated Learning
by: Anis Elgabli, et al.
Published: (2025-01-01)

Algebraic Properties of Interval -Valued Quadri Partitioned Neutrosophic Fuzzy Matrices and their Application in Multi-Criteria Decision- Making Problem
by: P.Tharini, et al.
Published: (2025-04-01)

Online Three-Dimensional Fuzzy Multi-Output Support Vector Regression Learning Modeling for Complex Distributed Parameter Systems
by: Gang Zhou, et al.
Published: (2025-03-01)

Fuzzy Clustering Approaches Based on Numerical Optimizations of Modified Objective Functions
by: Erind Bedalli, et al.
Published: (2025-05-01)

Determinant Theory of Quadri-Partitioned Neutrosophic Fuzzy Matrices and its Application to Multi-Criteria Decision-Making Problems
by: M. Anandhkumar, et al.
Published: (2025-04-01)

Comparison of the efficiency of zero and first order minimization methods in neural networks
by: E. A. Gubareva, et al.
Published: (2022-12-01)

BIT*+TD3 Hybrid Algorithm for Energy-Efficient Path Planning of Unmanned Surface Vehicles in Complex Inland Waterways
by: Yunze Xie, et al.
Published: (2025-03-01)

Decision-making algorithm with complex hesitant fuzzy partitioned maclaurin symmetric mean aggregation operators and SWARA method
by: Jawad Ali, et al.
Published: (2025-05-01)

Stability of Back Propagation Training Algorithm for Neural Networks
by: Baghdad Science Journal
Published: (2012-12-01)

Design of Swarm Intelligence Control Based on Double-Layer Deep Reinforcement Learning
by: Xiangpei Yan, et al.
Published: (2025-04-01)

Enhancing cold storage efficiency: Continuous deep deterministic policy gradient approach to energy optimization utilizing strategic sensor input data
by: Jong-Whi Park, et al.
Published: (2025-04-01)

Hybrid Fuzzy–DDPG Approach for Efficient MPPT in Partially Shaded Photovoltaic Panels
by: Diana Ortiz-Munoz, et al.
Published: (2025-04-01)

Nanogenerators via dynamic regulation of electrical double layer
by: Xiang Li, et al.
Published: (2024-12-01)

Robust Algorithm for Calculating the Alignment of Guide Rolls in Slab Continuous Casting Machines
by: Robert Rosenthal, et al.
Published: (2025-07-01)

Intelligent maneuver decision-making for UAVs using the TD3–LSTM reinforcement learning algorithm under uncertain information
by: Tongle Zhou, et al.
Published: (2025-08-01)

Congruences modulo $4$ for the number of $3$-regular partitions
by: Ballantine, Cristina, et al.
Published: (2023-11-01)

Variable-Parameter Impedance Control of Manipulator Based on RBFNN and Gradient Descent
by: Linshen Li, et al.
Published: (2024-12-01)

A Reinforcement Learning-Based Double Layer Controller for Mobile Robot in Human-Shared Environments
by: Jian Mi, et al.
Published: (2025-07-01)

Refined Minimization of Trapezoidal Fuzzy Quadratic Function: A Fuzzy-Parametric Steepest Descent
by: Shalini K, et al.
Published: (2025-07-01)

Fault classification of meta-action unit using CEEMDAN double-layer decomposition and COA-SVM
by: Anxiang Guo, et al.
Published: (2025-12-01)

Actor-critic algorithm with incremental dual natural policy gradient
by: Peng ZHANG, et al.
Published: (2017-04-01)

The Mixed Partition Dimension: A New Resolvability Parameter in Graph Theory
by: Siti Norziahidayu Amzee Zamri, et al.
Published: (2025-01-01)

IMPLEMENTATION OF FUZZY C-MEANS AND FUZZY POSSIBILISTIC C-MEANS ALGORITHMS ON POVERTY DATA IN INDONESIA
by: Dian Kurniasari, et al.
Published: (2024-07-01)

LQR and Fuzzy-PID Control Design on Double Inverted Pendulum
by: Erlyana Trie Damayanti, et al.
Published: (2024-05-01)

Tight analyses for subgradient descent I: Lower bounds
by: Harvey, Nicholas J. A., et al.
Published: (2024-07-01)

On congruence properties of the partition function
by: Jayce Getz
Published: (2000-01-01)

Uncertainty-Aware Earthquake Forecasting Using a Bayesian Neural Network with Elastic Weight Consolidation
by: Changchun Liu, et al.
Published: (2025-08-01)

Onto Proximality in Non Negative Matrix Factorization for Recommender Systems
by: Rachana Mehta, et al.
Published: (2025-01-01)

SGD-TripleQNet: An Integrated Deep Reinforcement Learning Model for Vehicle Lane-Change Decision
by: Yang Liu, et al.
Published: (2025-01-01)