A study of value iteration and policy iteration for Markov decision processes in Deterministic systems

A study of value iteration and policy iteration for Markov decision processes in Deterministic systems

In the context of deterministic discrete-time control systems, we examined the implementation of value iteration (VI) and policy (PI) algorithms in Markov decision processes (MDPs) situated within Borel spaces. The deterministic nature of the system's transfer function plays a pivotal role, as...

Full description

Saved in:

Bibliographic Details
Main Authors:	Haifeng Zheng, Dan Wang
Format:	Article
Language:	English
Published:	AIMS Press 2024-11-01
Series:	AIMS Mathematics
Subjects:	markov decision processes deterministic system value iteration policy iteration average cost criterion
Online Access:	https://www.aimspress.com/article/doi/10.3934/math.20241613
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SOLPS-ITER simulation of W limiter start-up on ITER
by: Y. Zhang, et al.
Published: (2025-01-01)

A new preconditioned Richardson iterative method
by: Hassan Jamali, et al.
Published: (2024-12-01)

SOLPS-ITER simulations of the ITER divertor with improved plasma-facing component geometry
by: A.A. Pshenov, et al.
Published: (2025-03-01)

Fixed Point Iteration Method
by: Mehmet Karakas
Published: (2013-05-01)

Conclusive benchmark of SOLPS-ITER against the SOLPS4.3 ITER divertor design reference
by: S. Wiesen, et al.
Published: (2025-01-01)

THE PROGRAM ITERATIONS METHOD IN GAME PROBLEM OF GUIDANCE AND SET-VALUED QUASISTRATEGIES
by: Alexander G. Chentsov
Published: (2016-07-01)

APPLICATION OF ITERATIVE DYNAMIC PROGRAMMING TO OPTIMAL FEED-BACK CONTROL PROBLEM
by: A. V. Panteleev, et al.
Published: (2016-12-01)

Iterative Dissipativity of Partial Difference Equation Dynamics in Open-Loop Iterative Learning Control Mode
by: Tengfei Xiao
Published: (2024-10-01)

A study on the growth of generalist iterated entire functions
by: Ratan Kumar Dutta
Published: (2020-12-01)

On the speed of convergence of iteration of a function
by: Vladimir Drobot
Published: (1994-01-01)

Convergence and Stability of the Ishikawa Iterative Process for a class of ϕ-quasinonexpansive Mappings
by: F. D. Ajibade, et al.
Published: (2022-08-01)

Low-complexity FTN receivers based on frequency domain iterative decision feedback equalization
by: Juan ZENG, et al.
Published: (2017-04-01)

AN ADAPTED ANALYTICAL SOLUTION TO THE MULHOLLAND EQUATION: MODIFIED DIRECT ITERATION PROCEDURE
by: Sabrina Sultana, et al.
Published: (2025-02-01)

Notes on Iterative Summation of Alternating Factorials
by: Vladimir Kanovei, et al.
Published: (2025-06-01)

Pseudo-Target Optimization Strategy Based on Policy Iteration Algorithm
by: Yiming Meng, et al.
Published: (2025-01-01)

Acceptance testing of a CT scanner with a knowledge-based iterative reconstruction algorithm
by: Maryangel Jhoseline Medina, et al.
Published: (2019-01-01)

An overview of iterative methods based on orthogonal projections
by: Touraj Nikazad, et al.
Published: (2025-06-01)

EVALUATION OF ITERATIVE ALGORITHMS FOR TOMOGRAPHY IMAGE RECONSTRUCTION
by: Alexandre F. Velo, et al.
Published: (2019-02-01)

Image skeletonization based on combination of one- and two-sub-iterations models
by: J. Ma, et al.
Published: (2020-06-01)

Structural Properties of Optimal Maintenance Policies for <i>k</i>-out-of-<i>n</i> Systems with Interdependence Between Internal Deterioration and External Shocks
by: Mizuki Kasuya, et al.
Published: (2025-02-01)

Iterative solution of negative exponent Emden-Fowler problems
by: C. D. Luning, et al.
Published: (1990-01-01)

Iterative Image Interpolation vs. Traditional Interpolation Methods
by: Samia Lazar
Published: (2011-07-01)

Initial design concepts for solid boron injection in ITER
by: J.A. Snipes, et al.
Published: (2024-12-01)

Compression Image by Using Iterated Function Systems
by: Basil Al-khayat, et al.
Published: (2012-07-01)

Iterative approach for photonic crystal devices design
by: Pavel V. Mokshin, et al.
Published: (2024-10-01)

Iterative multistage adaptive Rake receiver for CDMA wireless system
by: YE Li-bing, et al.
Published: (2005-01-01)

Iterative multistage adaptive Rake receiver for CDMA wireless system
by: YE Li-bing, et al.
Published: (2005-01-01)

The iterative properties of solutions for a singular k-Hessian system
by: Xinguang Zhang, et al.
Published: (2023-12-01)

Compact Bistatic Iterative Passive Radar Based on Terrestrial Digital Video Broadcasting Signals
by: Víctor P. Gil Jiménez, et al.
Published: (2025-03-01)

Iterative Schemes of Mean Nonexpansive Mapping
by: CUI Yunan, et al.
Published: (2021-02-01)

Detective Gadget: Generic Iterative Entity Resolution over Dirty Data
by: Marcello Buoncristiano, et al.
Published: (2024-11-01)

A tutorial review of policy iteration methods in reinforcement learning for nonlinear optimal control
by: Yujia Wang, et al.
Published: (2025-06-01)

Stochastic List Generator for Iterative MIMO Detection
by: Stephen N. Jenkins, et al.
Published: (2025-01-01)

Approximating the zeros of accretive operators by the Ishikawa iteration process
by: Zhou Haiyun, et al.
Published: (1996-01-01)

Fixed point approximation of contractive-like mappings using a stable iterative family and its dynamics via quadratic polynomials
by: Munish Kansal, et al.
Published: (2025-07-01)

Operation Margin of the ITER Central Solenoid During the Plasma Scenario
by: Lorenzo Cavallucci, et al.
Published: (2025-03-01)

SOLPS-ITER modification for impurity transport modelling in the tokamak pedestal region
by: V. Korzueva, et al.
Published: (2025-03-01)

Predictive Processing in Autism Spectrum Disorder: The Atypical Iterative Prior Updating Account
by: Zhuanghua Shi, et al.
Published: (2025-05-01)

Iterative Shaping of Error Patterns For Normal Syndrome Decoding of Iterative Codes
by: X. H. Ren, et al.
Published: (2022-03-01)

IMAGE DEHAZING USING FAST ITERATIVE DOMAIN GUIDED IMAGE FILTERING WITH GRAY WORLD OPTIMIZATION
by: Bhaskar Reddy Bada, et al.
Published: (2025-03-01)