A study of value iteration and policy iteration for Markov decision processes in Deterministic systems
In the context of deterministic discrete-time control systems, we examined the implementation of value iteration (VI) and policy iteration (PI) algorithms in Markov decision processes (MDPs) situated within Borel spaces. The deterministic nature of the system's transfer function plays a pivotal role, as...
| Main Authors: | Haifeng Zheng, Dan Wang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | AIMS Press, 2024-11-01 |
| Series: | AIMS Mathematics |
| Subjects: | |
| Online Access: | https://www.aimspress.com/article/doi/10.3934/math.20241613 |
Similar Items
- SOLPS-ITER simulation of W limiter start-up on ITER
  by: Y. Zhang, et al.
  Published: (2025-01-01)
- A new preconditioned Richardson iterative method
  by: Hassan Jamali, et al.
  Published: (2024-12-01)
- SOLPS-ITER simulations of the ITER divertor with improved plasma-facing component geometry
  by: A.A. Pshenov, et al.
  Published: (2025-03-01)
- Fixed Point Iteration Method
  by: Mehmet Karakas
  Published: (2013-05-01)
- Conclusive benchmark of SOLPS-ITER against the SOLPS4.3 ITER divertor design reference
  by: S. Wiesen, et al.
  Published: (2025-01-01)