Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes

This paper is concerned with the asymptotic optimality of quantized stationary policies for continuous-time Markov decision processes (CTMDPs) in Polish spaces with state-dependent discount factors, where the transition rates and reward rates are allowed to be unbounded. Using the dynamic programmin...

Full description

Saved in:

Bibliographic Details
Main Authors:	Xiao Wu, Yanqiu Tang
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	Discrete Dynamics in Nature and Society
Online Access:	http://dx.doi.org/10.1155/2022/1080946
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832555161040453632
author	Xiao Wu Yanqiu Tang
author_facet	Xiao Wu Yanqiu Tang
author_sort	Xiao Wu
collection	DOAJ
description	This paper is concerned with the asymptotic optimality of quantized stationary policies for continuous-time Markov decision processes (CTMDPs) in Polish spaces with state-dependent discount factors, where the transition rates and reward rates are allowed to be unbounded. Using the dynamic programming approach, we first establish the discounted optimal equation and the existence of its solutions. Then, we obtain the existence of optimal deterministic stationary policies under suitable conditions by more concise proofs. Furthermore, we discretize and incentivize the action space and construct a sequence of quantizer policies, which is the approximation of the optimal stationary policies of the CTMDPs, and get the approximation result and the rates of convergence on the expected discounted rewards of the quantized stationary policies. Also, we give an iteration algorithm on the approximate optimal policies. Finally, we give an example to illustrate the asymptotic optimality.
format	Article
id	doaj-art-00b1158acfad44d6b4ea046be09a5fa2
institution	Kabale University
issn	1607-887X
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	Discrete Dynamics in Nature and Society
spelling	doaj-art-00b1158acfad44d6b4ea046be09a5fa22025-02-03T05:49:21ZengWileyDiscrete Dynamics in Nature and Society1607-887X2022-01-01202210.1155/2022/1080946Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision ProcessesXiao Wu0Yanqiu Tang1School of Mathematics and StatisticsSchool of Mathematics and StatisticsThis paper is concerned with the asymptotic optimality of quantized stationary policies for continuous-time Markov decision processes (CTMDPs) in Polish spaces with state-dependent discount factors, where the transition rates and reward rates are allowed to be unbounded. Using the dynamic programming approach, we first establish the discounted optimal equation and the existence of its solutions. Then, we obtain the existence of optimal deterministic stationary policies under suitable conditions by more concise proofs. Furthermore, we discretize and incentivize the action space and construct a sequence of quantizer policies, which is the approximation of the optimal stationary policies of the CTMDPs, and get the approximation result and the rates of convergence on the expected discounted rewards of the quantized stationary policies. Also, we give an iteration algorithm on the approximate optimal policies. Finally, we give an example to illustrate the asymptotic optimality.http://dx.doi.org/10.1155/2022/1080946
spellingShingle	Xiao Wu Yanqiu Tang Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes Discrete Dynamics in Nature and Society
title	Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
title_full	Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
title_fullStr	Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
title_full_unstemmed	Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
title_short	Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
title_sort	asymptotic optimality and rates of convergence of quantized stationary policies in continuous time markov decision processes
url	http://dx.doi.org/10.1155/2022/1080946
work_keys_str_mv	AT xiaowu asymptoticoptimalityandratesofconvergenceofquantizedstationarypoliciesincontinuoustimemarkovdecisionprocesses AT yanqiutang asymptoticoptimalityandratesofconvergenceofquantizedstationarypoliciesincontinuoustimemarkovdecisionprocesses

Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes

Similar Items