检索结果-南通市图书馆

Convergence of Markov decision processes with constraints and state-action dependent discount factors

Science China Mathematics 2020年第1期63卷 167-182页

作者： Xiao Wu Xianping Guo School of Mathematics and Statistics Zhaoqing UniversityZhaoqing 526061China School of Mathematics Sun Yat-sen UniversityGuangzhou 510275China

This paper is concerned with the convergence of a sequence of discrete-time Markov decision processes(DTMDPs)with constraints,state-action dependent discount factors,and possibly unbounded *** the convex analytic approach under mild conditions,we prove that the optimal values and optimal policies of the original DTMDPs converge to those of the"limit"***,we show that any countablestate DTMDP can be approximated by a sequence of finite-state DTMDPs,which are constructed using the truncation ***,we illustrate the approximation by solving a controlled queueing system numerically,and give the corresponding error bound of the approximation.

关键词： discrete-time Markov decision processes state-action dependent discount factors unbounded costs convergence

来源：

维普期刊数据库

同方期刊数据库评论

在线全文

学校读者我要写书评

暂无评论

An average-value-at-risk criterion for Markov decision processes with unbounded costs

引用

Frontiers of Mathematics in China 2022年第4期17卷 673-687页

作者： Qiuli LIU Wai-Ki CHING Junyu ZHANG Hongchu WANG School of Mathematical Sciences South China Normal UniversityGuangzhou510631China Advanced Modeling and Applied Computing Laboratory Department of MathematicsThe University of Hong KongHong KongChina School of Mathematics Sun Yat-Sen UniversityGuangzhou510275China

We study the Markov decision processes under the average-value-at-risk *** state space and the action space are Borel spaces,the costs are admitted to be unbounded from above,and the discount factors are state-action *** suitable conditions,we establish the existence of optimal deterministic stationary ***,we apply our main results to a cash-balance model.

关键词： Markov decision processes average-value-at-risk(AVaR) state-action dependent discount factors optimal policy

来源：

维普期刊数据库评论

在线全文

维普期刊数据库

学校读者我要写书评

暂无评论

欢迎您,

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

在线全文

在线全文

请选择保存的检索档案：

请选择收藏分类：

通借通还

欢迎您,

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

在线全文

在线全文

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：