Search condition: "Subject term = state-action dependent discount factors"
2 records found
Convergence of Markov decision processes with constraints and state-action dependent discount factors
Science China Mathematics, 2020, Vol. 63, No. 1, pp. 167-182
Authors: Xiao Wu, Xianping Guo
Affiliations: School of Mathematics and Statistics, Zhaoqing University, Zhaoqing 526061, China; School of Mathematics, Sun Yat-sen University, Guangzhou 510275, China
This paper is concerned with the convergence of a sequence of discrete-time Markov decision processes (DTMDPs) with constraints, state-action dependent discount factors, and possibly unbounded *** the convex analytic appr...
Source: VIP Journal Database; Tongfang Journal Database
An average-value-at-risk criterion for Markov decision processes with unbounded costs
Frontiers of Mathematics in China, 2022, Vol. 17, No. 4, pp. 673-687
Authors: Qiuli LIU, Wai-Ki CHING, Junyu ZHANG, Hongchu WANG
Affiliations: School of Mathematical Sciences, South China Normal University, Guangzhou 510631, China; Advanced Modeling and Applied Computing Laboratory, Department of Mathematics, The University of Hong Kong, Hong Kong, China; School of Mathematics, Sun Yat-Sen University, Guangzhou 510275, China
We study the Markov decision processes under the average-value-at-risk *** state space and the action space are Borel spaces, the costs are admitted to be unbounded from above, and the discount factors are state-action ...
Source: VIP Journal Database