Markov decision processes with iterated coherent risk measures

Chu Shanyun; Zhang Yi<sup>*</sup>

doi:10.1080/00207179.2014.909947

登录

免费注册

赞收藏引用

科研之友

微信

新浪微博

Facebook

分享链接

Markov decision processes with iterated coherent risk measures

作者：Chu Shanyun; Zhang Yi^*

来源：International Journal of Control, 2014, 87(11): 2286-2293.

DOI：10.1080/00207179.2014.909947

摘要

This paper considers a Markov decision process in Borel state and action spaces with the aggregated (or say iterated) coherent risk measure to be minimised. For this problem, we establish the Bellman optimality equation as well as the value and policy iteration algorithms, and show the existence of a deterministic stationary optimal policy. The cost function, while being allowed to be unbounded from below (in the sense that its negative part needs be bounded by some nonnegative real-valued possibly unbounded weight function), can be arbitrarily unbounded from above and possibly infinitely valued.

出版日期2014

全文

访问全文

收藏分享被引浏览

更新时间：2019-02-21 06:22

相似论文
引用论文
参考文献

产品服务

科研之友科研之友机构版科创云

站内浏览

科研成果科研人员科研机构

服务支持

帮助中心隐私政策服务条款

联系方式

在线客服：【立即咨询】客户热线：400-1616-289 电子邮箱：support@scholarmate.com

微信公众号