Markov Decision Problems Where Means Bound Variances

Arlotto Alessandro<sup>*</sup>; Gans Noah; Steele J Michael

doi:10.1287/opre.2014.1281

登录

免费注册

赞收藏引用

科研之友

微信

新浪微博

Facebook

分享链接

Markov Decision Problems Where Means Bound Variances

作者：Arlotto Alessandro^*; Gans Noah; Steele J Michael

来源：Operations Research, 2014, 62(4): 864-875.

DOI：10.1287/opre.2014.1281

摘要

We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of the optimal total reward can be bounded by a simple linear function of its expected value. The class is characterized by three natural properties: reward nonnegativity and boundedness, existence of a do-nothing action, and optimal action monotonicity. These properties are commonly present and typically easy to check. Implications of the class properties and of the variance bound are illustrated by examples of MDPs from operations research, operations management, financial engineering, and combinatorial optimization.

出版日期2014-8

全文

访问全文

收藏分享被引(4) 浏览

更新时间：2019-03-27 19:19

相似论文
引用论文
参考文献

产品服务

科研之友科研之友机构版科创云

站内浏览

科研成果科研人员科研机构

服务支持

帮助中心隐私政策服务条款

联系方式

在线客服：【立即咨询】客户热线：400-1616-289 电子邮箱：support@scholarmate.com

微信公众号