Analysis for some properties of discrete time Markov decision processes

Hu QY; Yue WI*

doi:10.1080/02331930310001611493


Source: Optimization, 2003, 52(4-5): 495-505.


Abstract

This paper investigates properties of the optimality equation and of optimal policies in discrete time Markov decision processes with expected discounted total rewards, under the weak conditions that the model is well defined and the optimality equation holds. The optimal value function is characterized as a solution of the optimality equation, and the structure of optimal policies is also given.
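
For readers unfamiliar with the optimality equation, the sketch below may help. It is not taken from the paper: it assumes the simplest possible setting (finite states, finite actions, bounded rewards, discount factor below 1), which is far more restrictive than the weak conditions the abstract describes. The names `value_iteration`, `P`, `r`, and `beta` are illustrative choices. In this setting the optimality equation reads V(s) = max over a of [ r(s, a) + beta * sum over j of P(j | s, a) * V(j) ], and repeatedly applying the right-hand side converges to the optimal value function.

```python
def value_iteration(P, r, beta, tol=1e-10, max_iter=10_000):
    """Solve the discounted optimality equation on a finite MDP (illustrative).

    P[a][s][j]: probability of moving from state s to state j under action a.
    r[s][a]:    one-step reward for taking action a in state s.
    beta:       discount factor, assumed to satisfy 0 <= beta < 1.
    """
    n_states = len(r)
    n_actions = len(r[0])

    def q(V, s, a):
        # Right-hand side of the optimality equation for a fixed action a.
        return r[s][a] + beta * sum(P[a][s][j] * V[j] for j in range(n_states))

    V = [0.0] * n_states
    for _ in range(max_iter):
        V_new = [max(q(V, s, a) for a in range(n_actions)) for s in range(n_states)]
        gap = max(abs(x - y) for x, y in zip(V_new, V))
        V = V_new
        if gap < tol:
            break

    # A policy that picks a maximizing action in every state.
    policy = [max(range(n_actions), key=lambda a: q(V, s, a)) for s in range(n_states)]
    return V, policy


# Hypothetical two-state example: action 0 stays put, action 1 switches state.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
]
r = [
    [0.0, 1.0],  # state 0: staying pays 0, switching pays 1
    [2.0, 0.0],  # state 1: staying pays 2, switching pays 0
]
V, policy = value_iteration(P, r, beta=0.5)
print(V, policy)
```

In this toy example the computed value function satisfies the optimality equation up to the tolerance, and the greedy policy (switch out of state 0, stay in state 1) attains it. This mirrors, in the finite case only, the link between solutions of the optimality equation and the structure of optimal policies that the abstract refers to.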

Affiliation: Shanghai University
