Abstract

In this note, we show that the evaluation phase in the policy iteration algorithm for the infinite horizon discounted Markov decision problem can be done in O(mN²) operations, where N is the number of states of the Markov decision process and m is the number of states in which the decision changes during the policy improvement phase.
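One standard way to obtain a bound of this form is to maintain the inverse B = (I − βP_π)⁻¹ and update it with a rank-m Sherman-Morrison-Woodbury correction when only m rows of the transition matrix change. The sketch below illustrates that idea only; it is an assumption for exposition, not necessarily the construction used in this note, and all function and variable names (e.g. `woodbury_policy_update`) are hypothetical.

```python
import numpy as np

def woodbury_policy_update(B, P_new_rows, P_old_rows, changed, beta, r_new):
    """Illustrative rank-m update of B = (I - beta * P_pi)^(-1) when the
    policy changes only in the states listed in `changed` (|changed| = m).

    B           : (N, N) inverse maintained under the old policy
    P_new_rows  : (m, N) transition rows of the changed states, new policy
    P_old_rows  : (m, N) transition rows of the changed states, old policy
    changed     : length-m list of indices of the changed states
    beta        : discount factor in (0, 1)
    r_new       : (N,) one-step rewards under the new policy

    Returns (B_new, v_new) using O(m * N^2) arithmetic operations.
    """
    m = len(changed)

    # (I - beta*P_new) = (I - beta*P_old) + U @ W, where U stacks the unit
    # vectors of the changed states and W holds the scaled row differences.
    W = -beta * (P_new_rows - P_old_rows)        # (m, N)

    BU = B[:, changed]                           # = B @ U, column selection: O(m*N)
    WB = W @ B                                   # O(m*N^2)
    S = np.eye(m) + WB[:, changed]               # = I_m + W @ B @ U, (m, m)

    # Sherman-Morrison-Woodbury: B_new = B - (B U) S^{-1} (W B)
    B_new = B - BU @ np.linalg.solve(S, WB)      # O(m^3 + m*N^2)

    # Evaluation of the new policy: v_new = (I - beta*P_new)^{-1} r_new
    v_new = B_new @ r_new                        # O(N^2)
    return B_new, v_new
```

Under this accounting, the dominant costs are the products W @ B and (B U) S⁻¹ (W B), each O(mN²) when m ≤ N, which is consistent with the stated complexity.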