Accelerated modified policy iteration algorithms for Markov decision processes

Shlakhter Oleksandr<sup>*</sup>; Lee Chi Guhn

doi:10.1007/s00186-013-0432-y

免费注册

赞收藏引用

科研之友

微信

新浪微博

Facebook

分享链接

Accelerated modified policy iteration algorithms for Markov decision processes

作者：Shlakhter Oleksandr^*; Lee Chi Guhn

来源：Mathematical Methods of Operations Research, 2013, 78(1): 61-76.

DOI：10.1007/s00186-013-0432-y

摘要

We propose a new approach to accelerate the convergence of the modified policy iteration method for Markov decision processes with the total expected discounted reward. In the new policy iteration an additional operator is applied to the iterate generated by Markov operator, resulting in a bigger improvement in each iteration.

出版日期2013-8

全文

访问全文

收藏分享被引浏览

更新时间：2021-04-26 12:04

相似论文
引用论文
参考文献

产品服务

科研之友科研之友机构版科创云

站内浏览

科研成果科研人员科研机构

服务支持

帮助中心隐私政策服务条款

联系方式

在线客服：【立即咨询】客户热线：400-1616-289 电子邮箱：support@scholarmate.com

微信公众号