Approximating Ergodic Average Reward Continuous-Time Controlled Markov Chains

Prieto Rumeau Tomas*; Maria Lorenzo Jose

doi:10.1109/TAC.2009.2033848

Source: IEEE Transactions on Automatic Control, 2010, 55(1): 201-207.

Abstract

We study the approximation of an ergodic average reward continuous-time denumerable state Markov decision process (MDP) by means of a sequence of MDPs. Our results include the convergence of the corresponding optimal policies and the optimal gains. For a controlled upwardly skip-free process, we show some computational results to illustrate the convergence theorems.
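
For orientation, the average reward criterion and the convergence of optimal gains mentioned in the abstract can be sketched as below. The notation is assumed for illustration and does not come from the abstract: $x(t)$ is the state process, $a(t)$ the action in use at time $t$, $r$ the reward rate, $\varphi$ a policy, $\mathcal{M}_n$ the approximating MDPs, and $g^*$, $g_n^*$ the optimal gains of the original and the $n$-th approximating model.

```latex
% Long-run expected average reward of policy \varphi from initial state i
% (notation assumed for illustration)
J(i,\varphi) = \liminf_{T\to\infty} \frac{1}{T}\,
  E_i^{\varphi}\!\left[\int_0^T r\bigl(x(t),a(t)\bigr)\,dt\right],
\qquad
g^* = \sup_{\varphi} J(i,\varphi).

% Convergence of the optimal gains along the approximating sequence \mathcal{M}_n
\lim_{n\to\infty} g_n^* = g^*.
```

In ergodic models of this kind the optimal gain typically does not depend on the initial state $i$, which is why $g^*$ is written here as a constant.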

Publication date: 2010-1
