Abstract

This paper surveys recent results on continuous-time Markov decision processes (MDPs) with unbounded transition rates and reward rates that may be unbounded from above and from below. These results pertain to the discounted and average reward optimality criteria, which are the most commonly used, and also to more selective concepts such as bias optimality and sensitive discount criteria. For concreteness, we consider only MDPs with a countable state space, but we indicate how the results can be extended to more general MDPs or to Markov games.
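For orientation, the two most common criteria named above can be sketched in standard notation (assuming a policy $\pi$, a state-action process $(x(t),a(t))$, a reward rate function $r$, a discount rate $\alpha>0$, and an initial state $i$; this notation is an illustrative assumption rather than one fixed by the text):
\[
V_\alpha(i,\pi) \;=\; \mathbb{E}_i^{\pi}\!\left[\int_0^\infty e^{-\alpha t}\, r\bigl(x(t),a(t)\bigr)\,dt\right],
\qquad
J(i,\pi) \;=\; \liminf_{T\to\infty}\,\frac{1}{T}\,\mathbb{E}_i^{\pi}\!\left[\int_0^T r\bigl(x(t),a(t)\bigr)\,dt\right],
\]
the expected discounted reward and the long-run expected average reward, respectively.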