NEW DISCOUNT AND AVERAGE OPTIMALITY CONDITIONS FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES

Guo, Xianping<sup>*</sup>; Ye, Liuer

doi:10.1239/aap/1293113146

摘要

This paper deals with continuous-time Markov decision processes in Polish spaces, under the discounted and average cost criteria. All underlying Markov processes are determined by given transition rates which are allowed to be unbounded, and the costs are assumed to be bounded below. By introducing an occupation measure of a randomized Markov policy and analyzing properties of occupation measures, we first show that the family of all randomized stationary policies is 'sufficient' within the class of all randomized Markov policies. Then, under the semicontinuity and compactness conditions, we prove the existence of a discounted cost optimal stationary policy by providing a value iteration technique. Moreover, by developing a new average cost, minimum nonnegative solution method, we prove the existence of an average cost optimal stationary policy under some reasonably mild conditions. Finally, we use some examples to illustrate applications of our results. Except that the costs are assumed to be bounded below, the conditions for the existence of discounted cost (or average cost) optimal policies are much weaker than those in the previous literature, and the minimum nonnegative solution approach is new.

出版日期2010-12
单位中山大学

全文

访问全文

收藏分享被引(9) 浏览

更新时间：2019-09-04 10:16

NEW DISCOUNT AND AVERAGE OPTIMALITY CONDITIONS FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友