NONSTATIONARY CONTINUOUS-TIME MARKOV DECISION-PROCESSES IN A SEMI-MARKOV ENVIRONMENT WITH DISCOUNTED CRITERION

HU QY<sup>*</sup>

doi:10.1006/jmaa.1995.1322

Abstract

This paper deals with the nonstationary continuous-time Markov decision process in a semi-Markov environment with the discounted criterion. The model describes a system that can itself be modeled by a countable-state nonstationary continuous-time Markov decision process with a nonhomogeneous transition rate family and reward rate function, but that is influenced by its environment, which is modeled as a semi-Markov process. With each change of the environment's state, (1) an instantaneous state transition of the system occurs; (2) an instantaneous reward occurs; and (3) the parameters of the nonstationary continuous-time Markov decision process vary. The precise formulation of the model is presented, and the optimality equation and the existence of ε-optimal policies (ε > 0) are proved.

Publication date: 1995-9-15
Affiliation: Xidian University


Last updated: 2018-08-02 21:00
