Abstract

With the long-run average performance as the primary criterion for a Markov decision process, variance measures are studied as secondary criteria. The steady-state variance and the limiting average variance along a sample path are discussed. The latter is difficult to handle because of its special form. With a sensitivity-based approach, the difference formula for the sample-path variance under different policies is constructed intuitively, and the optimality equation is then presented. Moreover, a policy iteration algorithm is developed. This work extends the sensitivity-based construction approach to Markov decision processes with non-standard performance criteria. The difference between the two types of variance criteria and the bias criterion is illustrated with a numerical example.
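For orientation, the following is one common formalization of the two criteria in the average-reward MDP literature; the notation (stationary distribution $\pi$, reward function $f$, long-run average $\eta$, horizon $T$) is assumed here and may differ in detail from the definitions used in the paper.

```latex
% Hedged sketch of the two variance criteria (assumed notation,
% not necessarily the paper's): stationary distribution \pi,
% reward function f, long-run average reward \eta.
\[
\eta = \sum_{i} \pi(i)\, f(i), \qquad
\sigma^2_{\mathrm{ss}} = \sum_{i} \pi(i)\, \bigl(f(i) - \eta\bigr)^2
\quad \text{(steady-state variance)}
\]
\[
\sigma^2_{\mathrm{sp}} = \lim_{T \to \infty} \frac{1}{T}
\sum_{t=0}^{T-1} \bigl(f(X_t) - \eta_T\bigr)^2, \qquad
\eta_T = \frac{1}{T} \sum_{t=0}^{T-1} f(X_t)
\quad \text{(sample-path variance)}
\]
```

Under this formalization, the sample average $\eta_T$ appearing inside the square couples all time steps, so the sample-path variance is not an additive functional of the chain; this is the "special form" that makes it harder to handle than the steady-state variance.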
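As background for the policy iteration result, the sketch below shows standard policy iteration for the long-run average (gain) criterion alone, under a unichain assumption; it is illustrative only and does not implement the paper's variance-oriented difference formula or optimality equation. The names (`evaluate_policy`, `policy_iteration`) and the solve-with-normalization construction are assumptions for this sketch, not the paper's notation.

```python
import numpy as np

def evaluate_policy(P, r):
    """Solve the Poisson equation  g*1 + (I - P) h = r  for a fixed policy,
    pinning h[0] = 0 (valid under a unichain assumption).
    Returns the gain g and the bias vector h."""
    n = len(r)
    A = np.zeros((n, n))
    A[:, 0] = 1.0                      # column for the unknown gain g
    A[:, 1:] = (np.eye(n) - P)[:, 1:]  # columns for h[1], ..., h[n-1]
    x = np.linalg.solve(A, r)
    return x[0], np.concatenate(([0.0], x[1:]))

def policy_iteration(P, r, tol=1e-10):
    """P[a] is the transition matrix and r[a] the reward vector under
    action a; returns an average-optimal policy with its gain and bias."""
    n_actions, n_states = len(P), P[0].shape[0]
    policy = np.zeros(n_states, dtype=int)
    while True:
        # Evaluation: gain and bias of the current policy.
        P_d = np.array([P[policy[s]][s] for s in range(n_states)])
        r_d = np.array([r[policy[s]][s] for s in range(n_states)])
        g, h = evaluate_policy(P_d, r_d)
        # Improvement: switch only on a strict gain in r(s,a) + P(s,a)h,
        # keeping the current action on ties to guarantee termination.
        new_policy = policy.copy()
        for s in range(n_states):
            q = np.array([r[a][s] + P[a][s] @ h for a in range(n_actions)])
            if q.max() > q[policy[s]] + tol:
                new_policy[s] = int(q.argmax())
        if np.array_equal(new_policy, policy):
            return policy, g, h
        policy = new_policy
```

A variance-minimizing variant in the spirit of the paper would replace the evaluation and improvement quantities with ones derived from the sample-path variance difference formula, while keeping this same evaluate-improve loop structure.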