Abstract

This technical note concerns the control of partially observable Markov decision processes characterized by a prior distribution over the underlying hidden Markov model parameters. In such settings, the control problem is commonly simplified by first choosing a point estimate from the model prior, and then selecting the control policy that is optimal with respect to that point estimate. Our contribution is to demonstrate, through a tractable yet nontrivial example, that even the best control policy constructed in this manner can significantly underperform the Bayes optimal policy. While the suboptimality of such point-estimate policies is an operative assumption in the Bayes-adaptive Markov decision process literature, to our knowledge no such illustrative example has been formally proposed.
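
To make the gap concrete, here is a minimal sketch in the same spirit, not the note's construction: a hypothetical one-step problem with a hidden model parameter theta in {A, B}, a uniform prior, and illustrative reward values chosen for this sketch. Every point estimate commits to an action that is worthless under the other model, while the Bayes optimal policy hedges and earns strictly more in expectation.

```python
# Hypothetical toy problem (not the note's example): one decision,
# hidden model parameter theta in {A, B}, uniform prior over models.
# Reward table r(action | theta); values are illustrative assumptions.
rewards = {
    "A": {"a1": 1.0, "a2": 0.0, "a3": 0.6},
    "B": {"a1": 0.0, "a2": 1.0, "a3": 0.6},
}
prior = {"A": 0.5, "B": 0.5}
actions = ["a1", "a2", "a3"]

def expected_reward(action):
    """Expected reward of an action under the model prior."""
    return sum(prior[m] * rewards[m][action] for m in prior)

# Point-estimate approach: commit to a single model, act optimally
# for it, then evaluate that policy under the true model prior.
ce_values = {}
for m in prior:
    best_for_m = max(actions, key=lambda a: rewards[m][a])
    ce_values[m] = expected_reward(best_for_m)
best_point_estimate = max(ce_values.values())   # 0.5 for either estimate

# Bayes optimal approach: optimize expected reward under the prior.
bayes_value = max(expected_reward(a) for a in actions)  # picks a3: 0.6

print(f"best point-estimate value = {best_point_estimate:.2f}")  # 0.50
print(f"Bayes optimal value       = {bayes_value:.2f}")          # 0.60
```

Note that this one-step toy only illustrates the flavor of the claim; the note's own example is a partially observable process, where the gap additionally reflects the value of information gathering that no point-estimate policy accounts for.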

  • Publication date: 2014-10