Derman%26apos;s book as inspiration: some results on LP for MDPs

作者:Kallenberg Lodewijk*
来源:Annals of Operations Research, 2013, 208(1): 63-94.
DOI:10.1007/s10479-011-1047-4

摘要

In 1976 I was looking for a suitable subject for my PhD thesis. My thesis advisor Arie Hordijk and I found a lot of inspiration in Derman%26apos;s book (Finite state Markovian decision processes, Academic Press, New York, 1970). Since that time I was interested in linear programming methods for Markov decision processes. In this article I will describe some results in this area on the following topics: (1) MDPs with the average reward criterion; (2) additional constraints; (3) applications. These topics are the main elements of Derman%26apos;s book.

  • 出版日期2013-9