Monday, 29 April 2019

Büchi Objectives in Countable MDPs. (arXiv:1904.11573v1 [math.PR])

We study countably infinite Markov decision processes with Büchi objectives, which ask to visit a given subset of states infinitely often. A question left open by T.P. Hill in 1979 is whether there always exist $\varepsilon$-optimal Markov strategies, i.e., strategies that base decisions only on the current state and the number of steps taken so far. We provide a negative answer to this question by constructing a non-trivial counterexample. On the other hand, we show that Markov strategies with only 1 bit of extra memory are sufficient.
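
To make the two strategy classes in the abstract concrete, here is a minimal Python sketch of their interfaces. It is an illustration only, not the paper's construction: the names `run_markov`, `run_one_bit`, `toy_step`, and the two example strategies are hypothetical, the toy system has just two states, and its transitions are deterministic (a real MDP has probabilistic transitions and, in the paper's setting, countably many states). A finite prefix can only hint at a Büchi objective, which concerns infinitely many visits.

```python
from typing import Callable, Hashable, List, Set, Tuple

State = Hashable
Action = Hashable

# A Markov strategy sees only the current state and the step count.
MarkovStrategy = Callable[[State, int], Action]
# A 1-bit Markov strategy also reads and updates one bit of memory.
OneBitMarkovStrategy = Callable[[State, int, bool], Tuple[Action, bool]]
# Assumption: deterministic transitions, to keep the sketch short.
StepFn = Callable[[State, Action], State]


def run_markov(strategy: MarkovStrategy, step_fn: StepFn,
               start: State, horizon: int) -> List[State]:
    """Return the run prefix start, s_1, ..., s_horizon."""
    state, run = start, [start]
    for n in range(horizon):
        state = step_fn(state, strategy(state, n))
        run.append(state)
    return run


def run_one_bit(strategy: OneBitMarkovStrategy, step_fn: StepFn,
                start: State, horizon: int, bit: bool = False) -> List[State]:
    """Same as run_markov, but threads one memory bit through the run."""
    state, run = start, [start]
    for n in range(horizon):
        action, bit = strategy(state, n, bit)
        state = step_fn(state, action)
        run.append(state)
    return run


def count_visits(run: List[State], target: Set[State]) -> int:
    """Number of positions in the finite prefix that lie in the target set."""
    return sum(1 for s in run if s in target)


# Hypothetical toy system: states 0 and 1, "flip" toggles the state.
def toy_step(state: int, action: str) -> int:
    return 1 - state if action == "flip" else state


def flip_on_even_steps(state: int, n: int) -> str:
    return "flip" if n % 2 == 0 else "hold"


def flip_when_bit_set(state: int, n: int, bit: bool) -> Tuple[str, bool]:
    # Acts on the stored bit, then toggles it for the next step.
    return ("flip" if bit else "hold", not bit)


if __name__ == "__main__":
    target = {1}
    print(count_visits(run_markov(flip_on_even_steps, toy_step, 0, 4), target))
    print(count_visits(run_one_bit(flip_when_bit_set, toy_step, 0, 4), target))
```

The only difference between the two runners is the extra `bit` argument and return value, which is exactly the "1 bit of extra memory" the abstract refers to; everything else about the strategy's inputs is unchanged.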



from cs updates on arXiv.org http://bit.ly/2ULk09Z