hw9

Markov Decision Process Models

Invent a simple Markov Decision Process Model with the following properties: a) it has a goal state, b) its immediate action costs are all negative, c) all of its actions can result with some probability in the start state, and d) the optimal policy without discounting differs from the optimal policy with discounting and a discount factor of 0.9. Prove d) using value iteration.
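As an illustration of how value iteration can be used to check property d), here is a minimal sketch on one hypothetical model; it is not the intended or only answer. It assumes that "negative costs" means negative immediate rewards that the agent maximizes. The state names, the action names (`cheap`, `costly`), and all numbers are invented for this example. With a single non-goal state, an action with reward r that reaches the goal with probability p has value r / (1 - gamma * (1 - p)) when repeated forever, which gives -10 versus -8 without discounting and about -5.26 versus -7.27 with a discount factor of 0.9, so the preferred action flips.

```python
GOAL, START = "goal", "start"

# Hypothetical MDP (all names and numbers are assumptions for illustration):
# mdp[state][action] = (immediate_reward, [(probability, next_state), ...])
MDP = {
    START: {
        # Small penalty, but usually falls back to the start state.
        "cheap": (-1.0, [(0.1, GOAL), (0.9, START)]),
        # Large penalty, but reaches the goal half of the time.
        "costly": (-4.0, [(0.5, GOAL), (0.5, START)]),
    },
    GOAL: {},  # absorbing goal state: no actions, value stays 0
}


def q_value(mdp, values, state, action, gamma):
    """Expected return of taking `action` in `state`, then following `values`."""
    reward, outcomes = mdp[state][action]
    return reward + gamma * sum(p * values[nxt] for p, nxt in outcomes)


def value_iteration(mdp, gamma, tol=1e-10, max_sweeps=100_000):
    """Run in-place value iteration; return (values, greedy policy)."""
    values = {s: 0.0 for s in mdp}
    for _ in range(max_sweeps):
        delta = 0.0
        for state, actions in mdp.items():
            if not actions:  # goal state keeps value 0
                continue
            best = max(q_value(mdp, values, state, a, gamma) for a in actions)
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:  # stop once a full sweep changes almost nothing
            break
    policy = {
        state: max(actions, key=lambda a: q_value(mdp, values, state, a, gamma))
        for state, actions in mdp.items()
        if actions
    }
    return values, policy


if __name__ == "__main__":
    for gamma in (1.0, 0.9):
        values, policy = value_iteration(MDP, gamma)
        print(f"gamma={gamma}: V(start)={values[START]:.4f}, policy={policy}")
```

In this sketch the undiscounted run converges because every action reaches the goal with positive probability; a model in which some policy never reaches the goal would need a different stopping argument.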