hw9

Markov Decision Process Models

Invent a simple Markov Decision Process Model with the following properties: a) it has a goal state, b) its immediate action costs are all negative, c) all of its actions can result with some probability in the start state, and d) the optimal policy without discounting differs from the optimal policy with discounting and a discount factor of 0.9. Prove d) using value iteration.
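
As a rough illustration of the kind of check value iteration allows for property d), here is a minimal Python sketch. Everything concrete in it is an assumption made for illustration, not part of the assignment: the two-state layout (a start state S and an absorbing goal state G with value 0), the action names "slow" and "fast", their rewards and probabilities, and the reading of "negative immediate action costs" as negative immediate rewards. Each action either reaches G or leaves the agent in S, so every action can end in the start state.

```python
# Hypothetical two-state MDP: non-terminal start state S, absorbing goal G.
# Each action maps to (immediate reward, probability of reaching G).
# With the remaining probability the agent ends up back in S.
# All numbers are illustrative assumptions, not taken from the assignment.
ACTIONS = {
    "slow": (-1.0, 0.1),
    "fast": (-4.0, 0.5),
}


def q_values(v_start, gamma):
    """One-step lookahead values in S, given the current estimate of V(S).

    V(G) is fixed at 0, so only the 'stay in S' branch contributes.
    """
    return {
        name: reward + gamma * (1.0 - p_goal) * v_start
        for name, (reward, p_goal) in ACTIONS.items()
    }


def value_iteration(gamma, tol=1e-10, max_iters=10_000):
    """Repeat the Bellman optimality backup for S until the change is below tol.

    Returns the estimate of V(S) and the greedy action under that estimate.
    """
    v = 0.0
    for _ in range(max_iters):
        new_v = max(q_values(v, gamma).values())
        converged = abs(new_v - v) < tol
        v = new_v
        if converged:
            break
    q = q_values(v, gamma)
    return v, max(q, key=q.get)


if __name__ == "__main__":
    for gamma in (1.0, 0.9):
        v, best = value_iteration(gamma)
        print(f"gamma={gamma}: V(S)={v:.3f}, greedy action={best}")
```

Under these assumed numbers, the undiscounted run (discount factor 1) settles at V(S) = -8 with "fast" as the greedy action, while the run with discount factor 0.9 settles at about -5.263 with "slow" as the greedy action, so the two optimal policies differ. A written proof would still need to show the value-iteration steps for whichever model is actually invented.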