This preview shows page 1. Sign up to view the full content.
Unformatted text preview: ability in the start state, and d) the optimal policy without discounting differs from the optimal policy with discounting and a discount factor of 0.9. Prove d) using value-iteration....
View Full Document
- Fall '08