HW4 - CMPSCI 383 Fall 2011 Homework 4 Due in class or in...

CMPSCI 383, Fall 2011
Homework 4

Due in class or in the main office of the Computer Science building by 4:00 PM, December 6, 2011

Problem 1: (10 points) Exercise 14.1 on page 558
Problem 2: (10 points) Exercise 14.4 on page 559
Problem 3: (10 points) Exercise 14.8 on page 561
Problem 4: (20 points) Exercise 16.5 on page 641
Problem 5: (10 points) Exercise 17.2 on page 688
Problem 6: (15 points) Exercise 17.4 on page 688

Programming Assignment: (25 points)

For this programming assignment, you will implement the value iteration algorithm for a 5 × 5 gridworld with no walls and a terminal goal in the bottom right corner. Use γ = 0.9. The agent has four possible actions: up, down, left, right. Each action achieves the intended effect with probability 0.8, but the rest of the time, the action moves the agent at right angles to the intended direction (as in Figure 17.1). If the movement would take the agent into a wall, the agent does not move. Your program should read an input file,
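
The gridworld dynamics described above can be sketched in Python as follows. This is only an illustrative outline, not the expected solution: the preview cuts off before the input file format and the reward structure are given, so the rewards here (`STEP_REWARD`, `GOAL_REWARD`) are hypothetical placeholders, and the remaining 0.2 probability is assumed to split evenly (0.1 each) between the two perpendicular directions. All function and constant names are made up for this sketch.

```python
# Illustrative value iteration for a 5 x 5 gridworld.
# ASSUMPTIONS (not from the assignment text): reward values, reward on
# entering a state, and a 0.1 / 0.1 split between perpendicular slips.

GAMMA = 0.9            # discount factor given in the assignment
SIZE = 5               # 5 x 5 grid
GOAL = (4, 4)          # bottom right corner as (row, col), row 0 at the top
STEP_REWARD = -1.0     # hypothetical reward for entering a non-goal cell
GOAL_REWARD = 10.0     # hypothetical reward for entering the goal

ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
PERPENDICULAR = {
    "up": ("left", "right"),
    "down": ("left", "right"),
    "left": ("up", "down"),
    "right": ("up", "down"),
}


def move(state, action):
    """Deterministic move; the agent stays put if it would leave the grid."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row, new_col = row + d_row, col + d_col
    if 0 <= new_row < SIZE and 0 <= new_col < SIZE:
        return (new_row, new_col)
    return state


def transitions(state, action):
    """List of (probability, next_state) pairs for a noisy action."""
    side_a, side_b = PERPENDICULAR[action]
    return [
        (0.8, move(state, action)),
        (0.1, move(state, side_a)),
        (0.1, move(state, side_b)),
    ]


def reward(next_state):
    return GOAL_REWARD if next_state == GOAL else STEP_REWARD


def q_value(values, state, action):
    """Expected discounted return of taking `action` in `state`."""
    return sum(
        prob * (reward(nxt) + GAMMA * values[nxt])
        for prob, nxt in transitions(state, action)
    )


def value_iteration(tolerance=1e-6):
    """Repeat Bellman backups until the largest change is below tolerance."""
    values = {(r, c): 0.0 for r in range(SIZE) for c in range(SIZE)}
    while True:
        new_values = {}
        delta = 0.0
        for state in values:
            if state == GOAL:
                new_values[state] = 0.0  # terminal state: no future reward
                continue
            best = max(q_value(values, state, a) for a in ACTIONS)
            new_values[state] = best
            delta = max(delta, abs(best - values[state]))
        values = new_values
        if delta < tolerance:
            return values


def greedy_policy(values):
    """Pick, for each non-terminal state, an action with the highest Q-value."""
    return {
        state: max(ACTIONS, key=lambda a: q_value(values, state, a))
        for state in values
        if state != GOAL
    }


if __name__ == "__main__":
    utilities = value_iteration()
    policy = greedy_policy(utilities)
    for r in range(SIZE):
        print(" ".join(f"{utilities[(r, c)]:7.3f}" for c in range(SIZE)))
```

A real submission would replace the hard-coded constants with whatever the input file specifies. The sketch keeps the transition model in one function (`transitions`) so that a different noise model or grid size only requires changes in one place.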