Adding time to Bayes nets:
HMMs and Kalman filters
Recap: Bayesian nets encode conditional
independence relationships
Some oddities of the model
A snapshot of what happened at some point
in time
In a day? A month?
Assumes the past is not important
Cr
Admissible heuristics
E.g., for the 8-puzzle:
h1(n) = number of misplaced tiles
h2(n) = total Manhattan distance (i.e., no. of squares from desired location
of each tile)
h1(S) = ?
h2(S) = ?
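The two heuristics can be sketched in code. This is a minimal illustration, not the course's implementation: the class name `PuzzleHeuristics`, the row-major array encoding with 0 for the blank, the goal layout, and the sample state are all assumptions, since the slide's state S and goal are not shown here.

```java
/** Sketch of the two 8-puzzle heuristics; 0 marks the blank (an assumed encoding). */
class PuzzleHeuristics {
    // Assumed goal layout in row-major order, blank first; the slide's goal is not shown.
    static final int[] GOAL = {0, 1, 2, 3, 4, 5, 6, 7, 8};

    /** h1: number of tiles (ignoring the blank) that are not on their goal square. */
    static int h1(int[] state) {
        int misplaced = 0;
        for (int i = 0; i < state.length; i++) {
            if (state[i] != 0 && state[i] != GOAL[i]) {
                misplaced++;
            }
        }
        return misplaced;
    }

    /** h2: sum over all tiles of the Manhattan distance to the tile's goal square. */
    static int h2(int[] state) {
        // goalIndex[tile] = position of that tile in the goal layout
        int[] goalIndex = new int[9];
        for (int i = 0; i < 9; i++) {
            goalIndex[GOAL[i]] = i;
        }
        int total = 0;
        for (int i = 0; i < 9; i++) {
            int tile = state[i];
            if (tile == 0) {
                continue; // the blank is not a tile
            }
            int g = goalIndex[tile];
            // row difference plus column difference on the 3x3 board
            total += Math.abs(i / 3 - g / 3) + Math.abs(i % 3 - g % 3);
        }
        return total;
    }

    public static void main(String[] args) {
        // Hypothetical start state, chosen only for illustration.
        int[] s = {7, 2, 4, 5, 0, 6, 8, 3, 1};
        System.out.println("h1 = " + h1(s) + ", h2 = " + h2(s)); // h1 = 8, h2 = 18
    }
}
```

For this assumed state every tile is out of place, so h1 = 8, and h2 = 18. Neither value can exceed the true solution cost, because each move shifts one tile by one square, which is why both heuristics are admissible.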
Informed search
Romania with step costs in km
Adding knowledge: straight-line
distance to Bucharest
SLD: how short the drive would be if we
didn't need to travel on roads
366
Greedy best-first search
Evaluation function f(n) = h(n) (heuristic)
= estimate of cos
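A rough sketch of greedy best-first search on a fragment of the Romania map may help here. The class name `GreedyBestFirst`, the helper `road`, and the choice of cities are assumptions made for illustration; the straight-line distances are the usual textbook values for this map (366 for Arad matches the slide). The frontier is ordered by h(n) alone, so road lengths are never consulted.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.HashMap;
import java.util.LinkedList;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

/** Sketch of greedy best-first search: always expand the node with the smallest h(n). */
class GreedyBestFirst {
    // h(n): straight-line distance to Bucharest in km (subset of the map, for illustration).
    static final Map<String, Integer> SLD = new HashMap<>();
    static final Map<String, List<String>> ROADS = new HashMap<>();

    /** Hypothetical helper: records an undirected road between two cities. */
    static void road(String a, String b) {
        ROADS.computeIfAbsent(a, k -> new ArrayList<>()).add(b);
        ROADS.computeIfAbsent(b, k -> new ArrayList<>()).add(a);
    }

    static {
        SLD.put("Arad", 366);
        SLD.put("Zerind", 374);
        SLD.put("Timisoara", 329);
        SLD.put("Sibiu", 253);
        SLD.put("Fagaras", 176);
        SLD.put("Rimnicu Vilcea", 193);
        SLD.put("Pitesti", 100);
        SLD.put("Bucharest", 0);
        road("Arad", "Zerind");
        road("Arad", "Sibiu");
        road("Arad", "Timisoara");
        road("Sibiu", "Fagaras");
        road("Sibiu", "Rimnicu Vilcea");
        road("Rimnicu Vilcea", "Pitesti");
        road("Fagaras", "Bucharest");
        road("Pitesti", "Bucharest");
    }

    /** Returns the path found from start to goal, or an empty list if none exists. */
    static List<String> search(String start, String goal) {
        // The frontier is ordered by the heuristic only: f(n) = h(n).
        PriorityQueue<String> frontier =
                new PriorityQueue<>(Comparator.comparingInt(SLD::get));
        Map<String, String> parent = new HashMap<>(); // doubles as the visited set
        parent.put(start, null);
        frontier.add(start);
        while (!frontier.isEmpty()) {
            String city = frontier.poll();
            if (city.equals(goal)) {
                LinkedList<String> path = new LinkedList<>();
                for (String c = city; c != null; c = parent.get(c)) {
                    path.addFirst(c);
                }
                return path;
            }
            for (String next : ROADS.getOrDefault(city, Collections.emptyList())) {
                if (!parent.containsKey(next)) {
                    parent.put(next, city);
                    frontier.add(next);
                }
            }
        }
        return Collections.emptyList();
    }

    public static void main(String[] args) {
        System.out.println(search("Arad", "Bucharest"));
    }
}
```

On this fragment the search returns Arad, Sibiu, Fagaras, Bucharest. That route is not the cheapest by road (the route through Rimnicu Vilcea and Pitesti is shorter), which illustrates that greedy best-first search is not optimal.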
Representing independence
Computing conditional probability
What if you want P(Cavity | toothache)
P(C | t) = P(C, t) / P(t)
Note: P(C | t) means we want to know the
probability of each outcome of Cavity
<P(C=true | t), P(C=false | t)>
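The division P(C, t) / P(t) can be sketched as a small computation over a joint table. The class name `ConditionalFromJoint`, the array layout, and the joint probabilities below are assumptions chosen for illustration; they are not taken from the slides.

```java
/** Sketch: compute <P(C=true | t), P(C=false | t)> from a joint table over (Cavity, Toothache). */
class ConditionalFromJoint {
    /**
     * joint[c][t] holds P(Cavity = c, Toothache = t), where index 0 means true
     * and index 1 means false (an assumed layout). The argument t selects the
     * observed Toothache value.
     */
    static double[] cavityGiven(double[][] joint, int t) {
        // P(t) is obtained by summing the joint over both values of Cavity.
        double pT = joint[0][t] + joint[1][t];
        // Dividing each joint entry by P(t) gives the conditional distribution.
        return new double[] { joint[0][t] / pT, joint[1][t] / pT };
    }

    public static void main(String[] args) {
        // Assumed joint distribution; the four entries sum to 1.
        double[][] joint = {
            {0.12, 0.08},  // Cavity = true:  toothache, no toothache
            {0.08, 0.72}   // Cavity = false: toothache, no toothache
        };
        double[] result = cavityGiven(joint, 0); // condition on Toothache = true
        System.out.printf("<%.2f, %.2f>%n", result[0], result[1]);
    }
}
```

With these assumed numbers, P(t) = 0.12 + 0.08 = 0.20, so the result is approximately <0.6, 0.4>. The two entries always sum to 1 because P(t) acts as a normalizing constant.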
Joint probability
Reinforcement learning
Long history
Psychology: ~1910 (Skinner)
Computer Science: 1955 (Samuel's checkers
player)
Similar to neural nets, interplay between
human and computer learning
Psychology: operant conditioning
RL success
/*
* This code is created for cs 4341 AI 2013a at WPI. All rights are reserved.
*/
package referee;
import java.io.*;
import java.util.*;
/*
* @author lzhu
*
*/
public class Referee {
Board board;
int N=4;
int board_height=6;
int board_width=7;
Timer a
Homework and Project Goal:
The goal of Project 1 is to help you understand exactly how different search strategies work.
You will implement each of nine net search algorithms. Among the searches are basic searches,
heuristically informed searches, and opt
GAME DESCRIPTION
Tic-Tac-Toe is a two-player game (one of the players being your computer program). We deviate
from the classical game, which is in 2 dimensions, and instead implement a 3-dimensional version
of the game. We would have a 3D board which is a 4
Problem I. Decision Trees (25 points)
Consider the following dataset that specifies the type of contact lenses prescribed to a
patient based on the patient's age, astigmatism, and tear production rate. The purpose of this
problem is to use informa
Problem I. Search (20 points)
Suppose that you need to find a path between S and G in the following graph. The number
attached to each edge in the graph represents the COST of traversing the edge. The number
inside each node represents a heuristic under-e
Samuel's checkers
Why I adore this paper
Considered two approaches
Large, randomly connected neural networks
Size of network is a problem with 1950s hardware
(we're just getting there now)
Vs. special-built approach for just 1 task
Checkers has been sol