CX4240 Homework 1
Deadline: 2/26 Fri, 9:30am
Typing with LaTeX is highly recommended.

ISyE 3833: Two Practice Questions on Generic LP Models
Exercise 1. A small mining company currently works two coal seams and produces three
grades of coal. It costs $1900 per hour to work the Moran seam, obtaining in that time 50
tons of anthracite

CX 4240 Homework 3
Deadline: 4/8 Friday, 11:55PM
Clustering
Computational Data Analytics
CX 4240, Spring 2015
Clustering images
Image
Databases
Goal of clustering:
Divide objects into groups
such that objects within a group
are more similar to each other than
to those outside the group
Cluster other things
Cluster

Clustering Nodes in Graphs
Computational Data Analytics
CX 4240, Spring 2015
Clustering images
Image
Databases
Goal of clustering:
Divide objects into groups
such that objects within a group
are more similar to each other than
to those outside the group
Formal statement

Lecture Notes 1
Brief Review of Basic Probability
(Casella and Berger Chapters 1-4)
Probability Review
Chapters 1-4 are a review.
It can be shown that $X \overset{d}{=} Y$ if and only if $F_X(t) = F_Y(t)$ for all $t$.
1.2 Expected Values
The mean or expected value of $g(X)$ is

Lecture Notes 1
Brief Review of Basic Probability
(Casella and Berger Chapters 1-4)
Probability Review
Chapters 1-4 are a review. I will assume you have read and understood Chapters
1-4. Let us recall some of the key ideas.
1.1 Random Variables
A random

MATLAB Primer Third Edition
Kermit Sigmon
Department of Mathematics, University of Florida, Gainesville, FL 32611
Copyright © 1989, 1992, 1993 by Kermit Sigmon
On the Third Edition

A = [4 -3; 5 5]
b = [-13;10]
x = A\b % solve A*x = b; backslash is preferred over inv(A)*b
transpose(A)
% inner product: transpose(b)*v returns a single value: multiply corresponding
% elements and sum them together
% outer product: b*transpose(v) returns an n-by-n matrix
% A*v = [A(1,:)*v; A(2,:)*v; ...; A(m,:)*v]
Cij
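The snippet above solves a 2-by-2 system and describes inner and outer products in comments. As a rough cross-check, here is a minimal Python sketch of the same ideas using only the standard library. The helper names `solve_2x2`, `inner`, and `outer` are hypothetical and not part of the original script, and Cramer's rule stands in for MATLAB's solver purely for illustration.

```python
from fractions import Fraction


def solve_2x2(A, b):
    """Solve A x = b for a 2x2 matrix A using Cramer's rule (illustrative only)."""
    (a11, a12), (a21, a22) = A
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("matrix is singular")
    x1 = Fraction(b[0] * a22 - a12 * b[1], det)
    x2 = Fraction(a11 * b[1] - a21 * b[0], det)
    return [x1, x2]


def inner(u, v):
    """Inner product: multiply corresponding elements and sum them."""
    return sum(ui * vi for ui, vi in zip(u, v))


def outer(u, v):
    """Outer product: an n-by-n table whose (i, j) entry is u[i] * v[j]."""
    return [[ui * vj for vj in v] for ui in u]


# Same matrix and right-hand side as in the MATLAB snippet
A = [[4, -3], [5, 5]]
b = [-13, 10]
x = solve_2x2(A, b)
print(x)

# Each entry of A*x is the inner product of a row of A with x
Ax = [inner(row, x) for row in A]
print(Ax)
```

Substituting the solution back into the system, as the last lines do, is a cheap way to confirm the result.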

Dimensionality Reduction &
Principal Component Analysis
Le Song
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
Image databases
Image
Databases
What are the relations
between data points?
Handwritten digits
What are the

Visualization: Matlab Demos
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
Outline
Logical operation
If-else statement
Loop statement
Line Plot
Histogram
Scatter Plot
Surface Plot
Contour Plot
File read/write
Im

Probability and Statistics:
Matlab Demos
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
Outline
Calculating Probabilities through Simulation
Calculating Joint and Conditional through Simulation
Generating random

%
% Rolling a die
% Simulation of events: 3 dots show (probability of 1/6),
M = 50000; % rolls
outcomes = zeros(1, M); % keep outcomes here (preallocated)
for i = 1:M
outcomes(i) = ceil( 6*rand );
% ceil(6*rand) rounds up (ceiling) a random
% number uniform on (0,6), thus the outcome
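The simulation idea in this script (roll many times, then count how often three dots show) can be sketched in Python as follows. This is an illustrative stand-in rather than the course's code: the function name `estimate_probability`, its parameters, and the fixed seed are assumptions, and `random.randint` replaces the `ceil(6*rand)` construction.

```python
import random


def estimate_probability(face=3, rolls=50000, seed=0):
    """Estimate P(die shows `face`) as the relative frequency over simulated rolls."""
    rng = random.Random(seed)  # fixed seed (an assumption) so the run is repeatable
    outcomes = [rng.randint(1, 6) for _ in range(rolls)]  # each roll is uniform on 1..6
    return outcomes.count(face) / rolls


estimate = estimate_probability()
print(estimate, 1 / 6)
```

With 50000 rolls the relative frequency should land close to 1/6, and it tightens as the number of rolls grows.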

Probability and Statistics Review
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
Basic Probability Concepts
A sample space S is the set of all possible outcomes of a
conceptual or physical, repeatable experiment

Linear Algebra with
Matlab
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
Introduction to MATLAB
What is MATLAB?
MATLAB provides a language and environment for numerical
computation, data analysis, visualisation

Linear Algebra Review
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
Outline
Motivating Example Eigenfaces
Basics
Dot and Vector Products
Identity, Diagonal and Orthogonal Matrices
Trace
Norms
Rank and linear in

%
clear; % clear all variables;
clc; % clear the command window;
% define a column vector (two equivalent ways)
a = [1, 3, 4, 5]'
a = [1; 3; 4; 5]
%
% define a matrix;
% semicolon used to suppress the output;
A = [1, 2; 3, 4; 5, 6];
% output the second entry in a;
tmp = a(2)
% o

Introduction
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2015
What is machine learning (ML)
Study of algorithms that improve their performance at some
task with experience
Common to industrial scale problems

Feature Selection
Computational Data Analysis
CX 4240, Spring 2017
Feature selection
What are the best pixels for classifying photos of boys and girls?
A feature selection algorithm
Given a dataset
For each value of the label
Esti

Density Estimation
Computational Data Analytics
CX 4240, Spring 2017
Why do we need density estimation?
Learn more about the shape of the data cloud
Assess the likelihood of seeing a particular data point
Is this a typical data point? (high density)

ISyE 3833: Two Practice Questions on Generic LP Models
with Hints
Exercise 1. A small mining company currently works two coal seams and produces three
grades of coal. It costs $1900 per hour to work the Moran seam, obtaining in that time 50
tons of anthracite

ISyE 3833: Two Practice Questions on Generic LP Models
with Hints and Answers
Exercise 1. A small mining company currently works two coal seams and produces three
grades of coal. It costs $1900 per hour to work the Moran seam, obtaining in that time 50
to

Probability and Statistics Review
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2016
Basic Probability Concepts
A sample space S is the set of all possible outcomes of a
conceptual or physical, repeatable experiment

Linear Algebra Review
Introduction to Computational Data Analysis
(Machine Learning)
CX4240, Spring 2016
Outline
Motivating Example Eigenfaces
Basics
Dot and Vector Products
Identity, Diagonal and Orthogonal Matrices
Trace
Norms
Rank and linear in

CX 4240 Mid-term Exam - (PRACTICE)
March 13, 2017
Name:
GT ID:
E-mail:
Problem      1    2    3    4    Total
Point        25   25   25   25   100
Your Score
Instructions:
Try your best to be as clear as possible. No credit may be given for unreadable writing.
The exam

Answers
January 19, 2017
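1 Probability and Statistics 1
1. C
2. $a^2 \mathrm{Var}[X]$
3. $\frac{1}{n}\sum_{i=1}^{n}(X_i - \bar{X}_n)^2$ or $\frac{1}{n-1}\sum_{i=1}^{n}(X_i - \bar{X}_n)^2$, where $\bar{X}_n = \bar{X}$
4. C
5. $f(x) = \begin{cases} 1 & \text{if } 0 \le x \le 1 \\ 0 & \text{otherwise} \end{cases}$
6. C
7. D
8. $\frac{5}{36}$
9. TRUE
10. $1 - (1 - p^n)^m$
2 Probability and Statistics 2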
1. C
2.

CX 4240 Mid-term Exam (2017 Spring)
3/16 Thursday, 12:05 - 1:25 pm
Name:
GT ID:
E-mail:
Problem      1    2    3    4    5    Total
Point        15   25   25   25   25   115
Your Score
Instructions:
Try your best to be as clear as possible. No credit may be given for unreadable writing.

%
clear;
clc;
%
% meaning of each column of scores
% dim 1: HW1 score
% dim 2: stats background score
% dim 3: linear algebra background score
% dim 4: midterm score
scores = load('scoredata.txt');
X = scores(:,1:3)';
m = size(X, 2);
X

Linear Algebra Review and Reference
1 Basic Concepts and Notation
Linear algebra provides a way of compactly representing and operating on sets of linear
equations. For example, consider the following system of equations:
$4x_1 - 5x_2 = -13$
$-2x_1 + 3x_2 = 9.$
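The system can be read as $4x_1 - 5x_2 = -13$ and $-2x_1 + 3x_2 = 9$. The minus signs here are an assumption, since the extracted text appears to have dropped them. Under that reading, the compact matrix form and a worked solution might look like this:

```latex
\[
A x = b, \qquad
A = \begin{bmatrix} 4 & -5 \\ -2 & 3 \end{bmatrix}, \quad
b = \begin{bmatrix} -13 \\ 9 \end{bmatrix}, \quad
x = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix}
\]
\[
\det A = 4 \cdot 3 - (-5)(-2) = 2 \neq 0
\;\Longrightarrow\;
x = A^{-1} b
  = \frac{1}{2} \begin{bmatrix} 3 & 5 \\ 2 & 4 \end{bmatrix}
    \begin{bmatrix} -13 \\ 9 \end{bmatrix}
  = \begin{bmatrix} 3 \\ 5 \end{bmatrix}
\]
```

Substituting back gives $4(3) - 5(5) = -13$ and $-2(3) + 3(5) = 9$, which matches both equations.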
Zico

CX 4240 Homework 2
Deadline: 3/17 Friday, 11:55pm
Submit your answers as an electronic copy on T-square.
No unapproved extension of deadline is allowed. Zero credit will be assigned for late submissions.
Email request for late submission may not be answered.

CX 4240 Homework 4
Deadline: 4/23 Sunday, 11:55PM
Submit your answers as an electronic copy on T-square.
No unapproved extension of deadline is allowed. Late submission will lead to 0 credit.
Typing with LaTeX is highly recommended.

CX 4240 Homework 3
Deadline: 4/8 Sunday, 11:55PM
Submit your answers as an electronic copy on T-square.
No unapproved extension of deadline is allowed. Late submission will lead to 0 credit.
Typing with LaTeX is highly recommended.

CX4240 Homework 1
Deadline: 2/28 Tuesday, 11:55PM
Submit your answers as an electronic copy on T-square.
No unapproved extension of deadline is allowed. Late submission will lead to 0 credit.
Typing with LaTeX is highly recommended.