Solutions to Problem Set 1
October 7, 2009
Exercise 1: (20 points) Some years ago, greek video-club chain Seven had the following oer to their customers: every time a customer rented a DVD, he was giv
Problem Set 1
September 14, 2009
Due date:
Monday, September 28 2009 at 4pm; before class.
Exercise 1: (20 points) Some years ago, greek video-club chain Seven had the following oer to their customers
Boston University Department of Computer Science CS 565 Data Mining
Midterm Exam Date: Oct 14, 2009 Write Your University Number Here: Answer all questions. Good luck! Problem 1 [25 points] True or Fa
Boston University Department of Computer Science CS 565 Data Mining
Midterm Exam Solutions Date: Oct 14, 2009 Write Your University Number Here: Answer all questions. Good luck! Problem 1 [25 points]
Clustering Aggregation
References
A. Gionis, H. Mannila, P. Tsaparas: Clustering
aggregation, ICDE 2004
N. Ailon, M. Charikar, A. Newman: Aggregating
inconsistent information: Ranking and clusterin
Lecture outline
Classification
Decision-tree classification
What is classification?
What is classification?
Classification is the task of learning a
target function f that maps attribute set x
to o
Hierarchical Clustering
Hierarchical Clustering
Produces a set of nested clusters
organized as a hierarchical tree
Can be visualized as a dendrogram
A tree-like diagram that records the
sequences o
BU CS565 - Project 1 - Fall 2016
October 3, 2016
Due date:
Nov 2, 2016 before class.
Description: For the first project, you are requested to predict star ratings associated with user reviews
from Ama
Model Evaluation
Metrics for Performance Evaluation
How to evaluate the performance of a
model?
Methods for Performance Evaluation
How to obtain reliable estimates?
Methods for Model Comparison
Homework 2
October 11, 2016
Due date:
Mon, Oct 28, 2016 at 11:59pm.
Exercise 1 (25 points):
1. Consider a set of d-dimensional points X = cfw_x1 , . . . , xn and distance function
D2 (xi , xj ) =
d
X
Problem Set 1
September 17, 2016
Due date:
Oct 10, 2016 at midnight.
Remeber: For any question you answer I do not know you get 20% of the grade associated with this
question. A totally wrong answer g
Clustering Aggregation
References
A. Gionis, H. Mannila, P. Tsaparas: Clustering
aggregation, ICDE 2004
N. Ailon, M. Charikar, A. Newman: Aggregating
inconsistent information: Ranking and clusterin
Lecture outline
Nearest-neighbor search in low
dimensions
kd-trees
Nearest-neighbor search in high
dimensions
LSH
Applications to data mining
Wednesday, September 18, 13
Definition
Given: a set
Problem Set 1
September 13, 2013
Due date:
Mon, Sept 30 2013 at 4pm; before class.
Exercise 1 (20 points): You are given a set V consisting of n integers. The task is to report all n
products of the n
Hierarchical Clustering
Friday, October 4, 13
Hierarchical Clustering
Produces a set of nested clusters
organized as a hierarchical tree
Can be visualized as a dendrogram
A tree-like diagram that r
Clustering: Partition
Clustering
Wednesday, October 2, 13
Lecture outline
Distance/Similarity between data
objects
Data objects as geometric data points
Clustering problems and algorithms
K-means
Measuring distance/
similarity of data objects
Wednesday, September 11, 13
Multiple data types
Records of users
Graphs
Images
Videos
Text (webpages, books)
Strings (DNA sequences)
Timeseries
How do we
Epidemics and Information
Propagation in Social
Networks
Epidemic Processes
Viruses, diseases
Online viruses, worms
Fashion
Adoption of technologies
Behavior
Ideas
Example: Ebola virus
First emerged
Graph Clustering
Outline
Min s-t cut problem
Min cut problem
Multiway cut
Minimum k-cut
Other normalized cuts and spectral
graph partitionings
Min s-t cut
Weighted graph G(V,E)
An s-t cut C = (S,T)
Basics of network analysis
and network models
Measuring Networks
Degree distributions
Small world phenomena
Clustering Coefficient
Mixing patterns
Degree correlations
Communities and clusters
Degree d
Homework 1
September 11, 2017
Due date:
Sept 22, 2017 at midnight.
Instructions:
1. For any question you answer I do not know you get 20% of the grade associated with this question.
A totally wrong an
Matrix Completion
References
R. Meka, P. Jain, I. Dhilon: Matrix Completion from Powerlaw distributed samples, NIPS 2009
N. Ruchansky, M. Crovella, E. Terzi: Matrix Completion with
Queries, KDD 201
08B-Clustering-III
October 31, 2016
1
Clustering data with k-means
Today well do an extended example showing k-means clustering in practice and in the context of
the python libraries scikit-learn.
1.1
2-Pandas
September 3, 2016
1
1.1
Getting to know your data with Pandas
Pandas
Pandas is the Python Data Analysis Library.
Pandas is an extremely versatile tool for manipulating datasets.
It also produ
11-Dimensionality-Reduction-SVD-II
October 18, 2016
1
Dimensionality Reduction - SVD II
In the last lecture we learned about the SVD as a tool for constructing
low-rank matrices.
Today well look at it
4-Linear-Algebra-Refresher
September 20, 2016
1
Linear Algebra Refresher
Today well review the essentials of linear algebra. Given the prerequisites for
this course, I assume that you learned all of t
CS 591 S1 Computational Audio - Spring,
2017
Wayne Snyder
Computer Science Department
Boston University
Lecture 5
Modulation synthesis: amplitude, ring, and frequency modulation
Amplitude modulation:
CS 512, Spring 2017, Handout 14
Binary Decision Diagrams (BDDs)
Assaf Kfoury
February 26, 2017
Assaf Kfoury, CS 512, Spring 2017, Handout 14
page 1 of 28
background and reading material
I The last cha