Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani
An Introduction to Statistical Learning
with Applications in R
An Introduction to Statistical Learning provides an accessible overview of th
Sequential Pattern Mining
Why sequential pattern mining?
GSP algorithm
FreeSpan and PrefixSpan
CS249: Big Data Analytics
Sequence Databases and
Sequential Pattern Analysis
(Temporal) order is impor
A brief Introduction to
Deep Learning
Presenter: Guangyu Zhou
ScAi Lab, CS @ UCLA
CS 249 Big Data Analytics
Deep learning vs human
CS 249 Big Data Analytics
DL is also providing breakthrough resul
Mining Frequent Subgraphs
Wei Wang
The UNIVERSITY of CALIFORNIA, LOS ANGELES
Overview
Introduction
Finding recurring subgraphs from graph databases.
FSG
gSpan
FFSM
10/31/17
Labeled Gr
CS249 Sample Questions
We want to build a classification model using CBA and the following 5 transactions as the training data.
The class label is a Boolean variable (Yes, No). Assume that min_suppo
Association Rule Mining
CS249
Fall 2017
The UNIVERSITY of CALIFORNIA at LOS ANGELES
Outline
What is association rule mining?
Methods for association rule mining
Extensions of association rule
CS249:
CS 249
Big Data Analytics
Instructor: Wei Wang
Fall 2017
The UNIVERSITY of CALIFORNIA, LOS ANGELES
Big Data are Everywhere
So are the Challenges
Bi-Clustering
Wei Wang
The UNIVERSITY of CALIFORNIA at LOS ANGELES
Data Mining: Clustering
K-means clustering minimizes
$$\sum_{t=1}^{k} \sum_{i \in c_t} \mathrm{dist}(x_i, c_t)^2$$
where
$$\mathrm{dist}(x_i, c_t) = \sqrt{\sum_{j=1}^{m} (x_{ij} - c_{tj})^2}$$
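As a minimal sketch of the K-means objective, the following Python computes the sum of squared Euclidean distances between each point and the centroid of its assigned cluster. The function names (`dist`, `kmeans_objective`), the toy points, and the fixed assignment are illustrative assumptions, not part of the original slides; the sketch only evaluates the objective and does not run the K-means iteration itself.

```python
import math


def dist(x, c):
    """Euclidean distance between a point x and a centroid c (both length m)."""
    return math.sqrt(sum((xj - cj) ** 2 for xj, cj in zip(x, c)))


def kmeans_objective(points, centroids, assignment):
    """Sum over all points of the squared distance to the assigned centroid.

    assignment[i] is the index t of the cluster that points[i] belongs to.
    """
    return sum(dist(x, centroids[t]) ** 2 for x, t in zip(points, assignment))


# Hypothetical toy data: four 2-D points split into two clusters.
points = [(0.0, 0.0), (0.0, 2.0), (4.0, 0.0), (4.0, 2.0)]
centroids = [(0.0, 1.0), (4.0, 1.0)]
assignment = [0, 0, 1, 1]

objective = kmeans_objective(points, centroids, assignment)
print(objective)
```

A lower value means the points sit closer to their assigned centroids; K-means alternates between reassigning points and recomputing centroids to reduce this quantity.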
The
KDD 2017 Research Paper
KDD '17, August 13-17, 2017, Halifax, NS, Canada
Similarity Forests
Saket Sathe
Charu C. Aggarwal
IBM T. J. Watson Research Center
Yorktown Heights, NY 10598
IBM
Data Mining: The Textbook
Charu C. Aggarwal
IBM T.J. Watson Research Center
Yorktown Heights
New York
USA
A solution manual for this book is available on Spr
Association Rule Mining
CS249
Winter 2015
The UNIVERSITY of CALIFORNIA at LOS ANGELES
Outline
What is association rule mining?
Methods for association rule mining
Extensions of association rule
CS24
CS 249
Big Data Analytics
Instructor: Wei Wang
Winter 2015
The UNIVERSITY of CALIFORNIA at LOS ANGELES
Big Data are Everywhere
So are the Challenges
1. KDD Cup 2014: Predicting Excitement at DonorsChoose.org
a) Problem Description: DonorsChoose.org is an online charity that makes it
easy to help students in need through school donations. At any ti
The following papers come from KDD 2014, ICDM 2014, ICDE 2014, CIKM 2014,
and VLDB 2014
1. Graph Classification
(1) Scalable SVM-based Classification in Dynamic Graphs (ICDM14)
(2) Multi-Graph-View Le
Association Rule Mining
CS249
Winter 2015
The UNIVERSITY of CALIFORNIA at LOS ANGELES
Sequential Pattern Mining
Why sequential pattern mining?
GSP algorithm
FreeSpan and PrefixSpan
Border Collapsing
Association Rule Mining
CS249
Winter 2015
The UNIVERSITY of CALIFORNIA at LOS ANGELES
Partition: Scan Database Only
Twice
Partition the database into n partitions
Itemset X is frequent X is frequent i
CS249: ADVANCED DATA MINING
Classification Evaluation and Practical
Issues
Instructor: Yizhou Sun
April 24, 2017
Announcements
Homework 2 out
Due May 3rd (11:59pm)
Course project
CS249: ADVANCED DATA MINING
Vector Data: Clustering: Part II
Instructor: Yizhou Sun
May 2, 2017
Methods to Learn: Last Lecture
Vector Data
Classification
Decision Tree; Naïve
Bayes; L
CS249: ADVANCED DATA MINING
Text Data: Topic Models
Instructor: Yizhou Sun
May 8, 2017
Methods to Learn
Vector Data
Classification
Decision Tree; Naïve
Bayes; Logistic
Regression
SVM;
CS249: ADVANCED DATA MINING
Clustering Evaluation and Practical Issues
Instructor: Yizhou Sun
May 2, 2017
Announcements
Homework 2 due later today
Due May 3rd (11:59pm)
Course pro
CS249: ADVANCED DATA MINING
Decision Trees, Regression Trees, and
Random Forest
Instructor: Yizhou Sun
April 12, 2017
Announcements
Course Project
Team formation due today
Homewor
CS249: ADVANCED DATA MINING
Probabilistic Classifiers and Naïve Bayes
Instructor: Yizhou Sun
April 24, 2017
Announcements
Homework 1
Due end of the day of this Friday (11:59pm)
Rem
CS249: ADVANCED DATA MINING
1: Introduction
Instructor: Yizhou Sun
(Instructor for Today's class: Ting Chen)
April 9, 2017
Course Information
Course homepage:
http://web.cs.ucla.edu/~
CS249: ADVANCED DATA MINING
Text Data: Topic Models
Instructor: Yizhou Sun
May 10, 2017
Announcements
Course project proposal
Due May 8th (11:59pm)
Homework 3 out
Due May 10th (1
CS249: ADVANCED DATA MINING
Support Vector Machine and Neural
Network
Instructor: Yizhou Sun
April 24, 2017
Announcements
Homework 1
Due end of the day of this Friday (11:59pm)
Re
CS249: ADVANCED DATA MINING
Vector Data: Clustering: Part I
Instructor: Yizhou Sun
April 26, 2017
Methods to Learn
Vector Data
Classification
Decision Tree; Naïve
Bayes; Logistic
Regr
CS249: ADVANCED DATA MINING
Text Data: Word Embedding
Instructor: Yizhou Sun
May 10, 2017
Announcements
Homework 3 due today
Due May 10th (11:59pm)
Midterm Exam
In class May 15th
CS249: ADVANCED DATA MINING
Recommender Systems
Instructor: Yizhou Sun
May 17, 2017
Methods Learnt: Last Lecture
Vector Data
Classification
Decision Tree; Naïve
Bayes; Logistic
Regres
CS249: ADVANCED DATA MINING
Linear Regression, Logistic Regression, and
GLMs
Instructor: Yizhou Sun
April 24, 2017
About WWW2017 Conference
Turing Award Winner
Sir Tim Berners-Lee
CS249: ADVANCED DATA MINING
Graph and Network
Instructor: Yizhou Sun
May 31, 2017
Methods Learnt
Vector Data
Classification
Decision Tree; Naïve
Bayes; Logistic
Regression
SVM; NN
Clu
CS249: ADVANCED DATA MINING
Recommender Systems II
Instructor: Yizhou Sun
May 22, 2017
Recommender Systems
Recommendation via Information Network
Analysis
Hybrid Collaborative Filt
CS249: ADVANCED DATA MINING
Vector Data: Clustering: Part II
Instructor: Yizhou Sun
May 3, 2017
Methods to Learn: Last Lecture
Vector Data
Classification
Decision Tree; Naïve
Bayes; L