1
CS 570 Data Mining Classification and Prediction 3
Cengiz Gunay
Partial slide credits: Li Xiong, Han, Kamber, and Pan, Tan,Steinbach, Kumar
1
Classification and Prediction 3 Cengiz Gunay Partial slide credits: - - PowerPoint PPT Presentation
CS 570 Data Mining Classification and Prediction 3 Cengiz Gunay Partial slide credits: Li Xiong, Han, Kamber, and Pan, Tan,Steinbach, Kumar 1 1 Collaborative Filtering Examples Movielens: movies Moviecritic: movies again My launch:
1
Partial slide credits: Li Xiong, Han, Kamber, and Pan, Tan,Steinbach, Kumar
1
February 12, 2008 Data Mining: Concepts and Techniques
Movielens: movies Moviecritic: movies again My launch: music Gustos starrater: web pages Jester: Jokes TV Recommender: TV shows Suggest 1.0 : different products
2
February 12, 2008 Data Mining: Concepts and Techniques 3
3
February 12, 2008 Data Mining: Concepts and Techniques 4
4
February 12, 2008 Li Xiong 5
w1=∑
i=1 ∣D∣
( xi−̄ x)( y i−̄ y)
i=1 ∣D∣
( xi−̄ x)
2
5
February 12, 2008 Li Xiong 7
7
February 12, 2008 Data Mining: Concepts and Techniques 8
8
February 12, 2008 Li Xiong 9
9
Subset selection Lasso is defined Using a small t forces some coefficients to 0 Explains the model with fewer variables Ref: Hastie, Tibshirani, Friedman. The
February 12, 2008 Data Mining: Concepts and Techniques 11
11
February 12, 2008 Data Mining: Concepts and Techniques 12
February 12, 2008 Data Mining: Concepts and Techniques 13
13
February 12, 2008 Data Mining: Concepts and Techniques 14
14
Find linear separation in input space 15
February 12, 2008 Data Mining: Concepts and Techniques 16
16
Inspired by the
Formalized by
February 12, 2008 Data Mining: Concepts and Techniques 18
For Example y=sign(∑
i=0 n
wi xi+μk)
18
February 12, 2008 Data Mining: Concepts and Techniques 19
Input: {(x(1), y(1)), …} Output: classification function f(x) f(x(i)) > 0 for y(i) = +1 f(x(i)) < 0 for y(i) = -1 f(x) => uses inner product w x + b = 0
Learning updates w :
Learning updates w :
19
February 12, 2008 Data Mining: Concepts and Techniques 23
23
February 12, 2008 Data Mining: Concepts and Techniques 24
24
February 12, 2008 Data Mining: Concepts and Techniques 25
25
February 12, 2008 Data Mining: Concepts and Techniques 26
26
February 12, 2008 Data Mining: Concepts and Techniques 27
27
February 12, 2008 Data Mining: Concepts and Techniques 28
i
−I j
k
28
February 12, 2008 Data Mining: Concepts and Techniques 29
29
February 12, 2008 Data Mining: Concepts and Techniques 30
30
February 12, 2008 Data Mining: Concepts and Techniques 31
31
February 12, 2008 Data Mining: Concepts and Techniques 32
32
No local minima, but takes longer, must design problem well.
February 12, 2008 Data Mining: Concepts and Techniques 34
34
February 12, 2008 Data Mining: Concepts and Techniques 35
35
February 12, 2008 Data Mining: Concepts and Techniques 36 36