CS345a: Data Mining Jure Leskovec and Anand Rajaraman
Stanford University
Would like to do prediction: learn a function y = f(x)
Where y can be:
Real: Regression
Categorical: Classification
2/23/2010 Jure Leskovec & Anand Rajaraman, Stanford CS345a: Data Mining 2
$d(x_i, x_j) = (x_{i1} - x_{j1})^2 + (x_{i2} - x_{j2})^2$
$d(x_i, x_j) = (x_{i1} - x_{j1})^2 + (3x_{i2} - 3x_{j2})^2$
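The two distances above differ only in that the second coordinate is multiplied by 3 before differencing. A minimal sketch of why that matters for neighbor search follows; the points `q`, `p1`, `p2` and the helper `sq_dist` are made up for illustration and are not from the slides.

```python
def sq_dist(a, b, scale=(1.0, 1.0)):
    """Squared Euclidean distance after multiplying each coordinate by its scale."""
    return sum((s * (ai - bi)) ** 2 for s, ai, bi in zip(scale, a, b))

q = (0.0, 0.0)    # hypothetical query point
p1 = (2.0, 0.0)   # differs from q only in the first coordinate
p2 = (0.0, 1.0)   # differs from q only in the second coordinate
points = (("p1", p1), ("p2", p2))

# First formula: both coordinates weighted equally.
unscaled = {name: sq_dist(q, p) for name, p in points}
# Second formula: second coordinate scaled by 3.
scaled = {name: sq_dist(q, p, scale=(1.0, 3.0)) for name, p in points}

nearest_unscaled = min(unscaled, key=unscaled.get)
nearest_scaled = min(scaled, key=scaled.get)
```

With these particular points the nearest neighbor of `q` changes from `p2` to `p1` once the second coordinate is rescaled, so the choice of feature scaling is part of the distance definition.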
[Figure panels: k=9; K=10, K=20, K=80]
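Assuming the panels compare nearest-neighbor predictions for different neighborhood sizes, the effect of k can be sketched with a majority-vote classifier. The function `knn_predict` and the toy training set are illustrative assumptions, not material from the slides.

```python
from collections import Counter

def knn_predict(train, query, k):
    """Return the majority label among the k training points nearest to query.

    train is a list of ((x1, x2), label) pairs; distance is squared Euclidean.
    """
    by_dist = sorted(
        train,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], query)),
    )
    votes = Counter(label for _, label in by_dist[:k])
    return votes.most_common(1)[0][0]

# Hypothetical data: one "a" point sits among "b" points, far from the other "a" points.
train = [((0.0, 0.0), "a"), ((0.1, 0.0), "b"), ((0.2, 0.0), "b"),
         ((1.0, 1.0), "a"), ((1.1, 1.0), "a")]
query = (0.0, 0.0)

pred_k1 = knn_predict(train, query, k=1)
pred_k3 = knn_predict(train, query, k=3)
pred_k5 = knn_predict(train, query, k=5)
```

On this toy data the prediction flips as k grows: a small k follows the single closest point, while a larger k averages over a wider neighborhood.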
if C intersects with the ball of radius r around q, add C to the stack
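The pruning rule above can be sketched as a stack-based search over a tree of cells. This is only an illustration under stated assumptions: cells are taken to be axis-aligned boxes, the node layout (dicts with `lo`, `hi`, and either `children` or `points`) is hypothetical, and the radius r is held fixed.

```python
def box_intersects_ball(lo, hi, q, r):
    """True if the axis-aligned box [lo, hi] intersects the ball of radius r around q."""
    d2 = 0.0
    for l, h, x in zip(lo, hi, q):
        nearest = min(max(x, l), h)   # clamp q's coordinate into the box
        d2 += (x - nearest) ** 2
    return d2 <= r * r

def search(root, q, r):
    """Collect all points within distance r of q, visiting only cells that can contain them."""
    found, stack = [], [root]
    while stack:
        c = stack.pop()
        if "points" in c:   # leaf cell: check its points directly
            found += [p for p in c["points"]
                      if sum((a - b) ** 2 for a, b in zip(p, q)) <= r * r]
        else:               # inner cell: push only children that intersect the ball
            for child in c["children"]:
                if box_intersects_ball(child["lo"], child["hi"], q, r):
                    stack.append(child)
    return found

# Hypothetical two-leaf tree.
left = {"lo": (0.0, 0.0), "hi": (1.0, 1.0), "points": [(0.5, 0.5), (0.9, 0.9)]}
right = {"lo": (2.0, 0.0), "hi": (3.0, 1.0), "points": [(2.5, 0.5)]}
root = {"lo": (0.0, 0.0), "hi": (3.0, 1.0), "children": [left, right]}

hits = search(root, q=(0.6, 0.5), r=0.2)
```

Here the right cell is never visited because its closest point to q is farther than r. In a nearest-neighbor search, r would typically shrink as closer candidates are found; that refinement is omitted here.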
[Figure: rows A C G I (P1, P3); A B F G H J; D E J (P2, P4)]
[Figure: linear classifier for spam. Axes "viagra" and "nigeria"; decision boundary $w \cdot x = 0$ with normal vector $w$; Spam = 1, Ham = -1; labels x1, x2]
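A linear classifier with decision boundary w·x = 0 and labels Spam = 1, Ham = -1 can be sketched as follows. The weights, the word-count features, and the constant third feature (used here as a bias so the boundary need not pass through the origin of the count space) are illustrative assumptions.

```python
def predict(w, x):
    """Return +1 (spam) if w·x > 0, otherwise -1 (ham)."""
    score = sum(wi * xi for wi, xi in zip(w, x))
    return 1 if score > 0 else -1

# Hypothetical features: (count of "viagra", count of "nigeria", constant 1).
w = (1.0, 1.0, -0.5)

label_spam = predict(w, (1, 0, 1))   # one occurrence of "viagra"
label_ham = predict(w, (0, 0, 1))    # neither word appears
```

The sign of the score says which side of the hyperplane the message falls on; the vector w is normal to that hyperplane.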
[Figure: three weight vectors w1, w2, w3 and the regions where w1x, w2x, or w3x is biggest]
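The "biggest score wins" rule for several classes can be sketched as an argmax over one weight vector per class. The weight values and the helper `predict_class` are made up for illustration.

```python
def predict_class(weights, x):
    """Return the index c of the class whose score w_c·x is biggest."""
    scores = [sum(wi * xi for wi, xi in zip(w, x)) for w in weights]
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical weight vectors w1, w2, w3 (indices 0, 1, 2).
weights = [(1.0, 0.0), (0.0, 1.0), (-1.0, -1.0)]

c_a = predict_class(weights, (2.0, 0.5))    # scores 2.0, 0.5, -2.5
c_b = predict_class(weights, (0.1, 3.0))    # scores 0.1, 3.0, -3.1
c_c = predict_class(weights, (-1.0, -1.0))  # scores -1.0, -1.0, 2.0
```

Each weight vector carves out the region of inputs where its score dominates the others.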
[Bottou‐LeCun ‘04]
Method | Cost per iteration | Time to reach accuracy | Time for
GD | O(md) | O(md log(1/)) | O(d^2/ log^2(1/))
2nd order GD | O(d(d+m)) | O(md log log(1/)) | O(d^2/ log(1/) log log(1/))
Stochastic GD | O(d) | O(d/) | O(d/)
m… number of examples
d… number of features
… condition number
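The per-iteration costs in the first column, O(md) for GD and O(d) for stochastic GD, come from how many examples each step touches. A minimal sketch on a noise-free least-squares problem follows; the squared loss, the data, the step sizes, and the iteration counts are all assumptions chosen for illustration.

```python
import random

def grad_one(w, x, y):
    """Gradient of 0.5 * (w·x - y)^2 for a single example: O(d) work."""
    err = sum(wi * xi for wi, xi in zip(w, x)) - y
    return [err * xi for xi in x]

def gd_step(w, data, lr):
    """Full-batch step: averages the gradient over all m examples, O(md) work."""
    g = [0.0] * len(w)
    for x, y in data:
        for j, gj in enumerate(grad_one(w, x, y)):
            g[j] += gj / len(data)
    return [wi - lr * gi for wi, gi in zip(w, g)]

def sgd_step(w, data, lr, rng):
    """Stochastic step: uses one randomly chosen example, O(d) work."""
    x, y = rng.choice(data)
    return [wi - lr * gi for wi, gi in zip(w, grad_one(w, x, y))]

rng = random.Random(0)
# Hypothetical data from y = 2 + 3x, with a constant feature for the intercept.
data = [((1.0, x), 2.0 + 3.0 * x) for x in (0.0, 0.5, 1.0, 1.5, 2.0)]

w_gd = [0.0, 0.0]
for _ in range(2000):
    w_gd = gd_step(w_gd, data, lr=0.1)

w_sgd = [0.0, 0.0]
for _ in range(20000):
    w_sgd = sgd_step(w_sgd, data, lr=0.05, rng=rng)
```

Both runs approach the weights (2, 3); the stochastic version takes more steps, but each step is m times cheaper.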
[Figure: vectors $w_t$, $y x$, $w_{t+1}$, and $x$]
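The labels w_t, yx, and w_{t+1} are consistent with a perceptron-style update, in which a misclassified example (x, y) moves the weights by y·x. That reading is an assumption, and the sketch below, including the helper `perceptron_update` and the example values, is illustrative only.

```python
def perceptron_update(w, x, y):
    """If (x, y) is misclassified (y * w·x <= 0), return w + y*x; otherwise return w unchanged."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    if margin <= 0:
        return [wi + y * xi for wi, xi in zip(w, x)]
    return list(w)

w_t = [1.0, -1.0]
x, y = (0.0, 2.0), 1    # w_t·x = -2, so this positive example is misclassified

w_next = perceptron_update(w_t, x, y)       # moves w_t by y*x
w_after = perceptron_update(w_next, x, y)   # now classified correctly, no change
```

After one update the example lies on the correct side of the boundary, so a second pass leaves the weights alone.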
SVM with “hard” constraints
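For reference, the standard hard-margin SVM problem requires every training example to lie on the correct side of the boundary with margin at least 1. The notation below (bias term $b$, examples indexed $i = 1, \dots, m$, labels $y_i \in \{+1, -1\}$) is assumed rather than taken from the slide.

```latex
\min_{w,\,b} \ \frac{1}{2}\,\lVert w \rVert^{2}
\quad \text{subject to} \quad
y_i \,(w \cdot x_i + b) \ \ge\ 1, \qquad i = 1, \dots, m
```

The constraints are "hard" in the sense that no example is allowed to violate the margin, so the problem is feasible only when the data are linearly separable.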
What if we subsample the dataset?
[Shalev‐Shwartz et al. ICML ‘07]
$|A_t| = S$: Subgradient method
$|A_t| = 1$: Stochastic gradient
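The two extremes of the batch size |A_t| can be sketched with a mini-batch Pegasos-style update for a linear SVM. This is a simplified illustration, not the authors' implementation: it has no bias term, it omits the optional projection step, and the data, the regularization value `lam`, and the step count are made up.

```python
import random

def pegasos(data, lam, k, steps, seed=0):
    """Mini-batch Pegasos sketch. data: list of (x, y) with y in {+1, -1}; k = |A_t|."""
    rng = random.Random(seed)
    w = [0.0] * len(data[0][0])
    for t in range(1, steps + 1):
        # A_t: the whole set when k covers it, else k examples sampled without replacement
        batch = data if k >= len(data) else rng.sample(data, k)
        eta = 1.0 / (lam * t)   # step size 1 / (lambda * t)
        # Examples in A_t that violate the margin under the current w
        viol = [(x, y) for x, y in batch
                if y * sum(wi * xi for wi, xi in zip(w, x)) < 1]
        # Shrink w (regularization), then add the averaged hinge subgradient
        w = [(1 - eta * lam) * wi for wi in w]
        for x, y in viol:
            for j, xj in enumerate(x):
                w[j] += eta / len(batch) * y * xj
    return w

# Hypothetical linearly separable data.
data = [((2.0, 2.0), 1), ((3.0, 1.0), 1), ((2.0, 3.0), 1),
        ((-2.0, -1.0), -1), ((-1.0, -3.0), -1), ((-3.0, -2.0), -1)]

w_sgd = pegasos(data, lam=0.1, k=1, steps=1000)           # |A_t| = 1
w_sub = pegasos(data, lam=0.1, k=len(data), steps=1000)   # A_t is the full set

def correct(w):
    """True if every training example is on the correct side of w·x = 0."""
    return all(y * sum(wi * xi for wi, xi in zip(w, x)) > 0 for x, y in data)
```

With `k=1` each step costs one example, matching the stochastic gradient case; with `k=len(data)` every step scans the full set, matching the subgradient method. Intermediate values of `k` trade per-step cost against gradient noise.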
[Shalev‐Shwartz et al. ICML ‘07]