Intro Logistic Regression Gradient Descent + SGD
Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade March 29, 2016
1
Logistic Regression Gradient Descent + SGD Machine Learning for Big - - PowerPoint PPT Presentation
Case Study 1: Estimating Click Probabilities Intro Logistic Regression Gradient Descent + SGD Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade March 29, 2016 1 Ad Placement Strategies Companies bid on
1
– Naively: – But: – Instead:
2
3
– Batch: – Online:
boosting,…)
– Focus on logistic regression; captures main concepts, ideas generalize to other approaches
4
5
Logistic function (or Sigmoid):
Learn P(Y|X) directly
Assume a particular functional form Sigmoid applied to a linear function
Z
6
1
7
8
9
10
– e.g., Conjugate gradient ascent much better (see reading)
11
Gradient: Step size, >0 Update rule:
12
(t) (t)
13
14
(t) (t)
– Negative second derivative bounded away from zero:
15
16
17
(t)
– User enters query, and ad must be selected:
– User either clicks or doesn’t click on ad:
– Google gets a reward if user clicks on ad
– Weights must be updated for next time:
18
– Given dataset:
– Loss function, e.g., hinge loss, logistic loss,… – We often minimize loss in training data:
19
20
– Unbiased estimate of gradient – Very noisy! – Called stochastic gradient ascent (or descent)
– VERY useful in practice!!!
21
– Want to find maximum
– Get a sample data point xt – Update parameters:
22
– Online setting:
23
– (see Nemirovski et al ‘09 from readings) – Let f be a strongly convex stochastic function – Assume gradient of f is Lipschitz continuous and bounded – Then, for step sizes: – The expected loss decreases as O(1/t):
24
– If func is strongly convex: O(ln(1/ϵ)) iterations
– If func is strongly convex: O(1/ϵ) iterations
– Total running time, e.g., for logistic regression:
– See readings for more details
25
– Estimate probability of clicking – Can be modeled as logistic regression
– Convergence rates and stopping criterion
– Convergence rates of SGD
26