SLIDE 1
Motivation
Linear regression/classification: ◮ Dataset with n entries: xi ∈ Rd, yi ∈ R, i = 1, . . . , n ◮ Prediction of y as a linear model x⊤β ◮ Minimization of a penalized cost function: (∀β ∈ Rd) F(β) = 1 n
n
- i=1
ℓ(yi, x⊤
i β) + λR(β)
Examples of loss/regularizers: ◮ Quadratic loss: ℓ(y, x) = 1
2(x − y)2
◮ Logistic loss: ℓ(y, x) = log(1 + exp(−yx)) ◮ Ridge penalty R(β) = 1
2β2
◮ Lasso penalty R(β) = β1
: