SLIDE 1
MOTIVATION
A discriminant approach for directly modeling the posterior probabilities π(x | θ) of the labels is logistic regression. For now, let’s focus on the binary case y ∈ {0, 1} and use empirical risk minimization. arg min
θ∈Θ
Remp(θ) = arg min
θ∈Θ
n
- i=1
L
- y(i), π
- x(i) | θ
- .
A naive approach would be to model
π(x | θ) = θT x.
NB: We will often suppress the intercept in notation. Obviously this could result in predicted probabilities π(x | θ) ∈ [0, 1].
c
- Introduction to Machine Learning – 1 / 8