Final review
LING572 Advanced Statistical Methods for NLP March 12, 2020
1
Final review LING572 Advanced Statistical Methods for NLP March - - PowerPoint PPT Presentation
Final review LING572 Advanced Statistical Methods for NLP March 12, 2020 1 Topics covered Supervised learning: eight algorithms kNN, NB: training and decoding DT: training and decoding (with binary features) MaxEnt: training
LING572 Advanced Statistical Methods for NLP March 12, 2020
1
2
3
4
– Tune hyperparameters on the dev data – Error analysis – Conclusion
5
6
7
instances
8
9
10
11
12
➔ numerical optimization
13
14
15
Naïve Bayes MaxEnt SVM Modeling Maximize P(X,Y|θ) Maximize P(Y|X, θ) Maximize the minimal margin Training Learn P(c) and P(f|c) Learn λi for feature function Learn αi for each (xi, yi) Decoding Calc P(y) P(x | y) Calc P(y | x) Calc f(x) Things to decide Features Delta for smoothing Features Regularization Training algorithm Kernel function Regularization Training algorithm C for penalty
16
17
18
19
20
21
22
= ∈
N i Y y i j i j p
1
23
24
complex NN
25
26