Statistical Machine Learning
Lecture 07: Clustering and Evaluation
Kristian Kersting TU Darmstadt
Summer Term 2020
- K. Kersting based on Slides from J. Peters· Statistical Machine Learning· Summer Term 2020
1 / 38
Statistical Machine Learning Lecture 07: Clustering and Evaluation - - PowerPoint PPT Presentation
Statistical Machine Learning Lecture 07: Clustering and Evaluation Kristian Kersting TU Darmstadt Summer Term 2020 K. Kersting based on Slides from J. Peters Statistical Machine Learning Summer Term 2020 1 / 38 Todays Objectives Make
1 / 38
2 / 38
3 / 38
4 / 38
1 2 3 4 5 6 40 50 60 70 80 90 100 1 2 3 4 5 6 40 50 60 70 80 90 100
5 / 38
6 / 38
7 / 38
8 / 38
9 / 38
10 / 38
11 / 38
Region of interest Center of mass Mean Shift vector
Objective : Find the densest region
12 / 38
13 / 38
14 / 38
15 / 38
[DARPA Neural Network Study (1988-89), AFCEA International Press]
16 / 38
3 6 9 12 Polynomial Degree n 0.0 0.1 0.3 Mean Squared Error 1
2N N i
(yi (xi) )2 Optimal Polynomial Degree 5 Error on Test Set Training Error Test Error 3 6 9 12 Polynomial Degree n 0.0 0.1 0.3 Mean Squared Error 1
2N N i
(yi (xi) )2 Optimal Polynomial Degree 5
Underfitting
Error on Test Set Training Error Test Error 3 6 9 12 Polynomial Degree n 0.0 0.1 0.3 Mean Squared Error 1
2N N i
(yi (xi) )2 Optimal Polynomial Degree 5
Underfitting Overfitting
Error on Test Set Training Error Test Error 3 6 9 12 Polynomial Degree n 0.0 0.1 0.3 Mean Squared Error 1
2N N i
(yi (xi) )2 Optimal Polynomial Degree 5
Underfitting Overfitting About right
Error on Test Set Training Error Test Error
17 / 38
18 / 38 William of Ockham (1285-1347)
19 / 38
20 / 38
21 / 38
22 / 38
23 / 38
N
N
24 / 38
25 / 38
26 / 38
f (xq) = ED,ǫ
fD(x)
ǫ + ED
ǫ + bias2
fD(xq)}
fD(xq)}
27 / 38
28 / 38
5 10 15 20 Polynomial Degree n 10
6
10
5
10
4
10
3
10
2
10
1
Bias², Variance, and their Sum Optimal n * = 5 Precision 10
6
Error due to Bias and Variance
Bias² Variance Bias²+Variance
29 / 38
Training Set Validation Set Test Set
30 / 38
31 / 38
Training Set Training Set Training Set Training Set Training Set Training Set Validation Set Validation Set Validation Set
32 / 38
3 6 9 12 Polynomial Degree n 0.0 0.1 0.3 0.4 Mean Squared Error 1
2N N i
(yi (xi) )2
Error: Test Set vs Crossvalidation
Training Error Test Error 4-fold Crossvalidation Error Optimal Polynomial Degree (Test Error) Optimal Polynomial Degree (CV Error)
3 6 9 12 Polynomial Degree n 0.0 0.1 0.3 0.4 Mean Squared Error 1
2N N i
(yi (xi) )2
Error: Test Set vs Crossvalidation
Training Error Test Error Leave One Out Crossvalidation Error Optimal Polynomial Degree (Test Error) Optimal Polynomial Degree (CV Error)
33 / 38
[https://devpost.com]
34 / 38
35 / 38
36 / 38
37 / 38
38 / 38