SLIDE 23 Individual Result: Medium Image Shift on MNIST
101 102 103 104 Number of samples from test 0.0 0.2 0.4 0.6 0.8 1.0 p-value
(a) Test w/ 10%.
101 102 103 104 Number of samples from test 0.0 0.2 0.4 0.6 0.8 1.0 p-value
(b) Test w/ 50%.
101 102 103 104 Number of samples from test 0.0 0.2 0.4 0.6 0.8 1.0 p-value
NoRed PCA SRP UAE TAE BBSDs BBSDh Classif
(c) Test w/ 100%. (d) Top different.
101 102 103 104 Number of samples from test 0.90 0.95 1.00 Accuracy
(e) Acc. w/ 10%.
101 102 103 104 Number of samples from test 0.4 0.6 0.8 1.0 Accuracy
(f) Acc. w/ 50%.
101 102 103 104 Number of samples from test 0.2 0.4 0.6 0.8 1.0 Accuracy
p q Classif
(g) Acc. w/ 100%. (h) Top similar.
Failing Loudly: Detecting Dataset Shift 23