SLIDE 41 The mathematics of the PAC-Bayes Theory PAC-Bayes bounds and algorithms References Specialization to Linear classifiers Majority votes of weak classifiers Answer # 1: go back to linear classifier specialization Answer # 2: PAC-Bayes on a general loss function The algorithm MinCq Conclusion
From the C-bound to the MinCq learning algorithm
Our first attempts to minimize the C-bound has confronted us to two problems. Problem 1: an empirical C-bound minimization without any regularization tends to overfit the training data. Problem 2: most of the time, the distributions Q minimizing the C-bound C
S
Q are such that both MS
Q and MS Q2 are very close to 0.
Since C
S
Q = 1 − MS
Q/MS Q2 , this gives a 0/0 numerical instability.
Moreover, since MD
Q /MD Q2 can only be empirically estimated by
MS
Q/MS Q2, Problem 2, therefore, amplifies Problem 1. Fran¸ cois Laviolette, Laboratoire du GRAAL, Universit´ e Laval, Qu´ ebec, Canada La th´ eorie PAC-Bayes en apprentissage supervis´ e