AMMI – Introduction to Deep Learning 6.4. Batch normalization
Fran¸ cois Fleuret https://fleuret.org/ammi-2018/ Sun Sep 30 10:42:14 CAT 2018
ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE
AMMI Introduction to Deep Learning 6.4. Batch normalization Fran - - PowerPoint PPT Presentation
AMMI Introduction to Deep Learning 6.4. Batch normalization Fran cois Fleuret https://fleuret.org/ammi-2018/ Sun Sep 30 10:42:14 CAT 2018 COLE POLYTECHNIQUE FDRALE DE LAUSANNE We saw that maintaining proper statistics of the
ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 1 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 2 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 2 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 3 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 3 / 15
B
B
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 4 / 15
B
B
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 4 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 5 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 5 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 6 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 7 / 15
B
B
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 8 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 9 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 9 / 15
B
B
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 9 / 15
B
B
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 9 / 15
B
B
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 9 / 15
5M 10M 15M 20M 25M 30M 0.4 0.5 0.6 0.7 0.8 Inception BN−Baseline BN−x5 BN−x30 BN−x5−Sigmoid Steps to match Inception
Figure 2: Single crop validation accuracy of Inception and its batch-normalized variants, vs. the number of training steps. Model Steps to 72.2% Max accuracy Inception 31.0 · 106 72.2% BN-Baseline 13.3 · 106 72.7% BN-x5 2.1 · 106 73.0% BN-x30 2.7 · 106 74.8% BN-x5-Sigmoid 69.8% Figure 3: For Inception and the batch-normalized variants, the number of training steps required to reach the maximum accuracy of Inception (72.2%), and the maximum accuracy achieved by the net- work.
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 10 / 15
5M 10M 15M 20M 25M 30M 0.4 0.5 0.6 0.7 0.8 Inception BN−Baseline BN−x5 BN−x30 BN−x5−Sigmoid Steps to match Inception
Figure 2: Single crop validation accuracy of Inception and its batch-normalized variants, vs. the number of training steps. Model Steps to 72.2% Max accuracy Inception 31.0 · 106 72.2% BN-Baseline 13.3 · 106 72.7% BN-x5 2.1 · 106 73.0% BN-x30 2.7 · 106 74.8% BN-x5-Sigmoid 69.8% Figure 3: For Inception and the batch-normalized variants, the number of training steps required to reach the maximum accuracy of Inception (72.2%), and the maximum accuracy achieved by the net- work.
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 10 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 11 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 12 / 15
Linear BN ReLU
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 13 / 15
Linear BN ReLU
Linear ReLU BN
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 13 / 15
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 14 / 15
D
D
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 15 / 15
D
D
Fran¸ cois Fleuret AMMI – Introduction to Deep Learning / 6.4. Batch normalization 15 / 15