Norm matters: efficient and accurate normalization schemes in deep networks
1
Elad Hoffer*, Ron Banner*, Itay Golan*, Daniel Soudry
*Equal contribution Spotlight , NeurIPS 2018
Norm Matters - Poster #27
Norm matters: efficient and accurate normalization schemes in deep - - PowerPoint PPT Presentation
Norm matters: efficient and accurate normalization schemes in deep networks Elad Hoffer*, Ron Banner*, Itay Golan*, Daniel Soudry Spotlight , NeurIPS 2018 Norm Matters - Poster #27 1 *Equal contribution Batch normalization Shortcomings:
1
*Equal contribution Spotlight , NeurIPS 2018
Norm Matters - Poster #27
2) , numerically unstable.
2
Norm Matters - Poster #27
𝑥 𝑥
3
Norm Matters - Poster #27
4
Norm Matters - Poster #27
With WD Without WD Without WD + LR correction
5
Resnet 50, ImageNet
Weight normalization, for a channel 𝑗:
Bounded Weight Normalization:
𝜍 - constant determined from chosen initialization
Norm Matters - Poster #27
6
2
1
1 𝑜 𝑦− 𝑦 2
Norm Matters - Poster #27
7
Norm Matters - Poster #27
8
Norm Matters - Poster #27
Regular BN in FP16 fails L1 BN in FP16 works as well as L2 in FP32
9
Norm Matters - Poster #27
8 bit Full Precision
10
Norm Matters - Poster #27