DENSELY CONNECTED CONVOLUTIONAL NETWORKS
Gao Huang*, Zhuang Liu*, Laurens van der Maaten, Kilian Q. Weinberger
CVPR 2017
Cornell University Tsinghua University Facebook AI Research
DENSELY CONNECTED CONVOLUTIONAL NETWORKS Gao Huang*, Zhuang Liu*, - - PowerPoint PPT Presentation
Best paper award DENSELY CONNECTED CONVOLUTIONAL NETWORKS Gao Huang*, Zhuang Liu*, Laurens van der Maaten, Kilian Q. Weinberger Cornell University Tsinghua University Facebook AI Research CVPR 2017 CONVOLUTIONAL NETWORKS LeNet AlexNet VGG
Cornell University Tsinghua University Facebook AI Research
Deep residual learning for image recognition: [He, Zhang, Ren, Sun] (CVPR 2015)
C C C C
C
k channels k channels k channels k channels
C C C C
Batc ReL Con
Batch Norm ReLU Convolution (3x3)
Batch Norm ReLU Convolution (3x3) Batch Norm ReLU Convolution (1x1)
Convolution Pooling Convolution Pooling Convolution Pooling Linear
Dense Block 1 Dense Block 2 Dense Block 3
Output
Deeply supervised Net: [Lee, Xie, Gallagher, Zhang, Tu] (2015)
C C lXk
k
C
r e l a t e d f e a t u r e s
k: Growth rate
k<<C
D i v e r s i fi e d f e a t u r e s
Input Output Input Output
hl hl
Standard Connectivity:
Dense Connectivity:
C C C C
Test Error (%) 2.0 3.0 4.0 5.0 6.0 7.0 8.0 9.0 10.0 11.0 12.0
3.6 4.5 4.62 6.41
4.2
ResNet (110 Layers, 1.7 M) ResNet (1001 Layers, 10.2 M) DenseNet (100 Layers, 0.8 M) DenseNet (250 Layers, 15.3 M) Previous SOTA 2.0 3.0 4.0 5.0 6.0 7.0 8.0 9.0 10.0 11.0 12.0 5.2 5.9 10.56 11.26 7.3 Previous SOTA
With data augmentation Without data augmentation
Test Error (%) 10.0 15.0 20.0 25.0 30.0 35.0
17.6 22.3 22.71 27.22
20.5
ResNet (110 Layers, 1.7 M) ResNet (1001 Layers, 10.2 M) DenseNet (100 Layers, 0.8 M) DenseNet (250 Layers, 15.3 M) Previous SOTA 10.0 15.0 20.0 25.0 30.0 35.0 19.6 24.2 33.47 35.58 28.2 Previous SOTA
With data augmentation Without data augmentation
Top-1 error (%) 20.0 22.0 24.0 26.0 28.0 GFLOPs 3 10 16 23 29
DenseNet ResNet
Top-1 error (%) 20.0 22.0 24.0 26.0 28.0 # Parameters (M) 20 40 60 80
DenseNet ResNet
ResNet-152 ResNet-101 ResNet-50 ResNet-34 ResNet-152 ResNet-101 ResNet-50 ResNet-34 DenseNet-264(k=48) DenseNet-264 DenseNet-201 DenseNet-169 DenseNet-121 DenseNet-121 DenseNet-169 DenseNet-201 DenseNet-264 DenseNet-264(k=48)
Classifier 4 Classifier 2 Classifier 3 Classifier 1
Classifier 2 Classifier 3 Classifier 1 cat: 0.2 0.2 ≱ threshold cat: 0.4 0.4 ≱ threshold cat: 0.6 0.6 > threshold
Multi-Scale DenseNet: [Huang, Chen, Li, Wu, van der Maaten, Weinberger] (arXiv Preprint: 1703.09844)
Classifier 4 Classifier 2 Classifier 3 Classifier 1
Memory efficient Torch implementation: https://github.com/liuzhuang13/DenseNet
Our Caffe Implementation Our memory-efficient Caffe Implementation. Our memory-efficient PyTorch Implementation. PyTorch Implementation by Andreas Veit. PyTorch Implementation by Brandon Amos. MXNet Implementation by Nicatio. MXNet Implementation (supports ImageNet) by Xiong Lin. Tensorflow Implementation by Yixuan Li. Tensorflow Implementation by Laurent Mazare. Tensorflow Implementation (with BC structure) by Illarion Khlestov. Lasagne Implementation by Jan Schlüter. Keras Implementation by tdeboissiere. Keras Implementation by Roberto de Moura Estevão Filho. Keras Implementation (with BC structure) by Somshubra Majumdar. Chainer Implementation by Toshinori Hanya. Chainer Implementation by Yasunori Kudo.
Other implementations:
preprint arXiv:1703.09844 (2017)
1707.06990 (2017)