Self-supervised Label Augmentation via Input Transformations
Hankook Lee, Sung Ju Hwang, Jinwoo Shin Korea Advanced Institute of Science and Technology (KAIST) International Conference on Machine Learning (ICML 2020)
- 2020. 06. 15.
Self-supervised Label Augmentation via Input Transformations - - PowerPoint PPT Presentation
Self-supervised Label Augmentation via Input Transformations Hankook Lee, Sung Ju Hwang, Jinwoo Shin Korea Advanced Institute of Science and Technology (KAIST) International Conference on Machine Learning (ICML 2020) 2020. 06. 15. Outline
Hankook Lee, Sung Ju Hwang, Jinwoo Shin Korea Advanced Institute of Science and Technology (KAIST) International Conference on Machine Learning (ICML 2020)
Self-supervised Learning
Self-supervised Label Augmentation (SLA)
Experiments
2
Self-supervised Learning
Self-supervised Label Augmentation (SLA)
Experiments
3
Self-supervised learning approaches
Transformation-based self-supervision
4
Input Neural Network
5
Permutation Patch Sampling Predict patch location Predict permutation
[Doersch et al., 2015] Unsupervised visual representation learning by context prediction, ICCV 2015 [Noroozi and Favaro, 2016] Unsupervised learning of visual representations by solving jigsaw puzzles, ECCV 2016
6
Remove Colors Rotation Predict RGB values Predict rotation degree
[Larsson et al., 2017] Colorization as a proxy task for visual understanding, CVPR 2017 [Gidaris et al., 2018] Unsupervised representation learning by predicting image rotations, ICLR 2018
7 S4L [Zhai et al., 2019] SSGAN [Chen et al., 2019]
[Zhai et al., 2019] S4L: Self-supervised semi-supervised learning [Berthelot et al., 2020] Remixmatch: Semi-supervised learning with distribution matching and augmentation anchoring, ICLR 2020 [Hendrycks et al., 2019] Using self-supervised learning can improve model robustness and uncertainty, NeurIPS 2019 [Chen et al., 2019] Self-supervised gans via auxiliary rotation loss, CVPR 2019
and optimize their objectives simultaneously
8 Original Head Self-supervision Head
Dog or Cat ? 0° or 90°?
and optimize their objectives simultaneously
9
Q) How can we effectively utilize the self-supervision for fully-supervised classification tasks?
Self-supervised Learning
Self-supervised Label Augmentation (SLA)
Experiments
10
11
Dog or Cat ?
Original
Not depending on
12
Dog or Cat ? 0° or 90°?
Original Self-supervision
Depending on
13
Dog or Cat ? 0° or 90°?
Original Self-supervision This enforces invariance to transformations ⇒ more difficult optimization
14
Learning discriminability from transformations ⇒ Self-supervised learning (SSL) Learning invariance to transformations ⇒ Data augmentation (DA)
Baseline: Data Augmentation: Multi-task Learning: Notation
15
Learning discriminability from transformations ⇒ Self-supervised learning (SSL) Learning invariance to transformations ⇒ Data augmentation (DA)
Learning invariance to rotations degrades performance!
16
Learning discriminability from transformations ⇒ Self-supervised learning (SSL) Learning invariance to transformations ⇒ Data augmentation (DA)
[Cubuk et al., 2019] Autoaugment: Learning augmentation strategies from data, CVPR 2019 [Chen et al., 2020] A simple framework for contrastive learning of visual representations, 2020
17 Joint-label Head
(Dog, 0°), (Dog, 90°), (Cat, 0°), or (Cat, 90°)?
18
(Dog, 0°), (Dog, 90°), (Cat, 0°), or (Cat, 90°)? Joint-label Original labels Self-supervised labels
Self-supervised Label Augmentation (SLA)
19
Original labels Self-supervised labels
20
Joint-label
Original Self-supervision
Original Data Augmentation (DA) Multi-task Learning (MT) Self-supervised Label Augmentation (SLA, ours)
21
Joint-label
.05 .90 .05 0° 90° 180° 270° Cat Dog
22
Joint-label
.10 .03 .85 .02 0° 90° 180° 270° Cat Dog
23
Joint-label
.05 .05 .90 0° 90° 180° 270° Cat Dog
24
Joint-label
.10 .05 .05 .80 0° 90° 180° 270° Cat Dog
25
(Dog, 0°)
(Dog, 90°) (Dog, 180°) (Dog, 270°)
Aggregated Score where
26
where
27
(Dog, 0°)
(Dog, 90°) (Dog, 180°) (Dog, 270°)
Aggregated Score
28
(Dog, 0°)
(Dog, 90°) (Dog, 180°) (Dog, 270°)
Aggregated Score Self-distillation
Additional Head Distillation term Classification term
Same network
Self-supervised Learning
Self-supervised Label Augmentation (SLA)
Experiments
29
30 0° 180° 90° 270° RGB GRB RBG GBR BRG BGR
31
augmentation techniques
32
33
These show that SLA can be easily combined with existing approaches in various classification tasks!
using self-supervised transformations
scenarios including few-shot and imbalanced settings
directions for future research
34
[Chen et al., 2020] A simple framework for contrastive learning of visual representations, 2020
35
hankook.lee @ kaist.ac.kr