Transferability vs. Discriminability: Batch Spectral Penalization - - PowerPoint PPT Presentation

transferability vs discriminability
SMART_READER_LITE
LIVE PREVIEW

Transferability vs. Discriminability: Batch Spectral Penalization - - PowerPoint PPT Presentation

Transferability vs. Discriminability: Batch Spectral Penalization for Adversarial Domain Adaptation Xinyang Chen, Sinan Wang, Mingsheng Long, Jianmin Wang School of Software BNRist, Research Center for Big Data Tsinghua University


slide-1
SLIDE 1

Transferability vs. Discriminability:

Batch Spectral Penalization for Adversarial Domain Adaptation

Xinyang Chen, Sinan Wang, Mingsheng Long, Jianmin Wang

School of Software BNRist, Research Center for Big Data Tsinghua University

International Conference on Machine Learning, 2019

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 1 / 9

slide-2
SLIDE 2

Motivation

Transfer Learning: Unsupervised Domain Adaptation

Non-IID distributions P = Q Only unlabeled data in target domain Model Model Representation

P(x,y)≠Q(x,y)

2D Renderings Real Images Source Domain Target Domain

f :x → y f :x → y

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 2 / 9

slide-3
SLIDE 3

Motivation

Adversarial Domain Adaption

Matching distributions across source and target domains s.t. P ≈ Q Adversarial adaptation: learning features indistinguishable across domains min

F,G E(F, G) + γdisP↔Q(F, D)

max

D

disP↔Q(F, D), (1) We analysis features extracted by DANN1 with a ResNet-502 pretrained on Imagenet E(F, G) = E(xs

i ,ys i )∼PL(G(F(xs

i )), ys i )

distP↔Q(F, D) = Exs

i ∼P log[D(fs

i )]

+ Ext

i ∼Q log[1 − D(ft

i )]

(2)

1Ganin et al. Unsupervised domain adaptation by backpropagation. ICML ’15. 2He et al. Deep residual learning for image recognition. CVPR ’15.

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 3 / 9

slide-4
SLIDE 4

Motivation

Discriminability of Feature Representations

Two key criteria that characterize the goodness of feature representations Transferability: the ability of feature to bridge the discrepancy across domains Discriminability: the easiness of separating different categories by a supervised classifier trained over the feature representations Discriminability of features extracted by DANN, worse discriminability is found:

ResNet-50 DANN

10 20 30 40 50

max J(W) A to W W to A A to D D to A

(a) max J(W )

Source Target Average

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35

Error rate

ResNet-50 DANN

(b) Classification error rate

Figure: Analysis of discriminability of feature: (a) LDA, (b) MLP.

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 4 / 9

slide-5
SLIDE 5

Method

Why Discriminability Is Weakened?

Corresponding Angles: corresponding angle is the angle between two eigenvectors corresponding to the same singular value index, which are equally important in their feature matrices.

2 4 6 8 10

Index

0.4 0.5 0.6 0.7 0.8 0.9 1.0

singular value singular values

ResNet_source ResNet_target DANN_source DANN_target

(a) σ

2 4 6 8 10

Index

0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60

Corresponding angle corresponding angles ResNet DANN

(b) cos(ψ)

2 4 6 8 10

Index

0.4 0.5 0.6 0.7 0.8 0.9 1.0

Corresponding angle corresponding angles ResNet DANN

(c) cos(ψ)

Figure: SVD analysis. We compute (a) the singular values (normalized version); (b) corresponding angles (unnormalized version); (c) corresponding angles (normalized version). In normalized version we scale all values so that the largest value is 1.

Only the eigenvectors with largest singular values tend to carry transferable knowledge

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 5 / 9

slide-6
SLIDE 6

Method

BSP: Batch Spectral Penalization

F x

cross- entropy L

BSP Lbsp

SVD f SVD

binary cross- entropy GRL

! " # $ D G

batch batch

BSP combined with DANN to strengthen discriminability of feature min

F,G E(F, G) + γdisP↔Q(F, D) + βLbsp(F)

max

D

disP↔Q(F, D), (3) Lbsp(F) =

k

  • i=1

(σ2

s,i + σ2 t,i),

(4)

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 6 / 9

slide-7
SLIDE 7

Experiments

Results

Table: Accuracy (%) on Office-31 for unsupervised domain adaptation

Method A → W D → W W → D A → D D → A W → A Avg ResNet-50 68.4 ± 0.2 96.7 ± 0.1 99.3 ± 0.1 68.9 ± 0.2 62.5 ± 0.3 60.7 ± 0.3 76.1 DAN 80.5 ± 0.4 97.1 ± 0.2 99.6 ± 0.1 78.6 ± 0.2 63.6 ± 0.3 62.8 ± 0.2 80.4 DANN 82.0 ± 0.4 96.9 ± 0.2 99.1 ± 0.1 79.7 ± 0.4 68.2 ± 0.4 67.4 ± 0.5 82.2 JAN 85.4 ± 0.3 97.4 ± 0.2 99.8 ± 0.2 84.7 ± 0.3 68.6 ± 0.3 70.0 ± 0.4 84.3 GTA 89.5 ± 0.5 97.9 ± 0.3 99.8 ± 0.4 87.7 ± 0.5 72.8 ± 0.3 71.4 ± 0.4 86.5 CDAN 93.1 ± 0.2 98.2 ± 0.2 100.0 ± 0.0 89.8 ± 0.3 70.1 ± 0.4 68.0 ± 0.4 86.6 CDAN+E 94.1 ± 0.1 98.6 ± 0.1 100.0 ± 0.0 92.9 ± 0.2 71.0 ± 0.3 69.3 ± 0.3 87.7 BSP+DANN (Proposed) 93.0 ± 0.2 98.0 ± 0.2 100.0 ± 0.0 90.0 ± 0.4 71.9 ± 0.3 73.0 ± 0.3 87.7 BSP+CDAN (Proposed) 93.3 ± 0.2 98.2 ± 0.2 100.0 ± 0.0 93.0 ± 0.2 73.6 ± 0.3 72.6 ± 0.3 88.5

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 7 / 9

slide-8
SLIDE 8

Experiments

Analysis

2 4 6 8 10

Index

0.4 0.5 0.6 0.7 0.8 0.9 1.0

singular value singular values

ResNet_source ResNet_target DANN_source DANN_target BSP+DANN_source BSP+DANN_target

(a) σ

2 4 6 8 10

Index

0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60

Corresponding angle corresponding angles ResNet DANN BSP+DANN

(b) cos(ψ)

2 4 6 8 10

Index

0.4 0.5 0.6 0.7 0.8 0.9 1.0

Corresponding angle corresponding angles ResNet DANN BSP+DANN

(c) cos(ψ)

Source Target Average

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35

Error rate

ResNet-50 DANN BSP+DANN

(d) Classification error

A->W D->W

1.0 1.2 1.4 1.6 1.8 2.0

A-distance

ResNet-50 DANN BSP+DANN

(e) A-distance

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 8 / 9

slide-9
SLIDE 9

Summary

Thanks!

Poster: tonight at Pacific Ballroom #256

  • X. Chen et al. (Tsinghua Univ.)

BSP: Batch Spectral Penalization June 12, 2019 9 / 9