Data-Limited Face Analysis Yibo Hu JD AI Research Previously, - - PowerPoint PPT Presentation

β–Ά
data limited face analysis
SMART_READER_LITE
LIVE PREVIEW

Data-Limited Face Analysis Yibo Hu JD AI Research Previously, - - PowerPoint PPT Presentation

VALSE Webinar Dual Variational Generation for Data-Limited Face Analysis Yibo Hu JD AI Research Previously, CRIPAC, CASIA https://aberhu.github.io/ 09/23/2020 Face Analysis Face analysis contains a wide range of tasks with various


slide-1
SLIDE 1

Dual Variational Generation for Data-Limited Face Analysis

Yibo Hu

VALSE Webinar

JD AI Research Previously, CRIPAC, CASIA

https://aberhu.github.io/

09/23/2020

slide-2
SLIDE 2

VALSE Webinar

Face Analysis

Face Recognition Face Segmentation Face Anti-spoofing Face 3D Reconstruction Facial Landmark Detection Facial Makeup Transfer Facial Editing

Face analysis contains a wide range of tasks with various applications.

slide-3
SLIDE 3

VALSE Webinar

Contents

2 Dual Variational Generation for HFR 3 Application on Face Parsing 1 Variational Autoencoders 4 Summary

slide-4
SLIDE 4

VALSE Webinar

Generative Model

Latent: Z Data: X

Generative Models

What are generative models?

Deep Generative Models

GAN VAE

Auto- regressive (PixelRNN, PixelCNN) Invertible Flows (NICE, RealNVP, GLOW)

Others

VAE

slide-5
SLIDE 5

VALSE Webinar

Variational Autoencoders

Motivation

For a generative model, we want to perform:

  • Sampling: new samples from prior distribution p(x|z)
  • Inference: expressive representation from visible data p(z|x)
  • Estimation: find Q from a class of possible modes to best describe

an unknown true distribution P

  • Point-wise likelihood evaluation: calculate p(x)

π‘ž 𝑨|𝑦 = π‘ž 𝑦 𝑨 π‘ž(𝑨) π‘ž(𝑦) π‘ž 𝑦 = ΰΆ± π‘ž 𝑦 𝑨 π‘ž 𝑨 𝑒𝑨

intractable to compute p(x) and p(z|x)

slide-6
SLIDE 6

VALSE Webinar

Variational Autoencoders

Theoretical derivation

Reconstruction Quality 𝐒 Approximation Error 𝐁

slide-7
SLIDE 7

VALSE Webinar

Variational Autoencoders

Advantages of VAEs

  • There is a clear and recognized way to evaluate the quality of

the model (log-likelihood). β€”β€”by Yoshua Bengio

  • Stable training compared with GANs while efficient sampling

compared with Auto-regressive models.

  • Very straightforward to extend to a wide range of model

architecture.

  • Capable to perform sampling, inference, estimation and

likelihood computation with nice theory support.

slide-8
SLIDE 8

VALSE Webinar

Contents

2 Dual Variational Generation for HFR 3 Application on Face Parsing 1 Variational Autoencoders 4 Summary

slide-9
SLIDE 9

VALSE Webinar

What is Heterogeneous Face Recognition (HFR)?

Face Recognition

Intra-lass distance < inter-class distance Leaning both robust and discriminativefeatures

[SphereFace, CVPR2017]

slide-10
SLIDE 10

VALSE Webinar

What is Heterogeneous Face Recognition (HFR)?

Heterogeneous Face Recognition

(a) NIR-VIS (b) Thermal-VIS (c) Sketch-Photo (d) ID-Camera (e) Profile-Frontal Photo

From: CASIA NIR-VIS 2.0 Tufts Face IIITD-Sketch NJU-ID MultiPIE

slide-11
SLIDE 11

VALSE Webinar

MS-Celeb-1M:100K IDs with about 10M images

What are the challenges of HFR?

Large Domain Discrepancy

Tufts Face Database:

Insufficient Heterogeneous Data

CASIA NIR-VIS 2.0: 725 IDs with about 18K images + fine-tune w/ heterogeneous data + large-scale VIS data Tufts Face: 113 IDs with about 10K images

How to tackle the challenges ?

Collect as Much Data as Possible Reduce Domain Discrepancy

slide-12
SLIDE 12

VALSE Webinar

How to tackle the challenges?

Reduce Domain Discrepancy: Recognition via Generation

Input Synthesized Conditional Synthesis

Generator

Limitations:

Only synthesize one target image with same attributes

  • Diversity

Input Synthesized

Generator

slide-13
SLIDE 13

VALSE Webinar

How to tackle the challenges?

Reduce Domain Discrepancy: Recognition via Generation

Input Synthesized Conditional Synthesis

Generator

Limitations:

  • Diversity
  • Identity Preserving

Input Synthesized

Generator Which identity ?

slide-14
SLIDE 14

VALSE Webinar

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

𝐹𝑂 𝑦𝑂

𝑨𝑂 π‘¨π‘Š πœˆπ‘‚ πœπ‘‚ πœˆπ‘Š πœπ‘Š

πΉπ‘Š 𝐸𝑂

πΊπ‘—π‘ž

Pairwise Identity Preserving

π‘¦π‘Š

Reconstructed

πΈπ‘Š

Reconstructed

Framework Evolution v0.0.1

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

slide-15
SLIDE 15

VALSE Webinar 𝐹𝑂 𝑦𝑂

𝑨𝑂 π‘¨π‘Š πœˆπ‘‚ πœπ‘‚ πœˆπ‘Š πœπ‘Š

πΉπ‘Š π‘¦π‘Š 𝐸𝐽

𝑨𝐽

Concat

πΊπ‘—π‘ž

Pairwise Identity Preserving Reconstructed

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

Framework Evolution v0.0.2

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

slide-16
SLIDE 16

VALSE Webinar

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

Framework Evolution v0.0.2

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

𝑨𝑂 π‘¨π‘Š 𝐸𝐽

Generated

Standard Gaussian Noise

z

πΊπ‘—π‘ž

Pairwise Identity Preserving

Variance

slide-17
SLIDE 17

VALSE Webinar 𝐹𝑂 𝑦𝑂

𝑨𝑂 π‘¨π‘Š πœˆπ‘‚ πœπ‘‚ πœˆπ‘Š πœπ‘Š

πΉπ‘Š π‘¦π‘Š 𝐸𝐽

𝑨𝐽

Concat

πΊπ‘—π‘ž

Pairwise Identity Preserving Reconstructed

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

Framework Evolution v0.0.2

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

slide-18
SLIDE 18

VALSE Webinar 𝐹𝑂 𝑦𝑂

𝑨𝑂 π‘¨π‘Š

Distribution Alignment

πœˆπ‘‚ πœπ‘‚ πœˆπ‘Š πœπ‘Š

πΉπ‘Š 𝐸𝐽

𝑨𝐽

Concat

πΊπ‘—π‘ž

Pairwise Identity Preserving

π‘¦π‘Š

Reconstructed

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

Framework Evolution v0.1.0

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

slide-19
SLIDE 19

VALSE Webinar

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

Framework Evolution v0.1.0

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

𝐸𝐽

Copy

Standard Gaussian Noise

ΖΈ 𝑨𝐽

HFR Net Domain Gap Reduction

z

Generated

slide-20
SLIDE 20

VALSE Webinar 𝐸𝐽

Copy

Standard Gaussian Noise

ΖΈ 𝑨𝐽

HFR Net Domain Gap Reduction

z

Generated

Dual Variational Generation

Training Stage Testing Stage

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

slide-21
SLIDE 21

VALSE Webinar

Dual Variational Generation

Generate Pairs of New Images from Noise

  • Abundantdiversity
  • Identity consistency of paired images

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

New images Different poses intra-class diversity Paired images with same identity

slide-22
SLIDE 22

VALSE Webinar

Dual Variational Generation

Experimental Results

  • Significant improvements on the low shot datasets
  • Verification rates increase as more generated images are used. 10K-

100K seem enough.

slide-23
SLIDE 23

VALSE Webinar

Dual Variational Generation

Experimental Results NIR-VIS NIR-VIS

Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019

Thermal-VIS Sketch-Photo

slide-24
SLIDE 24

VALSE Webinar

Dual Variational Generation

New images

Two Challenges of DVG:

Tufts Face:

*MS: Mean Similarity *MIS: Mean Instance Similarity

  • Limited inter-class diversity due to the small number of

paired training data

  • The way to use the generated data
slide-25
SLIDE 25

VALSE Webinar

Dual Variational Generation

How to increase inter-class diversity?

100,000 identities included in the MS-Celeb-1M dataset:

slide-26
SLIDE 26

VALSE Webinar

πΉπ‘Š

πœπ‘Š πœˆπ‘Š

πΉπ‘Š 𝐹𝑂 𝐸 Pairwise ID Preserving

πœˆπ‘‚ πœπ‘‚ 𝑔

π‘Š

𝐺 𝐻 𝐹𝑂

Domain- specific attribute encoders LightCNN Identity sampler Decoder

𝐸 𝐺 𝐺

𝑑

Dual Variational Generation

Framework Evolution v0.1.1

(a) trainingwith paired heterogeneousdata

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.

slide-27
SLIDE 27

VALSE Webinar

πœˆπ‘‚

𝐸 Pairwise ID Preserving 𝐻

πœπ‘Š πœˆπ‘Š

πΉπ‘Š 𝐹𝑂

πœπ‘‚ 𝑔

π‘Š

𝐺

(b) trainingwith unpaired VIS data

πΉπ‘Š 𝐹𝑂

Domain- specific attribute encoders LightCNN Identity sampler Decoder

𝐸 𝐺 𝐺

𝑑

Dual Variational Generation

Framework Evolution v0.1.2

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.

slide-28
SLIDE 28

VALSE Webinar (b) trainingwith unpaired VIS data

Dual Variational Generation

Framework Evolution v0.2.0

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.

𝐺

𝑑

ID sampling

αˆ™ 𝑔

π‘Š

πœˆπ‘‚

𝐸 Pairwise ID Preserving 𝐻

πœπ‘Š πœˆπ‘Š

πΉπ‘Š 𝐹𝑂

πœπ‘‚

𝐺 πΉπ‘Š 𝐹𝑂

Domain- specific attribute encoders LightCNN Identity sampler Decoder

𝐸 𝐺 𝐺

𝑑

kl loss

𝑔

π‘Š

𝐺

𝑑

𝐹

𝑔

π‘Š

reconstruction loss

slide-29
SLIDE 29

VALSE Webinar

Dual Variational Generation

Framework Evolution v0.2.0

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.

𝐺

𝑑

ID sampling

𝐸 𝐻

(c) sampling after training

sampling sampling

slide-30
SLIDE 30

VALSE Webinar

Dual Variational Generation

Qualitative Results Diversity Measurement

Tufts Face:

*MS: Mean Similarity *MIS: Mean Instance Similarity

slide-31
SLIDE 31

VALSE Webinar

Dual Variational Generation

How to make better use of the generated data?

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.

Contrastive Learning: 1) The generated paired heterogeneousimages are regarded as positive pairs 2) The generated images from different samplings are regarded as negative pairs

slide-32
SLIDE 32

VALSE Webinar

Quantitative Experiments

Dual Variational Generation

slide-33
SLIDE 33

VALSE Webinar

Contents

2 Dual Variational Generation for HFR 3 Application on Face Parsing 1 Variational Autoencoders 4 Summary

slide-34
SLIDE 34

VALSE Webinar

Application on Face Parsing

a) Semantic, Instance, panoptic segmentation b) Face parsing c) Human parsing Pixel-level Semantic Understanding Appearance Structure Consistency

slide-35
SLIDE 35

VALSE Webinar

Application on Face Parsing

Dual-Structure Disentangling Variational Generation (D2VG)

Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing. ACM MM 2020 .

Appearance Structure Structure Concatenate Hadamard product

(a) Training pipeline of D2VG

slide-36
SLIDE 36

VALSE Webinar

Application on Face Parsing

Dual-Structure Disentangling Variational Generation (D2VG)

Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing. ACM MM 2020 .

(b) Synthesizing large-scale samples for parsing

slide-37
SLIDE 37

VALSE Webinar

Contents

2 Dual Variational Generation for HFR 3 Application on Face Parsing 1 Variational Autoencoders 4 Summary

slide-38
SLIDE 38

VALSE Webinar

Summary

  • VAE is one of popular generative models that is capable to

perform sampling, inference, estimation and likelihood computation with nice theory support.

  • Dual Variational Generation (DVG) is a general framework to

model joint probability distribution of two modalities.

  • The generated samples from DVG can be treated as extra data

to boost some data-limited face tasks.

  • Future work: other face tasks, more modalities, more diverse,

body or natural image ……

slide-39
SLIDE 39

VALSE Webinar

Thanks for Your Attention!!!

Yibo Hu

Email: huyibo871079699@gmail.com