Dual Variational Generation for Data-Limited Face Analysis
Yibo Hu
VALSE Webinar
JD AI Research Previously, CRIPAC, CASIA
https://aberhu.github.io/
09/23/2020
Data-Limited Face Analysis Yibo Hu JD AI Research Previously, - - PowerPoint PPT Presentation
VALSE Webinar Dual Variational Generation for Data-Limited Face Analysis Yibo Hu JD AI Research Previously, CRIPAC, CASIA https://aberhu.github.io/ 09/23/2020 Face Analysis Face analysis contains a wide range of tasks with various
VALSE Webinar
JD AI Research Previously, CRIPAC, CASIA
https://aberhu.github.io/
09/23/2020
VALSE Webinar
Face Recognition Face Segmentation Face Anti-spoofing Face 3D Reconstruction Facial Landmark Detection Facial Makeup Transfer Facial Editing
VALSE Webinar
VALSE Webinar
Latent: Z Data: X
GAN VAE
Auto- regressive (PixelRNN, PixelCNN) Invertible Flows (NICE, RealNVP, GLOW)
Others
VALSE Webinar
For a generative model, we want to perform:
π π¨|π¦ = π π¦ π¨ π(π¨) π(π¦) π π¦ = ΰΆ± π π¦ π¨ π π¨ ππ¨
intractable to compute p(x) and p(z|x)
VALSE Webinar
Reconstruction Quality π Approximation Error π
VALSE Webinar
compared with Auto-regressive models.
architecture.
likelihood computation with nice theory support.
VALSE Webinar
VALSE Webinar
Face Recognition
Intra-lass distance < inter-class distance Leaning both robust and discriminativefeatures
[SphereFace, CVPR2017]
VALSE Webinar
Heterogeneous Face Recognition
(a) NIR-VIS (b) Thermal-VIS (c) Sketch-Photo (d) ID-Camera (e) Profile-Frontal Photo
From: CASIA NIR-VIS 2.0 Tufts Face IIITD-Sketch NJU-ID MultiPIE
VALSE Webinar
MS-Celeb-1M:100K IDs with about 10M images
Tufts Face Database:
Insufficient Heterogeneous Data
CASIA NIR-VIS 2.0: 725 IDs with about 18K images + fine-tune w/ heterogeneous data + large-scale VIS data Tufts Face: 113 IDs with about 10K images
How to tackle the challenges ?
Collect as Much Data as Possible Reduce Domain Discrepancy
VALSE Webinar
Input Synthesized Conditional Synthesis
Generator
Only synthesize one target image with same attributes
Input Synthesized
Generator
VALSE Webinar
Input Synthesized Conditional Synthesis
Generator
Input Synthesized
Generator Which identity ?
VALSE Webinar
Generate Pairs of New Images from Noise
πΉπ π¦π
π¨π π¨π ππ ππ ππ ππ
πΉπ πΈπ
πΊππ
Pairwise Identity Preserving
π¦π
Reconstructed
πΈπ
Reconstructed
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
VALSE Webinar πΉπ π¦π
π¨π π¨π ππ ππ ππ ππ
πΉπ π¦π πΈπ½
π¨π½
Concat
πΊππ
Pairwise Identity Preserving Reconstructed
Generate Pairs of New Images from Noise
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
VALSE Webinar
Generate Pairs of New Images from Noise
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
π¨π π¨π πΈπ½
Generated
Standard Gaussian Noise
z
πΊππ
Pairwise Identity Preserving
Variance
VALSE Webinar πΉπ π¦π
π¨π π¨π ππ ππ ππ ππ
πΉπ π¦π πΈπ½
π¨π½
Concat
πΊππ
Pairwise Identity Preserving Reconstructed
Generate Pairs of New Images from Noise
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
VALSE Webinar πΉπ π¦π
π¨π π¨π
Distribution Alignment
ππ ππ ππ ππ
πΉπ πΈπ½
π¨π½
Concat
πΊππ
Pairwise Identity Preserving
π¦π
Reconstructed
Generate Pairs of New Images from Noise
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
VALSE Webinar
Generate Pairs of New Images from Noise
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
πΈπ½
Copy
Standard Gaussian Noise
ΖΈ π¨π½
HFR Net Domain Gap Reduction
z
Generated
VALSE Webinar πΈπ½
Copy
Standard Gaussian Noise
ΖΈ π¨π½
HFR Net Domain Gap Reduction
z
Generated
Training Stage Testing Stage
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
VALSE Webinar
Generate Pairs of New Images from Noise
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
New images Different poses intra-class diversity Paired images with same identity
VALSE Webinar
Experimental Results
100K seem enough.
VALSE Webinar
Experimental Results NIR-VIS NIR-VIS
Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. NeurIPS 2019
Thermal-VIS Sketch-Photo
VALSE Webinar
New images
Two Challenges of DVG:
Tufts Face:
*MS: Mean Similarity *MIS: Mean Instance Similarity
paired training data
VALSE Webinar
How to increase inter-class diversity?
100,000 identities included in the MS-Celeb-1M dataset:
VALSE Webinar
πΉπ
ππ ππ
πΉπ πΉπ πΈ Pairwise ID Preserving
ππ ππ π
π
πΊ π» πΉπ
Domain- specific attribute encoders LightCNN Identity sampler Decoder
πΈ πΊ πΊ
π‘
(a) trainingwith paired heterogeneousdata
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.
VALSE Webinar
ππ
πΈ Pairwise ID Preserving π»
ππ ππ
πΉπ πΉπ
ππ π
π
πΊ
(b) trainingwith unpaired VIS data
πΉπ πΉπ
Domain- specific attribute encoders LightCNN Identity sampler Decoder
πΈ πΊ πΊ
π‘
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.
VALSE Webinar (b) trainingwith unpaired VIS data
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.
πΊ
π‘
ID sampling
α π
π
ππ
πΈ Pairwise ID Preserving π»
ππ ππ
πΉπ πΉπ
ππ
πΊ πΉπ πΉπ
Domain- specific attribute encoders LightCNN Identity sampler Decoder
πΈ πΊ πΊ
π‘
kl loss
π
π
πΊ
π‘
πΉ
π
π
reconstruction loss
VALSE Webinar
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.
πΊ
π‘
ID sampling
πΈ π»
(c) sampling after training
sampling sampling
VALSE Webinar
Qualitative Results Diversity Measurement
Tufts Face:
*MS: Mean Similarity *MIS: Mean Instance Similarity
VALSE Webinar
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.
Contrastive Learning: 1) The generated paired heterogeneousimages are regarded as positive pairs 2) The generated images from different samplings are regarded as negative pairs
VALSE Webinar
Quantitative Experiments
VALSE Webinar
VALSE Webinar
a) Semantic, Instance, panoptic segmentation b) Face parsing c) Human parsing Pixel-level Semantic Understanding Appearance Structure Consistency
VALSE Webinar
Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing. ACM MM 2020 .
Appearance Structure Structure Concatenate Hadamard product
(a) Training pipeline of D2VG
VALSE Webinar
Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing. ACM MM 2020 .
(b) Synthesizing large-scale samples for parsing
VALSE Webinar
VALSE Webinar
perform sampling, inference, estimation and likelihood computation with nice theory support.
to boost some data-limited face tasks.
body or natural image β¦β¦
VALSE Webinar
Email: huyibo871079699@gmail.com