Deep Transfer Learning for Visual Analysis Yu-Chiang Frank Wang, - PowerPoint PPT Presentation

Deep Transfer Learning for Visual Analysis Yu-Chiang Frank Wang, Associate Professor Dept. Electrical Engineering, National Taiwan University Taipei, Taiwan 2018/5/19 2 nd AII Workshop

Trends of Deep Learning 2

Transfer Learning: What, When, and Why? (cont’d) • A practical example https://techcrunch.com/2017/02/08/udacity-open-sources-its-self-driving-car-simulator-for-anyone-to-use/ https://googleblog.blogspot.tw/2014/04/the-latest-chapter-for-self-driving-car.html 3

Recent Research Focuses on Transfer Learning • CVPR 2018 Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation • AAAI 2018 Order-Free RNN with Visual Attention for Multi-Label Classification • CVPR 2018 Multi-Label Zero-Shot Learning with Structured Knowledge Graphs • CVPRW 2018 Unsupervised Deep Transfer Learning for Person Re-Identification 4

Detach & Adapt – Beyond Image Style Transfer • Faceapp – Putting a smile on your face! • Deep learning for representation disentanglement • Interpretable deep feature representation Input Mr. Takeshi Kaneshiro 5

Detach & Adapt – Beyond Image Style Transfer • Cross-domain image synthesis, manipulation & translation With supervision w/o supervision Transfer Disentangle Disentangle smile smile from from Photo Cartoon Y.-C. F. Wang et al. , Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation, CVPR 2018 6

Detach & Adapt – Beyond Image Style Transfer • Cross-domain image synthesis, manipulation & translation [CVPR’18] With supervision attribute W/o supervision Y.-C. F. Wang et al. , Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation, CVPR 2018 7

Example Results • Face • Photo & Sketch Conditional Unsupervised Image Translation • w/o Label supervision w/o Label supervision Unpaired Y.-C. F. Wang et al. , Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation, CVPR 2018 8

Comparisons Cross-Domain Image Translation Representation Disentanglement Unpaired Multi- Joint Interpretability of Bi-direction Unsupervised Training Data domains Representation disentangled factor X X X X Pix2pix O X O X CycleGAN Cannot disentangle image representation O O O X StarGAN O X O O UNIT O X X O DTN O X infoGAN Cannot translate images across domains X O AC-GAN O O O O O Partially CDRD (Ours) 9

Multi-Label Classification for Image Analysis • Prediction of multiple object labels from an image • Learning across image and semantics domains • No object detectors available • Desirable if be able to exploit label co-occurrence info Labels: Person Table Sofa Chair TV Lights Carpet … 11

DNN for Multi-Label Classification • Canonical-Correlated Autoencoder (C2AE) [Wang et al., AAAI 2017] • Unique integration of autoencoder & deep canonical correlation analysis (DCCA) • Autoencoder: label embedding + label recovery + label co-occurrence • DCCA: joint feature & label embedding • Can handle missing labels during learning feature space label space Clouds Clouds Lake Lake Ocean Ocean label space Latent space Water Water Sky Sky Sun Sun Sunset Sunset Y.-C. F. Wang et al. , Learning Deep Latent Spaces for Multi-Label Classification, AAAI 2017 12

Order-Free RNN with Visual Attention for Multi-Label Classification [AAAI’18] • Visual Attention for MLC [Wang et al., AAAI’18] Y.-C. F. Wang et al. , Order-Free RNN with Visual Attention for Multi-Label Classification, AAAI 2018 13

Order-Free RNN with Visual Attention for Multi-Label Classification • Experiments • NUS-WIDE: 269,648 images with 81 labels • MS-COCO: 82,783 images with 80 labels • Quantitative Evaluation MS-COCO NUS-WIDE Y.-C. F. Wang et al. , Order-Free RNN with Visual Attention for Multi-Label Classification, AAAI 2018 14

Order-Free RNN with Visual Attention for Multi-Label Classification • Qualitative Evaluation Example images in MS-COCO with the associated attention maps Incorrect predictions with reasonable visual attention Y.-C. F. Wang et al. , Order-Free RNN with Visual Attention for Multi-Label Classification, AAAI 2018 15

Multi-Label Zero-Shot Learning with Structured Knowledge Graphs [CVPR’18] • Utilizing structured knowledge graphs for modeling label dependency 16

• Our Proposed Network 17

• Our Proposed Network 18

Order-Free RNN with Visual Attention for Multi-Label Classification • Experiments • NUS-WIDE: 269,648 images with 1000 labels • MS-COCO: 82,783 images with 80 labels • Quantitative Evaluation • ML vs. ML-ZSL vs. Generalized ML-ZSL 19

Introduction: Person re-identification Camera #1 Camera #3 Camera #2 Camera #4 Person re-identification task: the system needs to match appearances of a person of interest across non-overlapping cameras. 21

Adaptation & Re-ID Network Latent Space Target Dataset 𝐽 𝑢 Latent Encoder Latent Decoder 𝐹 8 % 𝑓 2 $ % 𝑌 ℒ 5677 ℒ +:( + 𝑌 𝑢 % 𝑓 ( 𝐸 0 w/o labels 𝐹 9 𝐹 0 ℒ (%+& $ & ℒ +:( 𝑌 Source Dataset 𝐽 s & 𝑓 ( 𝑌 𝑡 ℒ 5677 + & 𝑓 2 𝐹 - w/ labels 𝐷 - ℒ ()*&& Classifier 22

Testing Scenario 23

Comparisons with Recent Re-ID Methods 24

Recent Research Focuses on Transfer Learning • AAAI 2018 Order-Free RNN with Visual Attention for Multi-Label Classification • CVPR 2018 Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation • CVPR 2018 Multi-Label Zero-Shot Learning with Structured Knowledge Graphs • CVPRW 2018 Unsupervised Deep Transfer Learning for Person Re-Identification 25

Other Ongoing Research Topics • Take a Deep Look from a Single Image • Single-Image 3D Object Model Prediction • Completing Videos from a Deep Glimpse 26

3D Shape Estimation from A Single 2D Image • Recovering Shape from a Single Image • Supervised Setting • Input image and its ground truth 3D voxel available for training 27

3D Shape Estimation from A Single 2D Image • Recovering Shape from a Single Image • Semi-Supervised Setting • Input image and its ground truth 2D mask available for training 28

3D Shape Estimation from A Single 2D Image • Example Results 29

3D Shape Estimation from A Single 2D Image • Example Results Chair pose pose 30

Recent Research Focuses • Take a Deep Look from a Single Image • Single-Image 3D Object Model Prediction • Completing Videos from a Deep Glimpse 31

What’s Video Completion? 32

From Video Synthesis to Completion • Our Proposed Network • Variational autoencoder, recurrent neural nets, and GAN Input: non-consecutive frames of interest Input Output Output: video sequence (more than one possible output) Input Synthesized Real . . . . or Three Stages in Learning Fake Input Real 1. Learning frame-based representation Temporal Temporal . . . . 2. Learning video-based representation Encoder Generator 3. Learning video representation Stochastic & Recurrent Conditional-GAN (SR-cGAN) conditioned on input anchor frames 33

Video Synthesis KTH Shape Motion MUG 34

Video Completion – Example Results Shape Motion Output (Synthesized Video) Input (Anchor Frames) GIF 6 7 11 12 14 15 6 7 11 14 15 12 KTH Output (Synthesized Video) Input (Anchor Frames) GIF 2 3 7 9 12 14 2 3 7 9 12 14 35

Video Completion - Stochasticity Output (Synthesized Video) Input (Anchor Frames) GIF 3 5 8 12 13 14 3 5 8 12 13 14 Different Motion 36

Video Interpolation & Prediction • Interpolation • Input: 2 anchor frames • • fixed on t=1 and 8 • Output 8 frames • Prediction • Input: • 6 anchor frames • Fixed on t=1~6 • Output 16 frames 37

Summary • Deep Transfer Learning for Visual Analysis • Multi-Label Classification for Image Analysis • Detach and Adapt – Beyond Image Style Transfer • Single-Image 3D Object Model Prediction • Completing Videos from a Deep Glimpse Person Table Sofa Chair TV Lights Carpet … 38

For More Information… • Vision and Learning Lab at NTUEE (http://vllab.ee.ntu.edu.tw/) 39

Thank You! 40

Deep Transfer Learning for Visual Analysis Yu-Chiang Frank Wang, - PowerPoint PPT Presentation

Deep Transfer Learning for Visual Analysis Yu-Chiang Frank Wang, Associate Professor Dept. Electrical Engineering, National Taiwan University Taipei, Taiwan 2018/5/19 2 nd AII Workshop Trends of Deep Learning 2 Transfer Learning: What, When,

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Industrial Transfer Learning Introduction to Industrial Transfer Learning Industrial Transfer

Radiative Transfer Radiative Transfer Radiative transfer is a branch of atmospheric physics. We

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

CSI5180. MachineLearningfor BioinformaticsApplications Deep learning encoding and transfer

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Recap by Milo Davies, SAS NZ POWERFUL ADAPTIVE OPEN UNIFIED SAS Visual Analytics SAS Visual

Transfer United: Partnerships to Foster Transfer Student Success Tuesday, November 5 th

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Tasks & Memory Management October 30, 2007 Contents Task creation Address space

http://www.mpi-forum.org/ This work was performed under the auspices of the U.S.

3D effects of edge magnetic field configuration on divertor/SOL transport and optimization

Lecture 2.3: Equivalence and implication Matthew Macauley Department of Mathematical Sciences

Lecture 02: Project Management, Cost Estimation 2015-04-27 Prof. Dr. Andreas Podelski, Dr. Bernd

Introduction to materials modelling Lecture 2 - Decomposition of stress, geometric interpretation

Modelling of turbulent flows: RANS and LES Turbulenzmodelle in der Str omungsmechanik: RANS und

Sludge treatment: dewatering CTB3365x Introduc1on to water treatment

Deep Transfer Learning for Visual Analysis Yu-Chiang Frank Wang, - PowerPoint PPT Presentation

Deep Transfer Learning for Visual Analysis Yu-Chiang Frank Wang, Associate Professor Dept. Electrical Engineering, National Taiwan University Taipei, Taiwan 2018/5/19 2 nd AII Workshop Trends of Deep Learning 2 Transfer Learning: What, When,

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Industrial Transfer Learning Introduction to Industrial Transfer Learning Industrial Transfer

Radiative Transfer Radiative Transfer Radiative transfer is a branch of atmospheric physics. We

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

CSI5180. MachineLearningfor BioinformaticsApplications Deep learning encoding and transfer

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Recap by Milo Davies, SAS NZ POWERFUL ADAPTIVE OPEN UNIFIED SAS Visual Analytics SAS Visual

Transfer United: Partnerships to Foster Transfer Student Success Tuesday, November 5 th

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Tasks &amp; Memory Management October 30, 2007 Contents Task creation Address space

http://www.mpi-forum.org/ This work was performed under the auspices of the U.S.

3D effects of edge magnetic field configuration on divertor/SOL transport and optimization

Lecture 2.3: Equivalence and implication Matthew Macauley Department of Mathematical Sciences

Lecture 02: Project Management, Cost Estimation 2015-04-27 Prof. Dr. Andreas Podelski, Dr. Bernd

Introduction to materials modelling Lecture 2 - Decomposition of stress, geometric interpretation

Modelling of turbulent flows: RANS and LES Turbulenzmodelle in der Str omungsmechanik: RANS und

Sludge treatment: dewatering CTB3365x Introduc1on to water treatment

Tasks & Memory Management October 30, 2007 Contents Task creation Address space