Deep Learning for Vision Presented by Kevin Matzen Wednesday, April 9, 14

Quick Intro - DNN
• Feed-forward
• Sparse connectivity (layer to layer)
• Different layer types
• Recently popularized for vision [Krizhevsky et al., NIPS 2012]

The Layers
• Convolution
• Fully connected
• Pooling
• Normalization
• Loss functions
• Image processing
• Neuron activation function
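As a rough illustration of how three of the layer types listed on this slide compose, the following pure-Python sketch chains a convolution, a ReLU activation, and max pooling on a toy image. The function names, the 2x2 kernel, and the pooling size are illustrative assumptions. They are not taken from the talk or from any particular library.

```python
def conv2d_valid(image, kernel):
    """'Valid' (no padding) 2D cross-correlation of one channel with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]


def relu(fmap):
    """Element-wise rectified linear unit, max(0, x)."""
    return [[max(0.0, v) for v in row] for row in fmap]


def max_pool(fmap, size=2):
    """Non-overlapping max pooling; rows/columns that do not fill a window are dropped."""
    return [[max(fmap[r + i][c + j] for i in range(size) for j in range(size))
             for c in range(0, len(fmap[0]) - size + 1, size)]
            for r in range(0, len(fmap) - size + 1, size)]


# Toy input: a 5x5 horizontal ramp and a kernel that responds to left-to-right increases.
image = [[c for c in range(5)] for _ in range(5)]
kernel = [[-1, 1],
          [-1, 1]]
features = max_pool(relu(conv2d_valid(image, kernel)))
```

Real networks stack many such stages with multiple input and output channels and learned kernels. The sketch shows only the data flow for a single channel.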

deeplearning.net/tutorial/lenet.html

[Krizhevsky, NIPS 2012]

Software
• code.google.com/p/cuda-convnet/ [nvidia gpu]
• github.com/UCB-ICSI-Vision-Group/decaf-release/ [deprecated; cpu-only]
• caffe.berkeleyvision.org [cpu; nvidia gpu]
• research.google.com/archive/large_deep_networks_nips2012.html [proprietary; distributed system]

DeepPose: Human Pose Estimation via Deep Neural Networks
Alexander Toshev, Christian Szegedy - CVPR 2014

DeepFace: Closing the Gap to Human-Level Performance in Face Verification
Yaniv Taigman, Ming Yang, Marc’Aurelio Ranzato, Lior Wolf - CVPR 2014

Input: Uncropped photo
Output: Joint locations

Pipeline
1. Person detection
2. Joint position regression
3. Joint refinement

Datasets
• Leeds Sports Pose (LSP) [Johnson et al., BMVC 2010]
  • 14 joint locations
  • 2000 images
  • Main person is about 150 px tall
• Frames Labeled in Cinema (FLIC) [Sapp et al., CVPR 2013]
  • 5003 images
  • Person detector run on every 10th frame of 30 movies
  • 20k candidates, labeled via mturk
  • 10 upper-body joints
• Image Parse [Ramanan, NIPS 2006]
  • 305 images
  • Similar to Leeds
  • Includes casual photos
• Buffy Stickmen
  • 748 frames

Person Detection
• Input: Uncropped image
• Output: Cropped image
• LSP dataset - No person detector
• FLIC dataset - Enlarged face detector

Main difference

Runtime
• 0.1 s per image on 12 cores (state of the art: 1.5 s, 4 s)
• Training stage 0 - 3 days
• Training refinement - 7 days each

Evaluation
• Percentage of Correct Parts (PCP)
  • A limb counts as correct if its predicted endpoints lie within 1/2 of the true limb length of the true endpoints
• Percentage of Detected Joints (PDJ)
  • A joint counts as detected if the predicted and true positions are within some fraction of the torso diameter
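The two metrics on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the evaluation code used in the paper: it assumes the strict PCP variant in which both endpoints must fall within the threshold, and the function names and the default PDJ fraction of 0.2 are hypothetical choices.

```python
import math


def limb_correct(pred_a, pred_b, true_a, true_b, factor=0.5):
    """Strict PCP test (assumed variant): both predicted endpoints must lie
    within factor * true limb length of their ground-truth endpoints."""
    limb_len = math.dist(true_a, true_b)
    return (math.dist(pred_a, true_a) <= factor * limb_len
            and math.dist(pred_b, true_b) <= factor * limb_len)


def pcp(pred_limbs, true_limbs, factor=0.5):
    """Fraction of limbs judged correct; each limb is a pair of (x, y) endpoints."""
    hits = sum(limb_correct(pa, pb, ta, tb, factor)
               for (pa, pb), (ta, tb) in zip(pred_limbs, true_limbs))
    return hits / len(true_limbs)


def pdj(pred_joints, true_joints, torso_diameter, fraction=0.2):
    """Fraction of joints whose prediction is within fraction * torso diameter."""
    hits = sum(math.dist(p, t) <= fraction * torso_diameter
               for p, t in zip(pred_joints, true_joints))
    return hits / len(true_joints)


# Two limbs of length 10: the first is off by 1 px at each end, the second by 6 px at one end.
true_limbs = [((0, 0), (10, 0)), ((0, 0), (0, 10))]
pred_limbs = [((1, 0), (11, 0)), ((0, 6), (0, 10))]
pcp_score = pcp(pred_limbs, true_limbs)

# Two joints with a torso diameter of 20 px, so the threshold is 4 px.
pdj_score = pdj([(3, 0), (10, 5)], [(0, 0), (10, 0)], torso_diameter=20)
```

Sweeping `fraction` over a range of values gives the usual PDJ curve of detection rate against normalized distance.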

DeepPose: Human Pose Estimation via Deep Neural Networks
Alexander Toshev, Christian Szegedy - CVPR 2014

DeepFace: Closing the Gap to Human-Level Performance in Face Verification
Yaniv Taigman, Ming Yang, Marc’Aurelio Ranzato, Lior Wolf - CVPR 2014

Pipeline
• Detect faces
• Correct out-of-plane rotation
• Generate features via CNN
• Classify

Alignment

Fiducial Detection
• LBP histograms
• Support Vector Regressor
• Iteratively transform and predict
• 6 fiducial points for 2D alignment
• 67 fiducial points for 3D alignment

3D Alignment
• Iterative affine camera PnP
• 3D reference - Average mesh of USF Human-ID dataset
• Considers fiducial covariance
• Residuals applied to reference mesh
• Affine warp texture

CNN Architecture

CNN Architecture
Features

CNN Architecture
Figure labels: weight sharing; no weight sharing
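The practical effect of the weight sharing / no weight sharing distinction on this slide is easiest to see by counting parameters. The sketch below compares a convolutional layer, where one filter bank is shared across all positions, with a locally connected layer that learns a separate filter bank at every output position. The layer sizes are hypothetical and chosen only for illustration, and giving the unshared layer one bias per output position and channel is an assumption.

```python
def shared_params(kernel, c_in, c_out):
    """Convolution: one (kernel x kernel x c_in) filter plus one bias per output channel."""
    return (kernel * kernel * c_in + 1) * c_out


def unshared_params(kernel, c_in, c_out, out_h, out_w):
    """Locally connected layer: a separate filter bank (and bias) at every output position."""
    return shared_params(kernel, c_in, c_out) * out_h * out_w


# Hypothetical layer: 3x3 kernels, 16 input channels, 16 output channels, 10x10 output map.
conv_count = shared_params(3, 16, 16)
local_count = unshared_params(3, 16, 16, 10, 10)
ratio = local_count // conv_count
```

The unshared layer costs a factor of `out_h * out_w` more parameters, which is why such layers need large training sets and only make sense when the input is spatially aligned.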

Training
• Softmax cross-entropy loss: $L = -\log p_k$
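The loss on this slide is the negative log of the softmax probability assigned to the correct class k. A minimal sketch, assuming raw class scores (logits) as input and using the usual max-subtraction trick for numerical stability. The function names are illustrative.

```python
import math


def softmax(logits):
    """Convert raw class scores to probabilities; subtracting the max avoids overflow."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, k):
    """Softmax cross-entropy loss, -log p_k, for the true class index k."""
    return -math.log(softmax(logits)[k])


# With four equal scores every class has probability 1/4, so the loss is log(4).
uniform_loss = cross_entropy([0.0, 0.0, 0.0, 0.0], k=2)

# A confident, correct prediction gives a loss close to 0.
confident_loss = cross_entropy([10.0, 0.0, 0.0, 0.0], k=0)
```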

Sparsity
• ReLU nonlinearity - rectified linear unit, max(0, x)
• About 75% of the feature components in the topmost layers are exactly 0
• Dropout - first fully connected layer
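Both sources of sparsity on this slide are simple to state in code. The sketch below shows ReLU, a helper that measures the fraction of activations that are exactly zero, and dropout. It assumes the common "inverted" dropout formulation, which rescales the kept units at training time. The slide does not say which variant was used, and the function names and the drop probability of 0.5 are illustrative.

```python
import random


def relu(xs):
    """Rectified linear unit applied element-wise: max(0, x)."""
    return [max(0.0, x) for x in xs]


def zero_fraction(xs):
    """Fraction of entries that are exactly zero, a simple sparsity measure."""
    return sum(1 for x in xs if x == 0.0) / len(xs)


def dropout(xs, p=0.5, rng=None):
    """Inverted dropout (assumed variant): zero each unit with probability p
    and scale the survivors by 1 / (1 - p) so the expected value is unchanged."""
    rng = rng or random.Random()
    keep = 1.0 - p
    return [x / keep if rng.random() < keep else 0.0 for x in xs]


pre_activations = [-1.0, 0.5, -3.0, 2.0]
activations = relu(pre_activations)
sparsity = zero_fraction(activations)

# Training-time only; at test time the activations are passed through unchanged.
dropped = dropout(activations, p=0.5, rng=random.Random(0))
```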

Normalization
• ReLU - unbounded
• Normalize features to [0, 1] based on holdout
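One plausible reading of "normalize to [0, 1] based on holdout" is to divide each feature component by the largest value it takes on a held-out set. The sketch below follows that reading. The function names, the clipping of values above 1, and the small `eps` guard against division by zero are assumptions added for the illustration.

```python
def fit_maxima(holdout_features):
    """Per-component maximum over a list of holdout feature vectors."""
    return [max(column) for column in zip(*holdout_features)]


def normalize(feature, maxima, eps=1e-12):
    """Scale each non-negative component by its holdout maximum.
    Values above the holdout maximum are clipped to 1 (an assumption)."""
    return [min(1.0, v / max(m, eps)) for v, m in zip(feature, maxima)]


# Non-negative features, as produced by a ReLU layer.
holdout = [[2.0, 0.0, 1.0],
           [4.0, 0.0, 5.0]]
maxima = fit_maxima(holdout)
scaled = normalize([1.0, 0.0, 10.0], maxima)
```

Because ReLU outputs are non-negative, dividing by the maximum is enough to bound the range, and no offset needs to be subtracted.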

Verification Metrics
• Unsupervised - dot product
• χ² similarity
• Siamese network

χ² Similarity
• $\chi^2(f_1, f_2) = \sum_i w_i \, (f_1[i] - f_2[i])^2 / (f_1[i] + f_2[i])$
• Weights learned via SVM
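The weighted χ² similarity on this slide translates directly into code. In this minimal sketch the weights are passed in as plain numbers rather than learned with an SVM. Skipping components where both features are zero is an assumption made to avoid division by zero, which matters because the features are sparse and non-negative.

```python
def chi2_similarity(f1, f2, weights, eps=1e-12):
    """Weighted chi-squared distance: sum_i w_i * (f1[i] - f2[i])^2 / (f1[i] + f2[i]).
    Components with a (near-)zero denominator contribute nothing (an assumption)."""
    total = 0.0
    for a, b, w in zip(f1, f2, weights):
        denom = a + b
        if denom > eps:
            total += w * (a - b) ** 2 / denom
    return total


f1 = [1.0, 0.0, 2.0]
f2 = [3.0, 0.0, 2.0]
uniform = [1.0, 1.0, 1.0]

# Only the first component differs: (1 - 3)^2 / (1 + 3) = 1.
score = chi2_similarity(f1, f2, uniform)
```

Identical feature vectors score 0, and larger values indicate less similar faces, so a threshold on this score can act as the same/not-same decision.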

Siamese Network
• Difference of the two feature vectors (-), followed by FC 4096-to-1

Datasets
• Social Face Classification (SFC)
• Presumably Facebook photos
• 4.4 million faces; 4,030 people
• No overlap with other datasets

Datasets
• Labeled Faces in the Wild (LFW)
• 13,233 faces; 5,749 celebrities
• 6,000 pairs
• Restricted protocol - only same/not-same labels available at training
• Unrestricted protocol - identities available during training
• Unsupervised - no training on LFW

Datasets
• YouTube Faces (YTF)
• 3,425 videos of 1,595 subjects
• Subjects are a subset of the celebrities in LFW

SFC Training Perf
Figure panels: reduce data by omitting people; reduce data by omitting examples; remove layers from network

LFW Perf

Runtime
• 0.18 s - feature extraction (1 core; 2.2 GHz)
• 0.05 s - alignment
• 0.33 s - total

Questions?
