Impact of Deep Learning Speech Recognition Computer Vision - PowerPoint PPT Presentation

Impact of Deep Learning
• Speech Recognition
• Computer Vision
• Recommender Systems
• Language Understanding
• Drug Discovery and Medical Image Analysis
[Courtesy of R. Salakhutdinov]

[Hinton & Salakhutdinov, 2006] Deep Belief Networks: Training

Very Large Scale Use of DBNs [Quoc Le, et al., ICML, 2012]
Data: 10 million 200x200 unlabeled images, sampled from YouTube
Training: use 1000 machines (16,000 cores) for 1 week
Learned network: 3 multi-stage layers, 1.15 billion parameters
Achieves 15.8% (was 9.5%) accuracy classifying 1 of 20k ImageNet items
[Figures: real images that most excite the feature; image synthesized to most excite the feature]

Restricted Boltzmann Machines
Graphical Models: Powerful framework for representing dependency structure between random variables.
[Figure labels: hidden variables (feature detectors); visible variables (image); pair-wise and unary terms]
RBM is a Markov Random Field with:
• Stochastic binary visible variables
• Stochastic binary hidden variables
• Bipartite connections.
Markov random fields, Boltzmann machines, log-linear models.
[Courtesy, R. Salakhutdinov]
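The slide's equations did not survive extraction, so the following is only a minimal sketch of the standard binary RBM, assuming the usual energy with a pair-wise term and unary (bias) terms. The names `W`, `a`, `b` and the toy values are illustrative assumptions, not taken from the slides. Because connections are bipartite, each hidden unit is conditionally independent given the visible units, and vice versa.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def energy(v, h, W, a, b):
    """E(v, h) = -sum_ij v_i W_ij h_j - sum_i a_i v_i - sum_j b_j h_j."""
    pairwise = sum(v[i] * W[i][j] * h[j]
                   for i in range(len(v)) for j in range(len(h)))
    unary = (sum(a[i] * v[i] for i in range(len(v)))
             + sum(b[j] * h[j] for j in range(len(h))))
    return -(pairwise + unary)

def p_hidden_given_visible(v, W, b):
    """P(h_j = 1 | v) for every hidden unit j (independent given v)."""
    return [sigmoid(b[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(b))]

def p_visible_given_hidden(h, W, a):
    """P(v_i = 1 | h) for every visible unit i (independent given h)."""
    return [sigmoid(a[i] + sum(W[i][j] * h[j] for j in range(len(h))))
            for i in range(len(a))]

# Toy model (assumed values): 3 visible units, 2 hidden units.
W = [[0.5, -1.0], [1.0, 0.0], [-0.5, 2.0]]
a = [0.0, 0.1, -0.1]   # visible biases (unary terms)
b = [0.2, -0.2]        # hidden biases (unary terms)
v = [1, 0, 1]
h = [1, 1]

print(energy(v, h, W, a, b))
print(p_hidden_given_visible(v, W, b))
print(p_visible_given_hidden(h, W, a))
```

Lower energy corresponds to higher unnormalized probability; the two conditional functions are what a Gibbs sampler would alternate between.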

Model Learning
Given a set of i.i.d. training examples, we want to learn the model parameters.
Maximize log-likelihood objective: [equation not preserved]
Derivative of the log-likelihood: [equation not preserved]
[Figure labels: hidden units; visible units (image)]
[Courtesy, R. Salakhutdinov]
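For reference, the textbook form of this objective and its gradient for a binary RBM is sketched below. The notation (training set v^(1), ..., v^(N), parameters θ = {W, a, b}, partition function Z) is assumed and may differ from the symbols on the original slide.

```latex
% Assumed notation: N training examples v^{(n)}, parameters \theta = \{W, a, b\},
% energy E(v, h; \theta) = -v^\top W h - a^\top v - b^\top h.
P(v; \theta) = \frac{1}{Z(\theta)} \sum_{h} \exp\bigl(-E(v, h; \theta)\bigr)

L(\theta) = \frac{1}{N} \sum_{n=1}^{N} \log P\bigl(v^{(n)}; \theta\bigr)

\frac{\partial L(\theta)}{\partial W_{ij}}
  = \mathbb{E}_{P_{\text{data}}}\bigl[v_i h_j\bigr]
  - \mathbb{E}_{P_{\theta}}\bigl[v_i h_j\bigr]
```

The first expectation is taken with v drawn from the training data and h from P(h | v); the second is under the model's own joint distribution, which is the term that is intractable to compute exactly and is usually approximated by sampling.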

Deep Boltzmann Machines
Low-level features: Edges
Built from unlabeled inputs.
Input: Pixels (image)
(Salakhutdinov & Hinton, Neural Computation 2012)
[Courtesy, R. Salakhutdinov]

Deep Boltzmann Machines
Learn simpler representations, then compose more complex ones.
Higher-level features: Combination of edges
Low-level features: Edges
Built from unlabeled inputs.
Input: Pixels (image)
(Salakhutdinov 2008, Salakhutdinov & Hinton 2012)
[Courtesy, R. Salakhutdinov]

Model Formulation
[Figure: layer stack, bottom to top: input v, W1, h1, W2, h2, W3, h3]
Same as RBMs.
Requires approximate inference to train, but it can be done… and scales to millions of examples.
[Courtesy, R. Salakhutdinov]
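The slide says only that training requires approximate inference. A minimal sketch follows, assuming mean-field updates (one widely used choice for DBMs) on the three-hidden-layer stack v, h1, h2, h3 with weights W1, W2, W3. Biases are omitted for brevity, and the function names and toy weights are hypothetical. Note how h1 and h2 each combine a bottom-up and a top-down signal, which is what distinguishes a DBM from a purely feed-forward pass.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def project_up(W, x):
    """Bottom-up input: for W of shape (len(x), n_out), return W^T x."""
    return [sum(x[i] * W[i][j] for i in range(len(x)))
            for j in range(len(W[0]))]

def project_down(W, y):
    """Top-down input: for W of shape (n_in, len(y)), return W y."""
    return [sum(W[i][j] * y[j] for j in range(len(y)))
            for i in range(len(W))]

def mean_field(v, W1, W2, W3, n_iters=10):
    """Approximate P(h1, h2, h3 | v) by factorized marginals mu1, mu2, mu3."""
    mu1 = [0.5] * len(W1[0])
    mu2 = [0.5] * len(W2[0])
    mu3 = [0.5] * len(W3[0])
    for _ in range(n_iters):
        # h1 receives input from v (below) and h2 (above).
        mu1 = [sigmoid(u + d)
               for u, d in zip(project_up(W1, v), project_down(W2, mu2))]
        # h2 receives input from h1 (below) and h3 (above).
        mu2 = [sigmoid(u + d)
               for u, d in zip(project_up(W2, mu1), project_down(W3, mu3))]
        # h3 is the top layer, so it only receives bottom-up input.
        mu3 = [sigmoid(u) for u in project_up(W3, mu2)]
    return mu1, mu2, mu3

# Toy weights (assumed values): 3 visible, 2 + 2 + 1 hidden units.
W1 = [[0.5, -0.3], [0.2, 0.8], [-0.6, 0.1]]
W2 = [[1.0, -1.0], [0.4, 0.3]]
W3 = [[0.7], [-0.2]]

mu1, mu2, mu3 = mean_field([1, 0, 1], W1, W2, W3)
print(mu1, mu2, mu3)
```

A fixed number of sweeps is used here for simplicity; an implementation could instead iterate until the marginals stop changing.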

Samples Generated by the Model
[Figures: training data; model-generated samples]
[Courtesy, R. Salakhutdinov]

Handwriting Recognition

MNIST Dataset: 60,000 examples of 10 digits
Learning Algorithm                        Error
Logistic regression                       12.0%
K-NN                                      3.09%
Neural Net (Platt 2005)                   1.53%
SVM (Decoste et al. 2002)                 1.40%
Deep Autoencoder (Bengio et al. 2007)     1.40%
Deep Belief Net (Hinton et al. 2006)      1.20%
DBM                                       0.95%

Optical Character Recognition: 42,152 examples of 26 English letters
Learning Algorithm                        Error
Logistic regression                       22.14%
K-NN                                      18.92%
Neural Net                                14.62%
SVM (Larochelle et al. 2009)              9.70%
Deep Autoencoder (Bengio et al. 2007)     10.05%
Deep Belief Net (Larochelle et al. 2009)  9.68%
DBM                                       8.40%

Permutation-invariant version.
[Courtesy, R. Salakhutdinov]

3-D Object Recognition

NORB Dataset: 24,000 examples
Learning Algorithm                        Error
Logistic regression                       22.5%
K-NN (LeCun 2004)                         18.92%
SVM (Bengio & LeCun 2007)                 11.6%
Deep Belief Net (Nair & Hinton 2009)      9.0%
DBM                                       7.2%

[Figure: Pattern Completion]
[Courtesy, R. Salakhutdinov]

Learning Shared Representations Across Sensory Modalities
“Concept”
Tags: sunset, pacific ocean, baker beach, seashore, ocean
[Courtesy, R. Salakhutdinov]

Multimodal DBM
Gaussian model: dense, real-valued image features
Replicated Softmax: word counts (0, 0, 0, 1, 0)
(Srivastava & Salakhutdinov, NIPS 2012, JMLR 2014)
[Courtesy, R. Salakhutdinov]

Multimodal DBM
Bottom-up + Top-down
Gaussian model: dense, real-valued image features
Replicated Softmax: word counts (0, 0, 0, 1, 0)
(Srivastava & Salakhutdinov, NIPS 2012, JMLR 2014)
[Courtesy, R. Salakhutdinov]

Text Generated from Images
Given: [images]. Generated:
• insect, butterfly, insects, bug, butterflies, lepidoptera
• dog, cat, pet, kitten, puppy, ginger, tongue, kitty, dogs, furry
• graffiti, streetart, stencil, sticker, urbanart, graff, sanfrancisco
• sea, france, boat, mer, beach, river, bretagne, plage, brittany
• portrait, child, kid, ritratto, kids, children, boy, cute, boys, italy
• canada, nature, sunrise, ontario, fog, mist, bc, morning
[Courtesy, R. Salakhutdinov]

Text Generated from Images
Given: [images]. Generated:
• portrait, women, army, soldier, mother, postcard, soldiers
• obama, barackobama, election, politics, president, hope, change, sanfrancisco, convention, rally
• water, glass, beer, bottle, drink, wine, bubbles, splash, drops, drop

Images Selected from Text
Retrieved: [images]. Given:
• water, red, sunset
• nature, flower, red, green
• blue, green, yellow, colors
• chocolate, cake
[Courtesy, R. Salakhutdinov]

Summary
• Efficient learning algorithms for Deep Learning Models. Learning more adaptive, robust, and structured representations.
[Figure panels: Learning a Category Hierarchy; Image Tagging; Text & image retrieval / Object recognition; Caption Generation; Speech Recognition; Multimodal Data. Labels visible in the figures: "mosque, tower, building, cathedral, dome, castle"; "HMM decoder"; "sunset, pacific ocean, beach, seashore"]
• Deep models improve the current state-of-the-art in many application domains:
  ➢ Object recognition and detection, text and image retrieval, handwritten character and speech recognition, and others.
[Courtesy, R. Salakhutdinov]
