

slide-1
SLIDE 1 SNU Spoken Language Processing Lab / 서울대학교 음성언어처리연구실

Deep Learning: Methods and Applications

Chapter 3: Three Classes of Deep Learning Network

Presenters: 조성재, 최상우

slide-2

Contents

  • 3.1. A Three-Way Categorization
  • Classifying deep learning networks in three ways
  • 3.2. Deep Networks for Unsupervised or Generative Learning
  • The first category: unsupervised or generative learning
  • 3.3. Deep Networks for Supervised Learning
  • The second category: supervised learning
  • 3.4. Hybrid Deep Networks

slide-3

Machine Learning Basic Concept

slide-4

Contents Overview

  • Generative model vs. discriminative model
  • Joint distribution vs. conditional distribution
  • Denoising autoencoder
  • Mean squared reconstruction error and KL divergence
  • DBN, RBM, DBM
  • Sum-product network
  • Hessian-free optimization
  • Conditional random fields
  • Deep stacking network
  • Time-delay neural network
  • Convolutional neural network
  • Hybrid models

slide-5

3.1 A Three-Way Categorization(OVERVIEW)

  • Category 1: Deep networks for unsupervised or generative learning
  • Intended to capture high-order correlations of the observed or visible data for pattern analysis or synthesis purposes when no information about target class labels is available.
  • "Unsupervised feature or representation learning" in the literature refers to this category of deep networks.
  • Unsupervised learning in the generative mode may also be intended to characterize joint statistical distributions of the visible data and their associated classes.
  • Category 2: Deep networks for supervised learning
  • Intended to directly provide discriminative power for pattern-classification purposes, often by characterizing the posterior distributions of classes conditioned on the visible data.
  • Target label data are always available, in direct or indirect form, for such supervised learning.
  • They are also called discriminative deep networks.
  • Category 3: Hybrid deep networks
  • The goal is discrimination, which is assisted, often in a significant way, by the outcomes of generative or unsupervised deep networks. This can be accomplished by better optimization and/or regularization of the deep networks in category (2).

slide-6

3.1 Basic Deep Learning Terminologies - Seven Terms

1. Deep learning

  • A class of machine learning techniques.
  • The essence of deep learning is to compute hierarchical features or representations of the observational data, where the higher-level features or factors are defined from lower-level ones.

2. Deep belief network (DBN)

  • Probabilistic generative models composed of multiple layers of stochastic, hidden variables.

3. Boltzmann machine (BM)

  • A network of symmetrically connected, neuron-like units that make stochastic decisions about whether to be on or off.

slide-7

3.1 Basic Deep Learning Terminologies - Seven Terms

4. Restricted Boltzmann machine (RBM)

  • A special type of BM consisting of a layer of visible units and a layer of hidden units, with no visible-visible or hidden-hidden connections.

5. Deep neural network (DNN)

  • A multilayer perceptron with many hidden layers, whose weights are fully connected and are often initialized using either an unsupervised or a supervised pretraining technique.

6. Deep autoencoder

  • A "discriminative" DNN whose output targets are the data input itself rather than class labels; hence an unsupervised learning model.

7. Distributed representation

  • An internal representation of the observed data such that the data are modeled as being explained by the interactions of many hidden factors.

slide-8

3.2 Deep Networks for Unsupervised or Generative Learning

  • Intro
  • Many deep networks in this category can be used to meaningfully generate samples by sampling from the networks (examples: RBMs, DBNs, DBMs, and generalized denoising autoencoders) and are thus generative models.
  • Some networks in this category, however, cannot be easily sampled (examples: sparse coding networks and the original forms of deep autoencoders) and are thus not generative in nature.
  • Among the various subclasses of generative or unsupervised deep networks, the energy-based deep models are the most common.
  • Composing such energy-based models (e.g., stacking RBMs) leads to the deep belief network (DBN) (Chapter 5).

slide-9

3.2 Deep Networks for Unsupervised or Generative Learning

  • Introduced Deep Networks
  • Deep autoencoders
  • Transforming autoencoder
  • Predictive sparse coders
  • De-noising (stacked) autoencoders
  • Deep Boltzmann machines
  • Mean-covariance RBM
  • Deep Belief Networks
  • Sum-product networks
  • Recurrent neural networks

slide-10

slide-11

3.2 Deep Networks for Unsupervised or Generative Learning

  • Autoencoders
  • The original form of the deep autoencoder, which we will describe in more detail in Chapter 4, is a typical example of this unsupervised model category.
  • Most other forms of deep autoencoders are also unsupervised in nature, but with quite different properties and implementations.
  • Examples are transforming autoencoders, predictive sparse coders and their stacked version, and de-noising autoencoders and their stacked versions.

slide-12

3.2 Deep Networks for Unsupervised or Generative Learning

  • Autoencoders
  • Specifically, in de-noising autoencoders, the input vectors are first corrupted, for example by randomly selecting a percentage of the inputs and setting them to zero, or by adding Gaussian noise to them (like having to recognize objects through fog).
  • The encoded representations transformed from the uncorrupted data are used as the inputs to the next level of the stacked de-noising autoencoder.
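
The two corruption schemes mentioned above can be sketched in a few lines; this is a minimal sketch, in which the function names and the 30% corruption rate are illustrative assumptions, not from the text:

```python
import numpy as np

rng = np.random.default_rng(0)

def corrupt_masking(x, frac=0.3, rng=rng):
    """Randomly set a fraction of the inputs to zero (masking noise)."""
    mask = rng.random(x.shape) >= frac   # keep roughly (1 - frac) of the entries
    return x * mask

def corrupt_gaussian(x, sigma=0.1, rng=rng):
    """Add isotropic Gaussian noise to the inputs."""
    return x + sigma * rng.normal(size=x.shape)

# A denoising autoencoder is trained to reconstruct the clean x from the
# corrupted version; the encoder output computed from the uncorrupted data
# is then fed to the next level of the stack.
x = rng.random(8)              # a toy input vector
x_masked = corrupt_masking(x)  # some entries zeroed
x_noisy = corrupt_gaussian(x)  # all entries jittered
```

Either corruption can be applied per training example, so each epoch sees a different corrupted view of the same data.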

slide-13

3.2 Deep Networks for Unsupervised or Generative Learning

  • Deep Boltzmann Machines
  • A DBM contains many layers of hidden variables and has no connections between the variables within the same layer.
  • While having a simple learning algorithm, general BMs are very complex to study and very slow to train.
  • In a DBM, each layer captures complicated, higher-order correlations between the activities of the hidden features in the layer below.
  • DBMs have the potential of learning internal representations that become increasingly complex, which is highly desirable for solving object and speech recognition problems.
  • Further, the high-level representations can be built from a large supply of unlabeled sensory inputs, and very limited labeled data can then be used to only slightly fine-tune the model for a specific task at hand.

slide-14

3.2 Deep Networks for Unsupervised or Generative Learning

  • Deep Boltzmann Machines
  • When the number of hidden layers of a DBM is reduced to one, we have the restricted Boltzmann machine (RBM).
  • Like the DBM, the RBM has no hidden-to-hidden and no visible-to-visible connections.
  • The main virtue of the RBM is that, by composing many RBMs, many hidden layers can be learned efficiently, using the feature activations of one RBM as the training data for the next.
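
As a minimal illustration of how one RBM layer can be learned at a time, here is a sketch of a single contrastive-divergence (CD-1) update for a binary RBM; the factorized conditionals below rely exactly on the absence of visible-visible and hidden-hidden connections. All names, sizes, and the learning rate are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cd1_step(W, a, b, v0, lr=0.1, rng=rng):
    """One contrastive-divergence (CD-1) update for a binary RBM.

    W: visible-to-hidden weights; a, b: visible/hidden biases;
    v0: a batch of visible vectors of shape (batch, n_visible).
    Both conditionals factorize because there are no within-layer links.
    """
    # Up: hidden probabilities and samples given the data.
    ph0 = sigmoid(v0 @ W + b)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # Down-up: one step of Gibbs sampling (the "reconstruction").
    pv1 = sigmoid(h0 @ W.T + a)
    ph1 = sigmoid(pv1 @ W + b)
    # Approximate log-likelihood gradient (positive minus negative phase).
    W += lr * (v0.T @ ph0 - pv1.T @ ph1) / len(v0)
    a += lr * (v0 - pv1).mean(axis=0)
    b += lr * (ph0 - ph1).mean(axis=0)
    return W, a, b, ph0

# Stacking: after training one RBM, its hidden activations ph0 serve as
# the "data" on which the next RBM in the stack is trained.
```

The returned `ph0` is exactly the quantity the slide refers to: the feature activations handed to the next RBM.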

slide-15

3.2 Deep Networks for Unsupervised or Generative Learning

  • Boltzmann Machines

slide-16

3.2 Deep Networks for Unsupervised or Generative Learning

  • Restricted Boltzmann Machines

slide-17

3.2 Deep Networks for Unsupervised or Generative Learning

  • Restricted Boltzmann Machines

slide-18

3.2 Deep Networks for Unsupervised or Generative Learning

  • Deep Boltzmann Machines

slide-19

3.2 Deep Networks for Unsupervised or Generative Learning

  • Deep Belief Networks

slide-20

3.2 Deep Networks for Unsupervised or Generative Learning

  • DBM vs. DBN

slide-21

3.2 Deep Networks for Unsupervised or Generative Learning

  • Mean-Covariance RBM (mcRBM)
  • The standard DBN has been extended to a factored higher-order Boltzmann machine in its bottom layer, with strong results obtained for phone recognition (Dahl et al., 2010).
  • This model, called the mean-covariance RBM or mcRBM, addresses the limitation of the standard RBM in its ability to represent the covariance structure of the data.
  • However, it is difficult to train mcRBMs and to use them at the higher levels of the deep architecture.
  • The mcRBM parameters in the full DBN are not fine-tuned using the discriminative information (which is used for fine-tuning the higher layers of RBMs) due to the high computational cost.

slide-22

3.2 Deep Networks for Unsupervised or Generative Learning

  • Sum-Product Networks (SPN)
  • Another representative deep generative network that can be used for unsupervised (as well as supervised) learning is the sum-product network, or SPN.
  • An SPN is a directed acyclic graph with the observed variables as leaves, and with sum and product operations as internal nodes in the deep network.
  • The "sum" nodes give mixture models, and the "product" nodes build up the feature hierarchy.
  • Properties of "completeness" and "consistency" constrain the SPN in a desirable way. The learning of SPNs is carried out using the EM algorithm together with back-propagation.
  • The learning procedure starts with a dense SPN. It then finds an SPN structure by learning its weights, where zero weights indicate removed connections.
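
To make the roles of the sum and product nodes concrete, here is a hypothetical two-variable SPN evaluated bottom-up; the structure, leaf probabilities, and mixture weights are all invented for illustration:

```python
# Leaves: probability values for two binary variables X1, X2 under two
# illustrative components.  Product nodes multiply their children
# (feature conjunctions); sum nodes take weighted mixtures of children.
def spn_value(x1, x2):
    # Leaf distributions P(Xi = xi) for components "a" and "b".
    leaf_a1 = 0.8 if x1 == 1 else 0.2
    leaf_b1 = 0.3 if x1 == 1 else 0.7
    leaf_a2 = 0.6 if x2 == 1 else 0.4
    leaf_b2 = 0.1 if x2 == 1 else 0.9
    # Product nodes: independent combination within each component.
    prod_a = leaf_a1 * leaf_a2
    prod_b = leaf_b1 * leaf_b2
    # Root sum node: a mixture over the two product nodes.
    w_a, w_b = 0.5, 0.5
    return w_a * prod_a + w_b * prod_b

# Because this toy SPN is complete and consistent, the root value sums
# to 1 over all joint assignments of (X1, X2).
total = sum(spn_value(x1, x2) for x1 in (0, 1) for x2 in (0, 1))
```

Setting a mixture weight such as `w_b` to zero would correspond to removing that connection, which is how structure emerges from weight learning in the dense-SPN procedure described above.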

slide-23

3.2 Deep Networks for Unsupervised or Generative Learning

  • Sum Product Networks (SPN)

slide-24

3.2 Deep Networks for Unsupervised or Generative Learning

  • Sum-Product Networks (SPN)
  • The main difficulty in learning SPNs: the learning signal (i.e., the gradient) quickly dilutes as it propagates to the deep layers.
  • It was pointed out in the original paper that, despite the many desirable generative properties of the SPN, it is difficult to fine-tune the parameters using the discriminative information, limiting its effectiveness in classification tasks.
  • However, this difficulty has been overcome in subsequent work, where an efficient backpropagation-style discriminative training algorithm for the SPN was presented.

slide-25

3.2 Deep Networks for Unsupervised or Generative Learning

  • Sum-Product Networks (SPN)
  • Importantly, standard gradient descent, based on the derivative of the conditional likelihood, suffers from the same gradient diffusion problem well known in regular DNNs.
  • The trick to alleviate this problem when learning SPNs is to replace the marginal inference with the most probable state of the hidden variables, and to propagate gradients through this "hard" alignment only.
  • Excellent results on small-scale image recognition tasks were reported by Gens and Domingos (2012).

slide-26

3.2 Deep Networks for Unsupervised or Generative Learning

  • Recurrent Neural Networks (RNN)
  • Recurrent neural networks (RNNs) can be considered another class of deep networks for unsupervised (as well as supervised) learning, where the depth can be as large as the length of the input data sequence.
  • In the unsupervised learning mode, the RNN is used to predict the data sequence into the future using the previous data samples, with no additional class information used for learning.
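
A minimal sketch of this unsupervised mode: a vanilla RNN, unrolled over the sequence (so its effective depth equals the sequence length), emits a prediction of the next sample at every step. The sizes, initialization, and linear readout here are illustrative assumptions; training is only indicated in a comment:

```python
import numpy as np

rng = np.random.default_rng(0)

# A vanilla RNN cell; unrolled over the sequence, its depth equals the
# sequence length, as noted above.
n_in, n_hid = 3, 5
Wxh = rng.normal(0, 0.1, (n_in, n_hid))
Whh = rng.normal(0, 0.1, (n_hid, n_hid))
Why = rng.normal(0, 0.1, (n_hid, n_in))

def predict_next(seq):
    """Return the RNN's prediction of x[t+1] from x[0..t], for every t."""
    h = np.zeros(n_hid)
    preds = []
    for x in seq:
        h = np.tanh(x @ Wxh + h @ Whh)   # the state summarizes past samples
        preds.append(h @ Why)            # linear readout = next-sample guess
    return np.array(preds)

seq = rng.normal(size=(7, n_in))   # an unlabeled data sequence
preds = predict_next(seq)          # no class labels used anywhere
# Training would minimize e.g. ((preds[:-1] - seq[1:]) ** 2).sum()
# by backpropagation through time.
```

The point is that the loss is defined purely by the data sequence itself, which is what makes this usage unsupervised.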

slide-27

3.2 Deep Networks for Unsupervised or Generative Learning

  • Recurrent Neural Networks (RNN)
  • The RNN is very powerful for modeling sequence data (e.g., speech or text), but until recently RNNs had not been widely used, partly because they are difficult to train to capture long-term dependencies, giving rise to gradient vanishing or gradient explosion problems.
  • Recent advances in Hessian-free optimization have partially overcome this difficulty by using approximated second-order information or stochastic curvature estimates.
  • In more recent work, RNNs trained with Hessian-free optimization are used as generative deep networks in character-level language modeling tasks.

slide-28

3.2 Deep Networks for Unsupervised or Generative Learning

  • Recurrent Neural Networks (RNN)

slide-29

3.2 Deep Networks for Unsupervised or Generative Learning

  • Recurrent Neural Networks (RNN)
  • In these modeling tasks, gated connections are introduced to allow the current input characters to predict the transition from one latent state vector to the next.
  • Such generative RNN models are demonstrated to be well capable of generating sequential text characters.
  • More recently, Bengio et al. (2013) and Sutskever (2013) have explored variations of stochastic gradient descent optimization algorithms for training generative RNNs, and have shown that these algorithms can outperform Hessian-free optimization methods.
  • Mikolov et al. (2010) have reported excellent results on using RNNs for language modeling.
  • More recently, Mesnil et al. (2013) and Yao et al. (2013) reported the success of RNNs in spoken language understanding.

slide-30

3.2 Deep Networks for Unsupervised or Generative Learning

  • Dynamic and Deep Structures in Speech Recognition
  • There is a long history in speech recognition research of exploiting human speech production mechanisms to construct dynamic and deep structure in probabilistic generative models.
  • Specifically, the early work generalized and extended the conventional shallow and conditionally independent HMM structure by imposing dynamic constraints, in the form of polynomial trajectories, on the HMM parameters.
  • A variant of this approach has more recently been developed using different learning techniques for time-varying HMM parameters, with the applications extended to speech recognition robustness (Yu and Deng, 2009; Yu et al., 2009a).
  • Similar trajectory HMMs also form the basis for parametric speech synthesis.

slide-31

3.2 Deep Networks for Unsupervised or Generative Learning

  • Dynamic and Deep Structures in Speech Recognition
  • Subsequent work added a new hidden layer into the dynamic model to explicitly account for the target-directed, articulatory-like properties of human speech generation.
  • A more efficient implementation of this deep architecture with hidden dynamics is achieved with non-recursive or finite impulse response (FIR) filters in more recent studies.

slide-32

3.2 Deep Networks for Unsupervised or Generative Learning

  • Deep Structured Generative Models
  • The above deep structured generative models of speech can be shown to be special cases of the more general dynamic network model, and of even more general dynamic graphical models.
  • The graphical models can comprise many hidden layers to characterize the complex relationships between the variables in speech generation.
  • Armed with powerful graphical modeling tools, the deep architecture of speech has more recently been successfully applied to solve the very difficult problem of single-channel, multi-talker speech recognition, where the mixed speech is the visible variable while the un-mixed speech is represented in a new hidden layer in the deep generative architecture (Rennie et al., 2010; Wohlmayr et al., 2011).

slide-33

3.2 Deep Networks for Unsupervised or Generative Learning

  • Deep Structured Generative Models
  • Deep generative graphical models are indeed a powerful tool in many applications, due to their capability of embedding domain knowledge.
  • However, they are often used with inappropriate approximations in inference, learning, prediction, and topology design, all arising from the inherent intractability of these tasks for most real-world applications.
  • An even more drastic way to deal with this intractability was proposed recently by Bengio et al. (2013b), where the need to marginalize latent variables is avoided altogether.

slide-34

3.2 Deep Networks for Unsupervised or Generative Learning – to delete?

  • The standard statistical methods used for large-scale speech recognition and understanding combine (shallow) hidden Markov models for the speech acoustics with higher layers of structure representing different levels of the natural language hierarchy.
  • This combined hierarchical model can suitably be regarded as a deep generative architecture, whose motivation and some technical details may be found in Chapter 7 of the recent book on the "Hierarchical HMM," or HHMM.
  • These early deep models were formulated as directed graphical models, missing the key aspect of "distributed representation" embodied in the more recent deep generative networks of the DBN and DBM.
  • Filling in this missing aspect would help improve these generative models.

slide-35

3.2 Deep Networks for Unsupervised or Generative Learning – to delete?

  • Finally, dynamic or temporally recursive generative models based on neural network architectures can be found in (Taylor et al., 2007) for human motion modeling, and for natural language and natural scene parsing.
  • The latter model is particularly interesting because its learning algorithms are capable of automatically determining the optimal model structure. This contrasts with other deep architectures, such as the DBN, where only the parameters are learned while the architecture needs to be pre-defined.
  • Specifically, the recursive structure commonly found in natural scene images and in natural language sentences can be discovered using a max-margin structure prediction architecture.
  • It is shown that the units contained in the images or sentences are identified, and that the way in which these units interact with each other to form the whole is also identified.

slide-36

3.3 Deep Networks for Supervised Learning - CRF

  • Many of the discriminative techniques for supervised learning in signal and information processing are shallow architectures, such as conditional random fields (CRFs).
  • A CRF is intrinsically a shallow discriminative architecture, characterized by the linear relationship between the input features and the transition features.
slide-37
3.3 Deep Networks for Supervised Learning – deep-structured CRF

  • Recently, deep-structured CRFs have been developed by stacking the output of each lower CRF layer, together with the original input data, onto its higher layer.
  • Various versions of deep-structured CRFs have been successfully applied to phone recognition, spoken language identification, and natural language processing.
slide-38
SLIDE 38 SNU Spoken Language Processing Lab / 서울대학교 음성언어처리연구실

3.3 Deep Networks for Supervised Learning – Multilayer perceptron (MLP)

  • Morgan (2012) gives an excellent review of other major existing discriminative models in speech recognition, based mainly on the traditional neural network or MLP architecture using backpropagation learning with random initialization.
  • It argues for the importance of both the increased width of each layer of the neural networks and the increased depth.
  • In particular, a class of deep neural network models forms the basis of the popular "tandem" approach (Morgan et al., 2005), where the output of a discriminatively learned neural network is treated as part of the observation variable in HMMs.

slide-39

3.3 Deep Networks for Supervised Learning – Deep Stacking Network (DSN)

  • In the most recent work, a new deep learning architecture, sometimes called the Deep Stacking Network (DSN), has been developed, together with its tensor variant and its kernel version; all focus on discrimination, with scalable, parallelizable learning relying on little or no generative component.
  • We will describe this type of discriminative deep architecture in detail in Chapter 6.
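
The stacking idea can be sketched as follows, under simplifying assumptions: each module here is a one-hidden-layer network with a random (rather than RBM-initialized) hidden layer, its input is the raw data concatenated with the previous module's output, and its upper-layer weights are solved in closed form by least squares, which is one reason DSN-style learning is scalable and parallelizable. All names and sizes are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

def fit_module(X, Y, n_hidden=32, rng=rng):
    """One DSN-style module: fixed random sigmoid hidden layer, then
    output weights U obtained in closed form by least squares."""
    W = rng.normal(0, 1.0, (X.shape[1], n_hidden))
    H = 1.0 / (1.0 + np.exp(-(X @ W)))          # hidden activations
    U, *_ = np.linalg.lstsq(H, Y, rcond=None)   # convex upper-layer fit
    return W, U

def module_predict(X, W, U):
    H = 1.0 / (1.0 + np.exp(-(X @ W)))
    return H @ U

def fit_dsn(X, Y, n_modules=3):
    """Stack modules; each sees [raw input, previous module's output]."""
    mods, Z = [], X
    for _ in range(n_modules):
        W, U = fit_module(Z, Y)
        mods.append((W, U))
        Z = np.hstack([X, module_predict(Z, W, U)])  # stack output onto input
    return mods

def dsn_predict(X, mods):
    Z, out = X, None
    for W, U in mods:
        out = module_predict(Z, W, U)
        Z = np.hstack([X, out])
    return out
```

The closed-form upper-layer solve is what distinguishes this from plain backpropagation through a deep net: each module's hardest subproblem is convex.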

slide-40

3.3 Deep Networks for Supervised Learning – Recurrent Neural Network (RNN)

  • RNNs can also be used as a discriminative model, in which the output is a label sequence associated with the input data sequence.

slide-41

3.3 Deep Networks for Supervised Learning – Recurrent Neural Network (RNN)

  • A set of new models and methods was proposed more recently that enables the RNNs themselves to perform sequence classification, embedding long short-term memory (LSTM) into the model.
  • Underlying this method is the idea of interpreting RNN outputs as conditional distributions over all possible label sequences given the input sequence:
  • q(z | y_1, y_2, ⋯, y_T), for a label sequence z given the input sequence y_1, …, y_T
  • Then, a differentiable objective function can be derived to optimize these conditional distributions over the correct label sequences.
  • The effectiveness of this method has been demonstrated in handwriting recognition tasks and in a small speech task (Graves et al., 2013, 2013a), to be discussed in more detail in Chapter 7 of this book.

slide-42

3.3 Deep Networks for Supervised Learning – convolutional neural network (CNN)

  • Another type of discriminative deep architecture is the convolutional neural network (CNN), in which each module consists of a convolutional layer and a pooling layer.
  • These modules are often stacked up one on top of another, or with a DNN on top, to form a deep model.
slide-43

3.3 Deep Networks for Supervised Learning – convolutional neural network (CNN)

  • The convolutional layer shares many weights, and the pooling layer subsamples the output of the convolutional layer, reducing the data rate from the layer below.
  • The weight sharing in the convolutional layer, together with appropriately chosen pooling schemes, endows the CNN with some "invariance" properties (e.g., translation invariance).
  • CNNs have been found highly effective and have been commonly used in computer vision and image recognition. Recently, the CNN has also been found effective for speech recognition.
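
A toy 1-D illustration of these two ingredients (a shared-weight convolution followed by max-pooling) showing the resulting tolerance to a small translation of the input; the filter and signals are made up for the demonstration:

```python
import numpy as np

def conv1d(x, w):
    """Valid 1-D convolution: the same weights w slide across all of x."""
    n = len(x) - len(w) + 1
    return np.array([x[i:i + len(w)] @ w for i in range(n)])

def max_pool(y, size=4):
    """Subsample: keep only the max in each non-overlapping window."""
    n = len(y) // size
    return np.array([y[i * size:(i + 1) * size].max() for i in range(n)])

w = np.array([1.0, -1.0])                  # a shared edge-detector filter
x = np.zeros(16); x[5] = 1.0               # a spike at position 5
x_shift = np.zeros(16); x_shift[6] = 1.0   # the same spike, shifted by one

out = max_pool(conv1d(x, w))
out_shift = max_pool(conv1d(x_shift, w))
# After pooling, the one-step shift no longer changes the representation:
# out and out_shift are identical.
```

Weight sharing makes the filter respond the same way wherever the pattern occurs, and pooling discards exactly where within each window it occurred; together they give the translation tolerance described above.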

slide-44

3.3 Deep Networks for Supervised Learning – time-delay neural network (TDNN)

  • It is useful to point out that the time-delay neural network (TDNN), developed for early speech recognition, is a special case and predecessor of the CNN, in which weight sharing is limited to one of the two dimensions (the time dimension) and there is no pooling layer.
  • It was not until recently that researchers discovered that time-dimension invariance is less important than frequency-dimension invariance for speech recognition.
  • A careful analysis of the underlying reasons is described in (Deng et al., 2013), together with a new strategy for designing the CNN's pooling layer, demonstrated to be more effective than all previous CNNs in phone recognition.

slide-45

3.4 Hybrid Deep Networks – two viewpoints

  • The term "hybrid" for this third category refers to deep architectures that either comprise or make use of both generative and discriminative model components.
  • In the existing hybrid architectures published in the literature, the generative component is mostly exploited to help with discrimination, which is the final goal of the hybrid architecture.
  • How and why generative modeling can help with discrimination can be examined from two viewpoints:
  • The optimization viewpoint, where generative models trained in an unsupervised fashion can provide excellent initialization points for highly nonlinear parameter estimation problems.
  • The regularization perspective, where the unsupervised-learning models can effectively provide a prior on the set of functions representable by the model.

slide-46

3.4 Hybrid Deep Networks – DBN-DNN model

  • The DBN can be converted to, and used as, the initial model of a DNN for supervised learning with the same network structure, which is then further discriminatively trained or fine-tuned using the target labels provided.
  • When the DBN is used in this way, we consider this DBN-DNN model a hybrid deep model, where the model trained using unsupervised data helps to make the discriminative model effective for supervised learning.
  • We will review the details of the discriminative DNN for supervised learning in the context of RBM/DBN generative, unsupervised pre-training in Chapter 5.
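
The conversion can be sketched as a simple weight hand-off, assuming greedy layer-wise pretraining has already produced one (weights, hidden-bias) pair per DBN layer: copy them into the DNN, add a new randomly initialized output layer, and then fine-tune everything with backpropagation on the labels. Shapes and names below are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

# Pretend these came from greedy layer-wise RBM pretraining (Chapter 5):
# one (W, hidden-bias) pair per DBN layer.
dbn_layers = [
    (rng.normal(0, 0.01, (784, 256)), np.zeros(256)),
    (rng.normal(0, 0.01, (256, 128)), np.zeros(128)),
]

def dbn_to_dnn(dbn_layers, n_classes, rng=rng):
    """Initialize a DNN from DBN weights; only the softmax layer is new."""
    dnn = [(W.copy(), b.copy()) for W, b in dbn_layers]
    n_top = dbn_layers[-1][0].shape[1]
    dnn.append((rng.normal(0, 0.01, (n_top, n_classes)), np.zeros(n_classes)))
    return dnn   # all of these are then fine-tuned with labeled data

dnn = dbn_to_dnn(dbn_layers, n_classes=10)
```

The unsupervised phase thus only supplies the starting point; the "hybrid" character comes from the discriminative fine-tuning that follows.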

slide-47

3.4 Hybrid Deep Networks – DNN-CRF and DNN-HMM

  • Another example of the hybrid deep network is developed in (Mohamed et al., 2010), where the DNN weights are also initialized from a generative DBN but are further fine-tuned with a sequence-level discriminative criterion (the conditional probability of the label sequence given the input feature sequence) instead of the frame-level cross-entropy criterion commonly used.
  • This can be viewed as a combination of the static DNN with the shallow discriminative architecture of the CRF. It can be shown that such a DNN-CRF is equivalent to a hybrid deep architecture of DNN and HMM whose parameters are learned jointly using the full-sequence maximum mutual information (MMI) criterion between the entire label sequence and the input feature sequence.

slide-48

3.4 Hybrid Deep Networks - RBM

  • In (Larochelle and Bengio, 2008), the generative model of the RBM is learned using the discriminative criterion of posterior class-label probabilities.
  • Here the label vector is concatenated with the input data vector to form the combined visible layer of the RBM.
  • In this way, the RBM can serve as a stand-alone solution to classification problems, and the authors derived a discriminative learning algorithm for the RBM as a shallow generative model.

slide-49

3.4 Hybrid Deep Networks – pretraining of CNNs

  • A further example of hybrid deep networks is the use of the generative models of DBNs to pre-train deep convolutional neural networks (Lee et al., 2009, 2010, 2011).
  • As with the fully connected DNN discussed earlier, pre-training also helps to improve the performance of deep CNNs over random initialization.
  • Pre-training DNNs or CNNs using a set of regularized deep autoencoders (Bengio et al., 2013a), including denoising autoencoders, contractive autoencoders, and sparse autoencoders, is a similar example of this category of hybrid deep networks.

slide-50

3.4 Hybrid Deep Networks – two-stage architecture

  • The final example given here for hybrid deep networks is based on the idea and work of (Ney, 1999; He and Deng, 2011), where one task of discrimination (e.g., speech recognition) produces the output (text) that serves as the input to a second task of discrimination (e.g., machine translation).
  • The overall system, providing the functionality of speech translation (translating speech in one language into text in another language), is a two-stage deep architecture consisting of both generative and discriminative elements.
  • Both the model of speech recognition (e.g., HMM) and the model of machine translation (e.g., phrasal mapping and non-monotonic alignment) are generative in nature, but their parameters are all learned for discrimination of the ultimate translated text given the speech data.

slide-51

Question

  • Regarding the sentence after "On the other hand" on p. 216: it says that unsupervised-learning models are easier to interpret, easier to embed domain knowledge into, and easier to handle uncertainty with. This part is not entirely clear to me, so please explain it in detail with examples.

"On the other hand, the deep unsupervised-learning models, especially the probabilistic generative ones, are easier to interpret, easier to embed domain knowledge, easier to compose, and easier to handle uncertainty, but they are typically intractable in inference and learning for complex systems."

slide-52

Answer

  • The answer is expected to be found in the papers below (in order of importance: 2, 1, 3).

1. G. Alain and Y. Bengio. What regularized autoencoders learn from the data generating distribution. In Proceedings of International Conference on Learning Representations (ICLR), 2013.

2. Y. Bengio, A. Courville, and P. Vincent. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 35(8):1798–1828, 2013.

3. Y. Bengio, E. Thibodeau-Laufer, and J. Yosinski. Deep generative stochastic networks trainable by backprop. arXiv:1306.1091, 2013. Also accepted to appear in Proceedings of International Conference on Machine Learning (ICML), 2014.

slide-53

Answer

On the other hand, the deep unsupervised-learning models, especially the probabilistic generative ones, are easier to interpret, easier to embed domain knowledge, easier to compose, and easier to handle uncertainty, but they are typically intractable in inference and learning for complex systems.

  • The deep unsupervised-learning models: autoencoders, sparse coding networks, etc.
  • For comparison, the deep supervised-learning models: DNNs, etc.
  • "Easier to interpret": Paper 2 on the previous slide covers various interpretation examples.
  • "Easier to embed domain knowledge": for example, when learning representations of the digit images 0 through 9, an autoencoder layer can be given exactly 10 units.
  • "Easier to handle uncertainty": since the models in question are probabilistic generative ones, one can argue that they handle uncertainty (i.e., probability) naturally; the precise intended answer is unknown.

53

slide-54
SLIDE 54

Question

  • Please explain the SPN architecture on p. 220, with a figure.

54

slide-55
SLIDE 55

Answer

References

  • [Paper (first introduced)] H. Poon and P. Domingos. Sum-product networks: A new deep architecture. In Proceedings of Uncertainty in Artificial Intelligence (UAI), 2011.
  • [Paper] R. Gens and P. Domingos. Discriminative learning of sum-product networks. Neural Information Processing Systems (NIPS), 2012.
  • [Lecture by the author] Sum-Product Networks: The Next Generation of Deep Models by P. Domingos (YouTube) (PDF)
  • Comprehensive web page (http://spn.cs.washington.edu/)

55

slide-56
SLIDE 56

Answer: Basic Structure

56

[Figures: basic SPN structure, deep SPN structure, and the forward/backward passes]
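As a sketch of the structure in the figures: an SPN alternates sum nodes (weighted mixtures over the same variable scope) and product nodes (factorizations over disjoint scopes), with tractable distributions at the leaves, and any marginal is a single bottom-up pass. A minimal hand-built example in Python; the specific structure, weights, and variable names are hypothetical:

```python
import math

# Minimal sum-product network over two binary variables x1, x2.
# Leaves are Bernoulli distributions; a product node factorizes over
# disjoint variable scopes; a sum node is a weighted mixture.

def bernoulli(var, p):
    # Leaf node: P(X_var = x) under Bernoulli(p)
    return lambda a: p if a[var] == 1 else 1.0 - p

def product_node(children):
    # Product node: multiply children defined over disjoint scopes
    return lambda a: math.prod(c(a) for c in children)

def sum_node(weights, children):
    # Sum node: weighted mixture of children over the same scope
    return lambda a: sum(w * c(a) for w, c in zip(weights, children))

# P(x1, x2) = 0.6 * B(x1; 0.9) B(x2; 0.2) + 0.4 * B(x1; 0.1) B(x2; 0.7)
spn = sum_node([0.6, 0.4], [
    product_node([bernoulli("x1", 0.9), bernoulli("x2", 0.2)]),
    product_node([bernoulli("x1", 0.1), bernoulli("x2", 0.7)]),
])

# With normalized sum weights the network sums to 1 over all assignments,
# so evaluation (the forward pass) directly yields probabilities.
total = sum(spn({"x1": a, "x2": b}) for a in (0, 1) for b in (0, 1))
```

The backward pass in the figure corresponds to differentiating this evaluation with respect to the node weights, which is what discriminative SPN training (Gens and Domingos, 2012) exploits.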

slide-57
SLIDE 57

Answer: Architecture with an Example

57

slide-58
SLIDE 58

Question

  • I have a question about using RNNs for unsupervised learning. When inferring a future sequence from previous data samples, I think incorrect intermediate predictions can accumulate and eventually cause the entire sequence to be inferred incorrectly. Is there a mechanism that can address this problem?

58

slide-59
SLIDE 59

Answer

  • During training, this problem can be mitigated with a technique called teacher forcing (Goodfellow et al., 2016): the next state is predicted from the current ground-truth value rather than from the current predicted value.

59

[Figures: vanilla RNN vs. RNN with teacher forcing]

Goodfellow et al. Deep Learning. The MIT Press. 2016
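To make the contrast concrete, here is a toy scalar "RNN" run in both modes; the dynamics, weights, and sequence are hypothetical, chosen only to show how free-running errors compound while teacher forcing keeps each step anchored to the ground truth:

```python
import numpy as np

# Toy scalar "RNN" y_hat_t = tanh(w * y_prev): in free-running mode y_prev is
# the model's own previous prediction, so one-step errors compound; with
# teacher forcing y_prev is the ground truth at every step.

def free_running(y_true, w):
    preds, prev = [], y_true[0]
    for _ in range(1, len(y_true)):
        prev = np.tanh(w * prev)               # feed back own prediction
        preds.append(prev)
    return preds

def teacher_forced(y_true, w):
    # Feed the ground-truth previous value at every step
    return [np.tanh(w * y_true[t - 1]) for t in range(1, len(y_true))]

# Ground truth generated by the same dynamics with w_true = 1.5 ...
y = [0.5]
for _ in range(4):
    y.append(np.tanh(1.5 * y[-1]))

# ... predicted by a slightly mis-trained model with w = 1.4:
err_free = abs(free_running(y, 1.4)[-1] - y[-1])
err_forced = abs(teacher_forced(y, 1.4)[-1] - y[-1])
# err_free > err_forced: the free-running error has accumulated over steps
```

Note that teacher forcing applies only during training; at generation time the model must still run free, which is why scheduled mixing of the two modes is sometimes used.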

slide-60
SLIDE 60

Question & Answer

  • Models submitted for various audio-related tasks in 2017–18 can still be seen using classical machine-learning methods such as HMMs, and in some cases RNN-based variants fail to match the performance of HMM-based models. I am curious what the presenters think about this.

  • Since audio processing is not my specialty, I am not familiar with "the various models submitted for audio-related tasks in 2017–18".
  • If you had specified concrete cases where "RNN-based variants fail to match HMM-based models", a more substantive discussion would have been possible.
  • The biggest difference between an HMM and an RNN is that the HMM predicts the next state from the current state, whereas the RNN predicts it from the current state and several previous states. However, the HMM assumes the current state encapsulates all previous states (the Markov property). An RNN is weak at long-term dependencies and lacks the Markov property, so it must be made to capture long-term dependencies well; because that is hard to achieve, I think the RNN can fail to deliver better performance. By contrast, I think an HMM is easier than an RNN to design so that it carries long-term dependencies, which is why the HMM performs better in those cases.

60

slide-61
SLIDE 61

Question & Answer

  • I would like to know about the relatively recently developed method that uses time-varying HMM parameters.

  • I first encountered the time-varying HMM through this question.
  • You asked about "the method using time-varying HMM parameters", but I could not find what "time-varying HMM parameters" refers to or what it is used for, so it is hard to grasp the intent of the question. Do you mean the time-varying HMM introduced in Wang et al. (2009)?

61

slide-62
SLIDE 62

Question & Answer

  • This question comes from a limited grasp of the basics of CRFs. Given that a CRF is a shallow layer, the book says it can be stacked into a deep structure (p. 31). How does such a stacked structure differ from a DNN? As the simplest comparison, how does a CRF differ from a 1-layer DNN?

62

[Figures: example CRF structure, deep-structured CRF, and DNN structure]
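One way to see the difference numerically: a 1-layer DNN with a softmax scores each position of a sequence independently, while a linear-chain CRF adds transition scores between neighboring labels and normalizes over entire label sequences. A minimal sketch with hypothetical scores:

```python
import numpy as np

# Toy tagging problem: T = 3 positions, K = 2 labels.
emissions = np.array([[2.0, 0.5],
                      [0.4, 1.5],
                      [1.0, 1.2]])     # emissions[t, k]: score of label k at t
transitions = np.array([[1.0, -0.5],
                        [-0.5, 1.0]])  # transitions[j, k]: score of j -> k

def crf_log_Z(em, tr):
    # Forward algorithm: log partition function over all label sequences
    alpha = em[0]
    for t in range(1, len(em)):
        alpha = em[t] + np.logaddexp.reduce(alpha[:, None] + tr, axis=0)
    return np.logaddexp.reduce(alpha)

def crf_seq_score(em, tr, ys):
    # Unnormalized log-score of one whole label sequence
    return em[np.arange(len(ys)), ys].sum() + tr[ys[:-1], ys[1:]].sum()

ys = np.array([0, 0, 1])
log_p_crf = crf_seq_score(emissions, transitions, ys) \
            - crf_log_Z(emissions, transitions)

# The independent-softmax ("1-layer DNN") log-probability of the same labels:
log_p_dnn = (emissions[np.arange(3), ys]
             - np.log(np.exp(emissions).sum(axis=1))).sum()
```

The CRF's `log_p_crf` depends on the transition matrix and on the global normalizer, so labels at different positions are coupled; the DNN's `log_p_dnn` factorizes per position.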

slide-63
SLIDE 63

Question & Answer

  • Are all discriminative models learned by supervised learning methods?
  • Since a discriminative model aims to learn p(y|x), I believe it cannot be learned when y is not given. However, as in hybrid models, it is possible to apply unsupervised learning methods to the pretraining of a discriminative model.

63

slide-64
SLIDE 64

Question & Answer

  • In Section 3.4, an example of a hybrid deep neural network is given in which a pre-trained DNN or CNN uses a deep autoencoder. Does this mean that the output learned by the deep autoencoder is fed as input to the pre-trained DNN/CNN? I am also curious what roles the deep autoencoder and the pre-trained DNN/CNN each play.

  • It means that the encoder learned inside the deep autoencoder is used to obtain features from the input, and these features serve as the DNN/CNN's input. The deep autoencoder's encoder thus acts as a feature extractor, and the DNN/CNN acts as a classifier.
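The two roles can be sketched in a few lines; a single encoder layer and a single softmax layer stand in for the deep versions, and all weights, sizes, and data here are hypothetical (the encoder weights are assumed to have been trained beforehand with a reconstruction loss, the classifier with a supervised loss):

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    return np.maximum(z, 0.0)

def softmax(z):
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

W_enc = rng.normal(0.0, 0.5, (8, 3))   # "pretrained" encoder: 8-dim input -> 3-dim code

def encode(x):
    return relu(x @ W_enc)             # role 1: feature extractor

W_clf = rng.normal(0.0, 0.5, (3, 2))   # classifier on top of the codes

def classify(x):
    return softmax(encode(x) @ W_clf)  # role 2: classifier fed by the encoder

probs = classify(rng.normal(size=(4, 8)))   # 4 samples -> class probabilities
```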

64

slide-65
SLIDE 65

Question & Answer

  • The result of preprocessing speech data usually has both a time axis and a frequency axis, so it resembles both 1-D time-series data and 2-D images. How do applying an RNN, a 1-D CNN along the time axis (a TDNN), and a 2-D CNN to such data differ, and in which situations is each architecture the better choice?

  • First, since CNN models, unlike RNN models, parallelize easily and therefore run fast, I think they are a good choice when execution speed matters. I also expect a 2-D CNN to outperform a 1-D CNN because it considers several time frames simultaneously.

65

slide-66
SLIDE 66

Question & Answer

  • Hybrid models also work in the reverse direction (generative for discriminative); the book mentions regularization in which the former acts as a prior for the latter (p. 226). What process does "becoming a prior" refer to (how does the generative model's result feed into the discriminative model), and is there evidence that it is an appropriate prior?

  • It means the generative model is used as the initialization of the discriminative model. I have not read a paper proving that this is an appropriate prior; empirically, it is used because training converges better and yields higher performance than initializing the model's weights randomly.

66

slide-67
SLIDE 67

Question & Answer

  • I am curious about the "Hessian-free optimization" method mentioned in lines 1–9 on p. 221 of the textbook. What are its differences (pros and cons) compared with stochastic gradient descent optimization algorithms?

67

  • In theory it finds better optima than SGD, but it is harder to implement, can be quite unstable, and the conjugate-gradient step is computationally very expensive.
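The core trick can be sketched briefly: Hessian-free methods solve the Newton system H d = -g by conjugate gradient using only Hessian-vector products, never forming H. Below, the product is approximated by finite differences of the gradient on a quadratic test objective, so the exact Hessian is known and the result can be checked; the matrices and starting point are hypothetical:

```python
import numpy as np

# Quadratic test objective f(x) = 0.5 x'Ax - b'x, whose Hessian is exactly A.
A = np.array([[3.0, 1.0],
              [1.0, 2.0]])
b = np.array([1.0, 1.0])

def grad(x):
    return A @ x - b

def hessian_vec(x, v, eps=1e-5):
    # H v ~= (grad(x + eps v) - grad(x)) / eps -- one extra gradient evaluation
    return (grad(x + eps * v) - grad(x)) / eps

def cg(hv, g, iters=10, tol=1e-12):
    # Conjugate gradient for H d = -g, touching H only through the matvec hv
    d = np.zeros_like(g)
    r = -g - hv(d)
    p = r.copy()
    for _ in range(iters):
        if r @ r < tol:
            break
        Hp = hv(p)
        alpha = (r @ r) / (p @ Hp)
        d = d + alpha * p
        r_new = r - alpha * Hp
        p = r_new + ((r_new @ r_new) / (r @ r)) * p
        r = r_new
    return d

x = np.array([2.0, -1.0])
step = cg(lambda v: hessian_vec(x, v), grad(x))
# For a quadratic, x + step lands (numerically) on the minimizer A^{-1} b.
```

The inner CG loop is where the cost mentioned above comes from: each CG iteration needs a fresh Hessian-vector product, i.e., roughly one extra gradient evaluation.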

slide-68
SLIDE 68

Question & Answer

  • Can hybrid deep networks solve the issues of sparsity and the existence of multiple conceptual interpretations of the same word in a speech processing context? If still not, then what do we need to achieve that?

  • Even without a hybrid deep network, given a sufficient amount of data, RNN-family models can take both short- and long-range context into account, so I believe the problem can be solved.

68

slide-69
SLIDE 69

Question & Answer

  • I understand that vanilla RNNs easily run into the gradient-vanishing problem, yet the textbook states: "Mesnil et al. (2013) and Yao et al. (2013) reported the success of RNNs in spoken language understanding." Did these experiments then use improved RNNs rather than vanilla RNNs?

  • The two studies applied Elman-type and Jordan-type RNNs, which are variants of the vanilla RNN. However, unlike long short-term memory (LSTM) or the gated recurrent unit (GRU), these models were not proposed to solve the gradient-vanishing problem.

69

slide-70
SLIDE 70

Question

  • Although it may not be widely used anymore, one method that always comes up when discussing the initialization of deep neural networks is weight initialization using an RBM, or its extension, a DBN. However, since an RBM samples each node's probability over a bipartite structure, I do not see how it can be used for weight initialization. The beginning of Chapter 4 explains the procedure, but what statistical significance do weights initialized this way have?

70

slide-71
SLIDE 71

Answer

  • When computing the probabilities of the hidden layer in an RBM, the visible-layer values are multiplied by the weights; it is these same weights that are used to initialize the DNN's weights. The statistical significance is that weights initialized this way provide the model with a good prior.
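As a sketch of the mechanism: the RBM weight matrix W is trained unsupervised (here with one-step contrastive divergence, biases omitted for brevity) and then copied into the first feed-forward layer, which is valid because both compute sigmoid(v @ W). The data, layer sizes, and hyperparameters below are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_rbm(data, n_hidden, lr=0.1, epochs=50):
    # One-step contrastive divergence (CD-1) for a binary RBM, no biases
    n_visible = data.shape[1]
    W = rng.normal(0.0, 0.01, (n_visible, n_hidden))
    for _ in range(epochs):
        v0 = data
        ph0 = sigmoid(v0 @ W)                              # P(h = 1 | v0)
        h0 = (rng.random(ph0.shape) < ph0).astype(float)   # sample hidden units
        v1 = sigmoid(h0 @ W.T)                             # mean-field reconstruction
        ph1 = sigmoid(v1 @ W)
        W += lr * (v0.T @ ph0 - v1.T @ ph1) / len(data)    # CD-1 update
    return W

data = (rng.random((32, 6)) < 0.5).astype(float)           # toy binary data
W_rbm = train_rbm(data, n_hidden=4)

# The same W that computes P(h|v) = sigmoid(v @ W) in the RBM initializes the
# first sigmoid layer of the DNN before supervised fine-tuning:
dnn_layer1_W = W_rbm.copy()
```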

71

slide-72
SLIDE 72

Question & Answer

  • Page 25 says: "Unsupervised learning refers to no use of task specific supervision information." But if we regard this "task specific supervision information" as the input itself, the model becomes a generative model, and then the distinction between unsupervised and supervised learning disappears. That is, supervised learning learns the function f in y = f(x) given both x and y; if we may regard it as x = f(x), it becomes unsupervised learning. From this viewpoint there is no mathematical difference between supervised and unsupervised learning, so how can the difference between them be explained?

  • The difference lies in their purposes. Supervised learning aims at the classification or regression of data given labels, whereas unsupervised learning aims at uncovering the hidden structure of data when no labels are given.

72

slide-73
SLIDE 73

Question & Answer

  • Page 28 says a DBM can be used for the speech recognition problem. But as far as I know, a DBM performs inference through a fixed-length model; how can it perform inference when the length of the given utterance is unknown? I am curious how a DBM handles speech recognition.

  • First, divide the given speech into fixed-length segments and extract MFCC or mel-spectrogram features from them; then use the DBM once more to extract higher-level features from those. Finally, I think passing this sequence through an RNN makes inference possible even when the length is unknown.

73

slide-74
SLIDE 74

Thank you!

Q & A

74