Deep Learning: Theory and Practice - Recurrent Neural Networks (30-04-2019)


SLIDE 1

Deep Learning: Theory and Practice

30-04-2019

Recurrent Neural Networks

SLIDE 2

Introduction

❖ The standard DNN/CNN paradigms
  ❖ (x, y): an ordered pair of a data vector/image (x) and a target (y)
❖ Moving to sequence data (see the shape sketch below)
  ❖ (x(t), y(t)), where this could be a sequence-to-sequence mapping task.
  ❖ (x(t), y), where this could be a sequence-to-vector mapping task.
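A minimal sketch of what the two sequence settings look like as arrays; the dimensions and names below are illustrative assumptions, not values from the lecture.

import numpy as np

# Illustrative shapes only: T time steps, 40-dim input features, 10 classes.
T, d_in, n_classes = 100, 40, 10

x = np.random.randn(T, d_in)                # x(t): one feature vector per time step

# sequence-to-sequence: one target per time step, e.g. framewise labels
y_seq = np.random.randint(n_classes, size=T)

# sequence-to-vector: a single target for the whole sequence, e.g. a class label
y_vec = np.random.randint(n_classes)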

SLIDE 3

Introduction

❖ Differences from standard CNNs/DNNs
  ❖ (x(t), y(t)), where this could be a sequence-to-sequence mapping task.
❖ Input features / output targets are correlated in time.
  ❖ Unlike standard models, where each (x, y) pair is independent.
❖ Need to model dependencies in the sequence over time.

SLIDE 4

Introduction to Recurrent Networks

“Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville

SLIDE 5

Recurrent Networks

“Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville

SLIDE 6

Recurrent Networks

“Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville
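For reference, the forward pass that the recurrent-network figures on these slides depict, in the standard formulation of the cited textbook; the exact symbols used on the slides may differ. Here U, W, V are the input-to-hidden, hidden-to-hidden, and hidden-to-output weights, with biases b and c:

\begin{aligned}
a^{(t)} &= b + W h^{(t-1)} + U x^{(t)} \\
h^{(t)} &= \tanh\!\left(a^{(t)}\right) \\
o^{(t)} &= c + V h^{(t)} \\
\hat{y}^{(t)} &= \operatorname{softmax}\!\left(o^{(t)}\right)
\end{aligned}

The same parameters (U, W, V, b, c) are shared across every time step, which is what distinguishes the recurrent model from a feed-forward network unrolled in time.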

SLIDE 7

Back Propagation in RNNs

Model Parameters
Gradient Descent
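A sketch of the update being referred to, assuming the parameter set θ = {U, W, V, b, c} from the recurrence above, a loss L summed over the time steps, and learning rate η:

L = \sum_{t} L^{(t)}\!\left(\hat{y}^{(t)}, y^{(t)}\right), \qquad
\theta \leftarrow \theta - \eta \, \nabla_{\theta} L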

SLIDE 8

Recurrent Networks

SLIDE 9

Back Propagation Through Time

SLIDE 10

SLIDE 11

Back Propagation Through Time
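Back-propagation through time is ordinary back-propagation applied to the unrolled graph: run the recurrence forward for all T steps, then accumulate gradients backwards from t = T-1 down to 0, summing each step's contribution to the shared weights. A minimal numpy sketch, assuming the tanh recurrence above (biases omitted for brevity), a linear output layer V, and a squared-error loss at every step; the names U, W, V are illustrative assumptions:

import numpy as np

def bptt(xs, ys, U, W, V):
    T = len(xs)
    h = {-1: np.zeros(W.shape[0])}
    # forward pass: unroll the recurrence over all T steps
    for t in range(T):
        h[t] = np.tanh(U @ xs[t] + W @ h[t - 1])
    dU, dW, dV = np.zeros_like(U), np.zeros_like(W), np.zeros_like(V)
    dh_next = np.zeros(W.shape[0])
    # backward pass: accumulate gradients from t = T-1 down to 0
    for t in reversed(range(T)):
        dy = V @ h[t] - ys[t]            # d(squared error)/d(output) at step t
        dV += np.outer(dy, h[t])
        dh = V.T @ dy + dh_next          # gradient reaching h[t] from the loss and from step t+1
        da = (1.0 - h[t] ** 2) * dh      # back through the tanh non-linearity
        dU += np.outer(da, xs[t])
        dW += np.outer(da, h[t - 1])
        dh_next = W.T @ da               # gradient passed on to step t-1
    return dU, dW, dV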

SLIDE 12

Standard Recurrent Networks

“Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville

SLIDE 13

Other Recurrent Networks

“Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville

Teacher Forcing Networks

SLIDE 14

Recurrent Networks

“Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville

Teacher Forcing Networks
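A minimal sketch of the teacher-forcing idea: during training, the recurrence is conditioned on the ground-truth previous output y(t-1) rather than the model's own previous prediction, so each step can be trained without waiting for the model to produce good outputs. The parameter names U, W, R, V are assumptions for illustration:

import numpy as np

def teacher_forcing_step(x_t, y_prev_true, h_prev, U, W, R, V):
    # the hidden state sees the current input and the *true* previous target
    h_t = np.tanh(U @ x_t + W @ h_prev + R @ y_prev_true)
    y_hat_t = V @ h_t                    # prediction for the current step
    return h_t, y_hat_t

# At test time the true y(t-1) is unavailable, so the model's own previous
# prediction y_hat(t-1) is fed back in its place.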

SLIDE 15

Recurrent Networks

Multiple Input Single Output

SLIDE 16

Recurrent Networks

Single Input Multiple Output

SLIDE 17

Recurrent Networks

Bi-directional Networks
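The bidirectional idea in equations, a sketch with assumed parameter names: one recurrence runs forward over the sequence, another runs backward, and the output at each step sees both states.

\begin{aligned}
\overrightarrow{h}^{(t)} &= \tanh\!\left(\overrightarrow{U} x^{(t)} + \overrightarrow{W}\,\overrightarrow{h}^{(t-1)}\right) \\
\overleftarrow{h}^{(t)} &= \tanh\!\left(\overleftarrow{U} x^{(t)} + \overleftarrow{W}\,\overleftarrow{h}^{(t+1)}\right) \\
o^{(t)} &= V \left[\overrightarrow{h}^{(t)};\, \overleftarrow{h}^{(t)}\right] + c
\end{aligned}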

SLIDE 18

Recurrent Networks

Sequence to Sequence Mapping Networks

SLIDE 19

SLIDE 20

Long-term Dependency Issues

SLIDE 21

Vanishing/Exploding Gradients

❖ Gradients either vanish or explode.
  ❖ Initial frames may not contribute to the gradient computation, or may contribute too much (see the sketch below).
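The reason, sketched with the tanh recurrence from the earlier slides: the gradient that reaches an early hidden state is a product of per-step Jacobians, so it shrinks or grows roughly geometrically with the time gap.

\frac{\partial h^{(T)}}{\partial h^{(t)}}
  = \prod_{k=t+1}^{T} \frac{\partial h^{(k)}}{\partial h^{(k-1)}}
  = \prod_{k=t+1}^{T} \operatorname{diag}\!\left(1 - \big(h^{(k)}\big)^{2}\right) W

When the singular values of W are consistently below one the product vanishes, and when they are above one it explodes.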

SLIDE 22

Long Short-Term Memory

SLIDE 23

LSTM Cell

Input gate, forget gate, cell, output gate, LSTM output
f - sigmoid function; g, h - tanh functions
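One common write-up of the cell the slide labels; the notation here is an assumption, not necessarily the slide's. σ is the logistic sigmoid (the slide's f), g and h are tanh, x_t is the input, and m_{t-1} is the previous LSTM output:

\begin{aligned}
i_t &= \sigma\!\left(W_{ix} x_t + W_{im} m_{t-1} + b_i\right) \\
f_t &= \sigma\!\left(W_{fx} x_t + W_{fm} m_{t-1} + b_f\right) \\
c_t &= f_t \odot c_{t-1} + i_t \odot g\!\left(W_{cx} x_t + W_{cm} m_{t-1} + b_c\right) \\
o_t &= \sigma\!\left(W_{ox} x_t + W_{om} m_{t-1} + b_o\right) \\
m_t &= o_t \odot h\!\left(c_t\right)
\end{aligned}

The additive cell update c_t = f_t ⊙ c_{t-1} + … is what lets gradients flow across long spans without vanishing.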

SLIDE 24

Long Short Term Memory Networks

SLIDE 25

Gated Recurrent Units (GRU)
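For comparison, the GRU merges the input and forget gates into a single update gate z_t and drops the separate cell state. A sketch of one common formulation; which of z_t and 1 - z_t multiplies the old state varies between write-ups:

\begin{aligned}
z_t &= \sigma\!\left(W_z x_t + U_z h_{t-1}\right) \\
r_t &= \sigma\!\left(W_r x_t + U_r h_{t-1}\right) \\
\tilde{h}_t &= \tanh\!\left(W x_t + U\,(r_t \odot h_{t-1})\right) \\
h_t &= (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t
\end{aligned}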

SLIDE 26

Attention in LSTM Networks

❖ Attention provides a mechanism to add relevance.
  ❖ Certain regions of the audio are more important than the rest for the task at hand (see the pooling sketch below).
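A minimal sketch of one way such relevance weighting can be computed over the recurrent hidden states: softmax-normalised scores used to form a weighted average. The parameter names w, W, b are assumptions, not the lecture's:

import numpy as np

def softmax(e):
    e = e - e.max()                # for numerical stability
    p = np.exp(e)
    return p / p.sum()

def attention_pool(h, w, W, b):
    # h: (T, d) hidden states, one per frame; returns a single pooled vector
    e = np.tanh(h @ W + b) @ w     # un-normalised relevance score per frame
    alpha = softmax(e)             # attention weights: non-negative, sum to 1
    return alpha @ h, alpha        # weighted sum of states, plus the weights

Frames with large alpha contribute most to the pooled representation.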

SLIDE 27

SLIDE 28

Encoder-Decoder Networks with Attention
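In the encoder-decoder setting the weighting is recomputed at every decoder step. A sketch of the usual formulation, where h_s are encoder states, s_{t-1} is the previous decoder state, and score(·,·) is a small learned function; the exact form is an assumption:

\begin{aligned}
e_{t,s} &= \operatorname{score}\!\left(s_{t-1}, h_s\right) \\
\alpha_{t,s} &= \frac{\exp\!\left(e_{t,s}\right)}{\sum_{s'} \exp\!\left(e_{t,s'}\right)} \\
c_t &= \sum_{s} \alpha_{t,s}\, h_s \\
s_t &= \operatorname{RNN}\!\left(s_{t-1}, y_{t-1}, c_t\right)
\end{aligned}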

SLIDE 29

Attention Models

SLIDE 30

Attention - Speech Example

From our lab (part of an ICASSP 2019 paper).

SLIDE 31

Language Recognition Evaluation

SLIDE 32

End-to-end model using GRUs and Attention

SLIDE 33

Proposed End-to-End Language Recognition Model

SLIDE 34

Proposed End-to-End Language Recognition Model

SLIDE 35

Proposed End-to-End Language Recognition Model

SLIDE 36

Language Recognition Evaluation

0-3s: O... One muscle at all, it was terrible
3s-4s: ... ah ... ah ...
4s-9s: I couldn't scream, I couldn't shout, I couldn't even move my arms up, or my legs
9s-11s: I was trying me hardest, I was really really panicking.

[Figure: attention weights over time, aligned with the transcript above]

Bharat Padi et al., “End-to-end language recognition using hierarchical gated recurrent networks,” under review, 2018.

We proposed the attention model: attention weighs the importance of each short-term segment feature for the task. State-of-the-art models use the input sequence directly.

SLIDE 37

Language Recognition Evaluation

SLIDE 38

Language Recognition Evaluation