Deep Learning for Language Understanding (at Google Scale), Anjuli Kannan



SLIDE 1

Confidential + Proprietary

Deep Learning for Language Understanding (at Google Scale)

Anjuli Kannan Software Engineer, Google Brain

SLIDE 2

Text is just a sequence of words

["hi", "team", "the", "server", "appears", "to", "be", "dropping", "about", "10%", …]
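To make this concrete, here is a small Python sketch (not from the slides; the toy vocabulary is illustrative) of how a sequence of words becomes the sequence of integer ids a neural network actually consumes:

```python
# Illustrative only: a toy message and vocabulary, not Google's pipeline.
message = ["hi", "team", "the", "server", "appears", "to", "be", "dropping"]

# Build a toy vocabulary mapping each distinct word to an integer id.
vocab = {word: idx for idx, word in enumerate(sorted(set(message)))}

# The model consumes the message as a sequence of ids, not raw strings.
ids = [vocab[w] for w in message]
```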

SLIDE 3

About me

  • My team: Google Brain

    ○ "Make machines intelligent, improve people's lives."
    ○ Research + software + applications
    ○ g.co/brain

  • My work is at the boundary of research and applications
  • Focus on natural language understanding
SLIDE 4

Neural network basics

SLIDE 5

Neural network

[Diagram: neural network mapping a handwritten digit image to outputs such as "Is a 4", "Is a 5", …]

Image: Wikipedia

SLIDE 6

Neural network

Neuron

[Diagram: the same network with a single neuron highlighted; outputs "Is a 4", "Is a 5"]

SLIDE 7

Basic building block is the neuron

Greg Corrado
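A neuron can be sketched in a few lines of Python (a simplified illustration, not the slide's exact formulation; the sigmoid nonlinearity shown here is one common choice):

```python
import math

def neuron(inputs, weights, bias):
    """One neuron: weighted sum of inputs plus a bias, passed
    through a sigmoid nonlinearity that squashes the result to (0, 1)."""
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z))
```

For example, `neuron([1.0, 0.5], [0.2, -0.4], 0.1)` returns a value between 0 and 1; a full network stacks many such neurons in layers.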

SLIDE 8

Gradient descent

w’ = w - α ∂L(w)/∂w

α is the learning rate.

[Diagram: one gradient step moves w to w’ along the loss curve]

Slide: Vincent Vanhoucke
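The update rule above can be sketched directly in Python (a minimal illustration; the loss L(w) = w² and the learning rate are invented for the example):

```python
def gradient_step(w, grad, learning_rate=0.1):
    """One gradient-descent update: w' = w - learning_rate * dL/dw."""
    return w - learning_rate * grad

# Toy example: minimize L(w) = w**2, whose gradient is 2*w.
w = 4.0
for _ in range(100):
    w = gradient_step(w, 2 * w)
# Repeated steps shrink w toward the minimum at 0.
```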

SLIDE 9

Recurrent neural networks

SLIDE 10

Recurrent neural networks can model sequences

SLIDE 11

Recurrent neural networks can model sequences

[Diagram: RNN reading the message, first word: "How"]

SLIDE 12

[Diagram: RNN reading the message so far: "How are"]

Recurrent neural networks can model sequences

SLIDE 13

[Diagram: RNN reading the message so far: "How are you"]

Recurrent neural networks can model sequences

SLIDE 14

[Diagram: RNN has read the full message: "How are you ?"]

Recurrent neural networks can model sequences

SLIDE 15

Internal state is a fixed-length encoding of the message

[Diagram: RNN after reading "How are you ?"; its final state encodes the whole message]

Recurrent neural networks can model sequences
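The "internal state" idea can be sketched as a single recurrent step applied once per word (a scalar toy version, not a real RNN cell; the weights and inputs below are invented for illustration):

```python
import math

def rnn_step(state, x, w_x=0.5, w_h=0.9, b=0.0):
    """One recurrent step: the new state mixes the current input
    with the previous state, then squashes the result with tanh."""
    return math.tanh(w_x * x + w_h * state + b)

# Feed the message one (toy, scalar) token at a time; the final
# state is a fixed-length encoding of the whole sequence.
state = 0.0
for x in [0.2, -0.1, 0.7, 0.4]:  # stand-ins for "How", "are", "you", "?"
    state = rnn_step(state, x)
```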

SLIDE 16

Sequence-to-sequence models

SLIDE 17

Suppose we want to generate email replies

[Diagram: Smart Reply takes an incoming email and produces a response email]

SLIDE 18

Sequence-to-sequence model

Sutskever et al, NIPS 2014

SLIDE 19

Sequence-to-sequence model

[Diagram: encoder network feeding its final state into the decoder network]

SLIDE 20

Sequence-to-sequence model

Encoder: ingests the incoming message. Decoder: generates the reply message.

SLIDE 21

Encoder ingests the incoming message

Internal state is a fixed-length encoding of the message

[Diagram: encoder reading "How are you ?"]

SLIDE 22

Decoder is initialized with final state of encoder

[Diagram: decoder starts from the encoder's final state after reading "How are you ?"]

SLIDE 24

[Diagram: given "How are you ?", the decoder emits "I" as the first word of the response]

Decoder predicts next word

SLIDE 25

[Diagram: decoder has emitted "I", predicts "am"]

Decoder predicts next word

SLIDE 26

[Diagram: decoder has emitted "I am", predicts "great"]

Decoder predicts next word

SLIDE 27

[Diagram: decoder has emitted "I am great", predicts "!"]

Vinyals & Le, ICML DL 2015; Kannan et al, KDD 2016

Decoder predicts next word
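The decoding loop on these slides can be sketched as greedy decoding (the `predict_next` function below is a canned stand-in for the trained decoder, invented purely for illustration):

```python
def predict_next(encoder_state, emitted):
    """Toy stand-in for the decoder network: replays a canned reply."""
    reply = ["I", "am", "great", "!", "<eos>"]
    return reply[len(emitted)]

def decode(encoder_state, max_len=10):
    """Start from the encoder's final state and emit one word at a
    time, feeding each prediction back in, until an end-of-sequence
    token (or a length cap) is reached."""
    emitted = []
    for _ in range(max_len):
        word = predict_next(encoder_state, emitted)
        if word == "<eos>":
            break
        emitted.append(word)
    return emitted
```

Here `decode(None)` yields `["I", "am", "great", "!"]`, mirroring the animation on slides 24 through 27.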

SLIDE 28

What the model can do

SLIDE 29

What the model can do

SLIDE 30

Summary

  • Neural networks learn feature representations from raw data
  • Recurrent neural networks have statefulness, which allows them to model sequences of data such as text
  • The sequence-to-sequence model contains two recurrent neural networks: one to encode an input sequence and one to generate an output sequence

SLIDE 31

Smart Reply

SLIDE 32

Google Translate

SLIDE 33

Research: Speech recognition

SLIDE 34

Research: Electronic health records

SLIDE 35

What's next?


SLIDE 36

Resources

  • All TensorFlow tutorials: https://www.tensorflow.org/versions/master/tutorials/index.html
  • Sequence-to-sequence tutorial (machine translation): https://www.tensorflow.org/versions/master/tutorials/seq2seq
  • Chris Olah's blog: http://colah.github.io/
SLIDE 37

Thank you!