Large-Scale Deep Learning With TensorFlow
Jeff Dean Google Brain team g.co/brain
In collaboration with many other people at Google
What is the Google Brain Team?
○ Research team focused on long-term artificial intelligence research
○ See papers at research.google.com/pubs/BrainTeam.html
Across many products/areas: Android, Apps, drug discovery, Gmail, Image understanding, Maps, Natural language understanding, Photos, Robotics research, Speech, Translation, YouTube, ... many others
[Chart: growth over time in # of directories containing model description files]
Open, standard software for general machine learning; great for Deep Learning in particular.
First released Nov. 2015, Apache 2.0 license.
http://tensorflow.org/ and https://github.com/tensorflow/tensorflow
[Chart: adoption of TensorFlow (GitHub launch Nov. 2015) vs. projects launched Sep. 2013, Jan. 2012, and Jan. 2008]
50,000+ binary installs in 72 hours, 500,000+ since November 2015.
Most forked new repo on GitHub in 2015 (despite only being available in Nov. '15).
Core TensorFlow Execution System: CPU, GPU, Android, iOS, ...
C++ front end and Python front end sit on top of the core:
○ Very low overhead
○ Python and C++ today, easy to add more
Graph of Nodes, also called Operations or ops: MatMul, Add, Relu, Xent, fed by biases, weights, examples, and labels.
Edges are N-dimensional arrays: Tensors.
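The MatMul → Add → Relu graph above can be sketched as a toy dataflow evaluator in plain Python (illustrative only; this is not the TensorFlow API, whose real execution engine is in C++):

```python
# Toy dataflow graph: nodes are ops, edges carry tensors (nested lists here).

def matmul(a, b):
    # a: m x k, b: k x n matrices as nested lists
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(a, bias):
    # add a bias vector to every row
    return [[x + bias[j] for j, x in enumerate(row)] for row in a]

def relu(a):
    # elementwise max(0, x)
    return [[max(0.0, x) for x in row] for row in a]

# examples -> MatMul(weights) -> Add(biases) -> Relu
examples = [[1.0, -2.0]]
weights = [[1.0, 0.0], [0.0, 1.0]]
biases = [0.5, 0.5]
out = relu(add(matmul(examples, weights), biases))
print(out)  # [[1.5, 0.0]]
```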
import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

mnist = input_data.read_data_sets('MNIST_data', one_hot=True)

x = tf.placeholder("float", shape=[None, 784])
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
y = tf.nn.softmax(tf.matmul(x, W) + b)
[Diagram: gradient graph with Add, Mul, biases, learning rate, ...]
'Biases' is a variable; some ops compute gradients; −= updates biases.
y_ = tf.placeholder(tf.float32, [None, 10])
cross_entropy = -tf.reduce_sum(y_ * tf.log(y))

opt = tf.train.GradientDescentOptimizer(0.01)  # optimizer not shown on the slide; learning rate illustrative
train_step = opt.minimize(cross_entropy)

init = tf.initialize_all_variables()
sess = tf.Session()
sess.run(init)
for i in range(1000):
  batch_xs, batch_ys = mnist.train.next_batch(100)
  sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys})
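What the loss above computes, -sum(y_ * log(y)) over a softmax output, can be checked in a few lines of plain Python (a sketch of the math, not the tutorial code):

```python
import math

def softmax(logits):
    # subtract the max for numerical stability before exponentiating
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    s = sum(exps)
    return [e / s for e in exps]

def cross_entropy(probs, one_hot):
    # -sum(y_ * log(y)), matching the slide's loss for one example
    return -sum(t * math.log(p) for p, t in zip(probs, one_hot))

y = softmax([2.0, 1.0, 0.1])
loss = cross_entropy(y, [1, 0, 0])
print(round(sum(y), 6))  # 1.0
```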
[Diagram: graph placed across devices (GPU 0 and CPU): Add, Mul, Assign, Sub, biases, learning rate, ...]
TensorFlow inserts Send/Recv node pairs for cross-device communication.
○ Interactive research! Instant gratification!
○ Tolerable; interactivity replaced by running many experiments in parallel
○ High-value experiments only; progress stalls
○ Don’t even try
Data parallelism: Parameter Servers hold the shared model parameters; Model Replicas each train on a shard of the Data.
1. A replica reads the current parameters p from the parameter servers.
2. It computes an update ∆p on its shard of data.
3. The parameter servers apply the update: p' = p + ∆p.
4. The next read returns p'; a later update ∆p' is applied the same way: p'' = p' + ∆p'.
Graph structure and low-level graph primitives (queues) allow us to play with synchronous vs. asynchronous update algorithms.
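The asynchronous update rule (each replica independently applying p' = p + ∆p to shared parameters) can be simulated in a few lines; this is a sketch of the idea, not TensorFlow's distributed runtime:

```python
import threading

# One "parameter server" cell guarded by a lock; replicas apply
# their updates asynchronously: p' = p + delta_p.
params = {"p": 0.0}
lock = threading.Lock()

def replica(deltas):
    for dp in deltas:
        with lock:            # each update is atomic, but replicas interleave
            params["p"] += dp # p' = p + delta_p

threads = [threading.Thread(target=replica, args=([0.1] * 10,))
           for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(round(params["p"], 6))  # 4.0 (the order of updates differs run to run)
```

The final value is the same here because addition commutes; with real gradients computed from stale parameters, the interleaving matters, which is exactly the synchronous-vs-asynchronous trade-off.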
[Diagram: same graph placed across jobs (/job:worker/cpu:0 and /job:ps/gpu:0): Add, Mul, Assign, Sub, biases, learning rate, ...]
Send/Recv node pairs handle cross-device communication.
No specialized parameter server subsystem!
[Chart: hours to convergence for 1 GPU vs. 10 GPUs vs. 50 GPUs]
2.6 hours (50 GPUs) vs. 79.3 hours (1 GPU): a 30.5X speedup.
Synchronous updates (with backup workers) train to higher accuracy faster: 40 hours vs. 50 hours. Better scaling to more workers (less loss of accuracy).
Revisiting Distributed Synchronous SGD, Jianmin Chen, Rajat Monga, Samy Bengio, Rafal Jozefowicz, ICLR Workshop 2016, arxiv.org/abs/1604.00981
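The backup-worker idea is to launch more gradient computations than needed each step and apply only the first N to arrive, so stragglers never stall the step. A minimal sketch (names and numbers illustrative):

```python
# Sketch: apply the first n_needed of n_needed + backups gradients;
# the slowest workers (stragglers) are simply dropped for this step.
def sync_step(p, gradients, n_needed):
    used = gradients[:n_needed]          # gradients listed in arrival order
    return p + sum(used) / len(used)     # averaged synchronous update

# 5 workers launched, only the first 4 to finish are used
arrived = [0.2, 0.4, 0.1, 0.3, 9.9]      # the last value never gets applied
p = sync_step(0.0, arrived, n_needed=4)
print(round(p, 6))  # 0.25
```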
TensorFlow runs on phones, single machines (CPU and/or GPUs), distributed systems of 100s, ... and custom ML hardware.
Tensor Processing Unit (TPU): custom machine learning ASIC.
In production use for >16 months: used on every search query, used for the AlphaGo match, ...
See Google Cloud Platform blog: Google supercharges machine learning tasks with TPU custom chip, by Norm Jouppi, May, 2016
[Hochreiter & Schmidhuber, 1997]
[Diagram: LSTM cell, input X → output Y, with WRITE (W), READ (R), and FORGET (F) gates implemented as sigmoids]
Enables long-term dependencies to flow.
for i in range(20):
  m, c = LSTMCell(x[i], mprev, cprev)
  mprev = m
  cprev = c
Unrolled over depth as well:

for i in range(20):
  for d in range(4):  # d is depth
    input = x[i] if d == 0 else m[d - 1]
    m[d], c[d] = LSTMCell(input, mprev[d], cprev[d])
    mprev[d] = m[d]
    cprev[d] = c[d]

Placing each layer on its own GPU:

for i in range(20):
  for d in range(4):  # d is depth
    with tf.device("/gpu:%d" % d):
      input = x[i] if d == 0 else m[d - 1]
      m[d], c[d] = LSTMCell(input, mprev[d], cprev[d])
      mprev[d] = m[d]
      cprev[d] = c[d]
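LSTMCell in the pseudocode above is TensorFlow's cell; a scalar plain-Python sketch of the write/read/forget gating (all weights fixed to illustrative values, not a real parameterization) shows what one call computes:

```python
import math

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def lstm_cell(x, m_prev, c_prev, w=1.0):
    # Illustrative scalar LSTM: sigmoid gates decide how much old memory
    # to FORGET, how much new input to WRITE, and how much to READ out.
    g = w * (x + m_prev)
    f = sigmoid(g)                                # forget gate
    i = sigmoid(g)                                # write (input) gate
    o = sigmoid(g)                                # read (output) gate
    c = f * c_prev + i * math.tanh(x + m_prev)    # new cell memory
    m = o * math.tanh(c)                          # new output
    return m, c

mprev, cprev = 0.0, 0.0
for x in [1.0, 0.5, -0.5]:          # unrolled over timesteps, as on the slide
    mprev, cprev = lstm_cell(x, mprev, cprev)
```

In a real LSTM each gate has its own learned weights; tying them together here just keeps the gating mechanics visible.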
[Diagram: sequence-to-sequence model split across GPUs. Encoder reads A B C D; decoder reads _ A B C and emits A B C D. LSTM layers on GPU1-GPU4; softmax split across GPU5, GPU6, ...]
1000 LSTM cells; 2000 dims per timestep; 2000 x 4 = 8k dims per sentence.
80k softmax by 1000 dims: this is very big! Split softmax into 4 GPUs.
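Splitting a large softmax across GPUs amounts to sharding the vocabulary, computing per-shard maxes and partial exponential sums, and combining them. A single-process sketch of that combination step (shard boundaries illustrative):

```python
import math

def sharded_softmax(logits, num_shards):
    # Each "GPU" owns a contiguous vocabulary shard.
    n = len(logits)
    size = (n + num_shards - 1) // num_shards
    shards = [logits[i:i + size] for i in range(0, n, size)]
    # Pass 1: per-shard max, reduced to a global max (numerical stability).
    g_max = max(max(s) for s in shards)
    # Pass 2: per-shard partial sums of exp, reduced to a global sum.
    partial = [sum(math.exp(v - g_max) for v in s) for s in shards]
    total = sum(partial)
    # Each shard can then normalize its own slice with the global total.
    return [math.exp(v - g_max) / total for v in logits]

probs = sharded_softmax([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0], 4)
print(round(sum(probs), 6))  # 1.0
```

Only two scalars per shard (max and partial sum) cross device boundaries, which is why the split is cheap relative to an 80k x 1000 matrix.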
Acoustic Input → Text Output
Google Research Blog - August 2012, August 2015
Going Deeper with Convolutions
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich ArXiv 2014, CVPR 2015
Team                             Year  Place  Error (top-5)
XRCE (pre-neural-net explosion)  2011  1st    25.8%
Supervision (AlexNet)            2012  1st    16.4%
Clarifai                         2013  1st    11.7%
GoogLeNet (Inception)            2014  1st    6.66%
Andrej Karpathy (human)          2014  N/A    5.1%
BN-Inception (Arxiv)             2015  N/A    4.9%
Inception-v3 (Arxiv)             2015  N/A    3.46%
ImageNet challenge classification task
Your Photo Automatic Tag
Google Research Blog - June 2013
www.google.com/sunroof
Score for a (doc, query) pair, computed from query & document features.
Example: Query: "car parts for sale"; Doc: "Rebuilt transmissions …"
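A network that scores a (doc, query) pair consumes joint features and emits one scalar; a minimal one-hidden-layer sketch (feature names, weights, and sizes are all invented for illustration, not the production model):

```python
def score(features, w1, b1, w2):
    # One hidden ReLU layer, then a linear scalar score.
    hidden = [max(0.0, sum(f * w for f, w in zip(features, col)) + b)
              for col, b in zip(w1, b1)]
    return sum(h * w for h, w in zip(hidden, w2))

# query & document features (e.g. term-overlap-style signals): invented
feats = [0.8, 0.1, 0.5]
w1 = [[0.5, 0.2, 0.1], [0.3, -0.4, 0.6]]   # 2 hidden units, 3 inputs
b1 = [0.0, 0.1]
w2 = [1.0, 0.5]
print(round(score(feats, w1, b1, w2), 4))  # 0.77
```

Ranking then reduces to sorting candidate documents by this scalar for a given query.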
Bloomberg, Oct 2015: “Google Turning Its Lucrative Web Search Over to AI Machines”
[Diagram: input sequence A B C D → v → target sequence X Y Z Q]
Deep LSTM [Sutskever & Vinyals & Le, NIPS 2014]
Example: input sentence "Quelle est votre taille?" <EOS> → target sentence decoded one token at a time: "How" → "How tall" → "How tall are" → "How tall are you?"
At inference time: beam search to choose the most probable sentence.
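Beam search keeps the k most probable partial sequences at each decoding step instead of committing greedily. A toy sketch over a fixed next-token table (the vocabulary and probabilities are made up; a real decoder gets them from the LSTM):

```python
import math

# Toy conditional log-probs: P(next token | last token); made-up numbers.
LOGP = {
    "<s>":  {"how": math.log(0.6), "what": math.log(0.4)},
    "how":  {"tall": math.log(0.7), "old": math.log(0.3)},
    "what": {"is": math.log(0.9), "if": math.log(0.1)},
    "tall": {"</s>": 0.0}, "old": {"</s>": 0.0},
    "is":   {"</s>": 0.0}, "if":  {"</s>": 0.0},
}

def beam_search(k=2, steps=3):
    beams = [(["<s>"], 0.0)]                 # (tokens, total log-prob)
    for _ in range(steps):
        cand = []
        for toks, lp in beams:
            for nxt, nlp in LOGP.get(toks[-1], {}).items():
                cand.append((toks + [nxt], lp + nlp))
        beams = sorted(cand, key=lambda c: -c[1])[:k]   # keep top-k
    return beams[0][0]

print(beam_search())  # ['<s>', 'how', 'tall', '</s>']
```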
April 1, 2009: April Fool's Day joke
Nov 5, 2015: launched real product
Feb 1, 2016: >10% of mobile Inbox replies
Incoming Email → Small Feed-Forward Neural Network → Activate Smart Reply? → Generated Replies
Google Research Blog
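The "Activate Smart Reply?" stage is a small, cheap feed-forward network acting as a binary trigger before any reply generation runs. A logistic-gate sketch (features, weights, and threshold are all invented for illustration):

```python
import math

def should_activate(features, weights, bias, threshold=0.5):
    # Small feed-forward trigger: logistic score over simple email features.
    z = sum(f * w for f, w in zip(features, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z)) > threshold

# features: [is_question, short_email, from_contact]: invented names
weights = [2.0, 1.0, 0.5]
print(should_activate([1, 1, 1], weights, bias=-1.5))  # True
print(should_activate([0, 0, 0], weights, bias=-1.5))  # False
```

Running a tiny gate on every email and the expensive generator only when it fires is what keeps the pipeline affordable at Gmail scale.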
[Diagram: image captioning decoder generating "A young girl asleep" one word at a time]
[Vinyals et al., CVPR 2015]
"Deep Learning for Robots: Learning from Large-Scale Interaction", Google Research Blog, March 2016
"Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection", Sergey Levine, Peter Pastor, Alex Krizhevsky, & Deirdre Quillen, arxiv.org/abs/1603.02199
cloud.google.com/translate cloud.google.com/speech cloud.google.com/vision cloud.google.com/text
Large Scale Distributed Deep Networks, NIPS 2012, research.google.com/archive/large_deep_networks_nips2012.html
Efficient Estimation of Word Representations in Vector Space, NIPS 2013, arxiv.org/abs/1301.3781
Sequence to Sequence Learning with Neural Networks, NIPS 2014, arxiv.org/abs/1409.3215
Show and Tell: A Neural Image Caption Generator, CVPR 2015, arxiv.org/abs/1411.4555
g.co/brain (We're hiring! Also check out the Brain Residency program at g.co/brainresidency)
www.tensorflow.org
research.google.com/people/jeff
research.google.com/pubs/BrainTeam.html