Distributed Deep Learning at Scale
Soumith Chintala
Facebook AI Research
Overview
- Deep Learning Research at FAIR
- Deep Learning on GPUs
- Deep Learning at Scale
- Emerging Trends
Deep Learning Research at Facebook AI Research
Image Intelligence: Classification
Image Intelligence
Language Translation from Visual Learning
Image Intelligence: Detection
[Figure: VGG-based network with 1x1 conv and 2x2 pool layers; input x: 3x224x224 → 512x14x14 → 512x7x7 → 512x1x1 → 1024x1x1; f_segm(x): 224x224, f_score(x): 1x1]
[Figure: detection pipeline producing per-image scores]
Image Intelligence
https://code.facebook.com/posts/accessibility/
Video Intelligence
Image and Video Generation
Predicting the Future
Natural Language Understanding
- Memory networks
- Language Translation
- Reading, writing and answering questions
- Chatbots, personal assistants
Deep Learning at Scale
GPU-powered Convolutional Neural Networks
Alex Krizhevsky
- Convolutions, GEMM take all the time
- Faster convolutions = faster research
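To make the "convolutions are GEMM" point concrete, here is a minimal sketch (not from the talk) of lowering a single-channel convolution to one matrix multiply via im2col; the helper name `im2col_1ch` and the toy sizes are illustrative:

```python
import numpy as np

# Sketch: expressing a convolution as a matrix multiply (GEMM) via im2col.
# This lowering is why fast GEMM kernels dominate deep-learning runtime.

def im2col_1ch(x, k):
    """Unroll each k-by-k patch of a 2D image into one row of a matrix."""
    H, W = x.shape
    out_h, out_w = H - k + 1, W - k + 1
    cols = np.empty((out_h * out_w, k * k))
    for i in range(out_h):
        for j in range(out_w):
            cols[i * out_w + j] = x[i:i + k, j:j + k].ravel()
    return cols

rng = np.random.default_rng(0)
x = rng.normal(size=(6, 6))   # single-channel input
w = rng.normal(size=(3, 3))   # single 3x3 filter

y_gemm = (im2col_1ch(x, 3) @ w.ravel()).reshape(4, 4)  # conv as one GEMM

# Reference: direct sliding-window correlation.
y_ref = np.array([[np.sum(x[i:i + 3, j:j + 3] * w) for j in range(4)]
                  for i in range(4)])
print(np.allclose(y_gemm, y_ref))  # True
```

With many filters, `w.ravel()` becomes a matrix of filter rows and the whole layer is a single large GEMM.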
Winograd-transform-based Convolutions
- The standard in deep learning: NVIDIA GPUs + CUDA + cuDNN
- Exotic new hardware!
- Custom chips (Yunji Chen et al., Nervana Systems)
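A minimal illustration of the Winograd idea, using the standard F(2,3) transform matrices (Lavin and Gray); this toy 1D sketch is not the talk's implementation. Two outputs of a 3-tap filter are computed with 4 element-wise multiplies instead of the 6 a direct sliding dot product needs:

```python
import numpy as np

# Winograd F(2,3): 2 outputs of a 1D correlation with a 3-tap filter
# using 4 multiplies instead of 6.
BT = np.array([[1, 0, -1, 0],
               [0, 1,  1, 0],
               [0, -1, 1, 0],
               [0, 1,  0, -1]], dtype=float)   # input transform
G = np.array([[1.0, 0.0, 0.0],
              [0.5, 0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0, 0.0, 1.0]])                # filter transform
AT = np.array([[1, 1,  1,  0],
               [0, 1, -1, -1]], dtype=float)   # output transform

def winograd_f23(d, g):
    # Element-wise product in the transform domain: only 4 multiplies.
    return AT @ ((G @ g) * (BT @ d))

d = np.array([1.0, 2.0, 3.0, 4.0])   # 4 input samples
g = np.array([0.5, 1.0, -1.0])       # 3 filter taps
direct = np.array([d[0:3] @ g, d[1:4] @ g])  # naive sliding dot product
print(np.allclose(winograd_f23(d, g), direct))  # True
```

The 2D case used for 3x3 convolutions nests this transform over tiles, which is where the large practical speedups come from.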
- Use multiple GPUs on single machine
Multi-GPU Training
- Data parallel
- Model parallel
- Pipeline-parallel
- Bottleneck: interconnects
- Multi-machine SGD
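As a sketch of the data-parallel scheme (illustrative, not the talk's Torch code): shard the minibatch across devices, compute per-shard gradients, then average them, as an all-reduce across GPUs would. The toy least-squares loss and hyperparameters are assumptions:

```python
import numpy as np

# Data-parallel SGD on a toy least-squares problem. Each "GPU" is just a
# slice of the minibatch; the mean over per-shard gradients stands in for
# the all-reduce step.
rng = np.random.default_rng(0)
n_gpus = 4
X = rng.normal(size=(64, 5))
true_w = np.arange(5.0)
y = X @ true_w

w = np.zeros(5)
for step in range(500):
    grads = []
    for shard_x, shard_y in zip(np.array_split(X, n_gpus),
                                np.array_split(y, n_gpus)):
        err = shard_x @ w - shard_y
        grads.append(shard_x.T @ err / len(shard_y))  # per-GPU gradient
    w -= 0.1 * np.mean(grads, axis=0)  # "all-reduce": average, then apply

print(np.allclose(w, true_w, atol=1e-3))  # True
```

Model-parallel and pipeline-parallel instead split the network itself across devices, which is why interconnect bandwidth becomes the bottleneck.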
Multi-Machine Training
- Multi-machine SGD
[Figure: parameter server; workers send gradients, the server sends back weights]
- Elastic Averaging SGD (Sixin Zhang, Anna Choromanska, Yann LeCun)
  - Train synchronously
  - Occasionally, check with the master (or with neighbors)
  - Don't go too far from everyone else
- Empirical speedup of √N (N = number of nodes)
- No communication overhead with pre-fetching
- 128 GPUs (32 clients × 4 GPUs), parameters sharded over 64 CPU servers
- τ = 10, prefetch = 5: zero overhead
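The elastic-averaging rule can be sketched on a toy quadratic: each worker follows its own gradient plus an elastic pull toward a shared center variable, and the center drifts toward the workers' average. The learning rate, coupling strength, and loss below are illustrative assumptions, not the paper's settings:

```python
import numpy as np

# Toy EASGD sketch. Each of N workers minimizes f(x) = 0.5*||x - target||^2,
# with an elastic penalty keeping it near the shared center ("master") copy.
rng = np.random.default_rng(0)
N = 4                    # number of workers
dim = 3
target = np.ones(dim)    # common minimum of every worker's loss
eta = 0.1                # learning rate
alpha = 0.05             # elastic coupling strength (alpha = eta * rho)

workers = [rng.normal(size=dim) for _ in range(N)]
center = np.zeros(dim)   # the master's copy of the parameters

for step in range(200):
    diffs = []
    for i in range(N):
        grad = workers[i] - target                # gradient of the quadratic
        elastic = alpha * (workers[i] - center)   # pull toward the center
        workers[i] = workers[i] - eta * grad - elastic
        diffs.append(workers[i] - center)
    center = center + alpha * sum(diffs)          # center moves toward workers

print(np.allclose(center, target, atol=1e-2))  # True
```

In the real distributed setting the workers touch the center only every τ steps, which is what makes the communication cheap enough to hide with prefetching.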
- Fun fact: trained AlexNet in 5 epochs of ImageNet data
- Good success in training vision and text networks
Big Sur
Open Compute for Deep Learning
- Serviceability
- Thermal Efficiency
- Performance
Swap PCI-e topologies with incredible ease:
- Rails for in-rack servicing
- 2.5" drive carriers
- Hot-swappable fan modules
- GPU removal using 2 thumb screws
- Removable motherboard tray
- Cables to change topologies
- Removable GPU baseboard
Big Sur
PCI-e Topologies Matter!
Torch
Emerging Trends

Efficient Collectives + Imperative Programs
- Data / model / pipeline parallel seems sufficient
- Torch (nn / autograd / distlearn)
- Caffe

Computational Graph Toolkits
- Intel CnC, Caffe, TensorFlow, MXNet, Theano
- Graph placement hints + execution
- DSLs to write the computation graphs

Silver Bullet: Imperative Language + Graph Compiler
- Best of both worlds
- Hard problem of automatic graph placement
- Limited heuristic-driven success
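A toy contrast of the two styles the slides describe; the tiny `Node` class is hypothetical, not any real framework's API:

```python
# Imperative style (Torch/Caffe): each line executes immediately.
a, b = 3.0, 4.0
imperative_result = a * b + a          # computed right now

# Graph style (TensorFlow/Theano): build a computation graph first,
# execute later. A compiler could inspect it and place nodes on devices.
class Node:
    def __init__(self, op, *inputs):
        self.op, self.inputs = op, inputs

    def run(self, env):
        if self.op == "const":
            return env[self.inputs[0]]       # look up a named input
        vals = [n.run(self.inputs and env) for n in self.inputs]
        return vals[0] * vals[1] if self.op == "mul" else vals[0] + vals[1]

x, y = Node("const", "x"), Node("const", "y")
graph = Node("add", Node("mul", x, y), x)    # same expression, deferred
graph_result = graph.run({"x": 3.0, "y": 4.0})

print(imperative_result == graph_result)  # True
```

The "silver bullet" the slide points at is keeping the imperative front end while a compiler recovers a graph like this one and solves placement automatically.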
Big Sur Hardware
- Kevin Lee kevinlee@fb.com
- Doug Wimer dwimer@fb.com
- Soumith Chintala soumith@fb.com

Multi-GPU / Multi-machine Training
- Nicolas Vasilache ntv@fb.com
- Jeff Johnson jhj@fb.com
- Soumith Chintala soumith@fb.com

Computation Graphs, Automatic Placement
- Jeff Johnson jhj@fb.com
- Andrew Tulloch tulloch@fb.com
- Yangqing Jia jiayq@fb.com
- Soumith Chintala soumith@fb.com