Industrial Level Deep Learning Training Infrastructure: the Practice - PowerPoint PPT Presentation



SLIDE 1

Industrial Level Deep Learning Training Infrastructure

—the Practice and Experience from SenseTime

Shengen Yan, SenseTime Group Limited

SLIDE 2

The Success of Deep Learning

[Chart: Google search interest for deep learning, 2006-01 through 2016-01, rising sharply after AlexNet won ImageNet in 2012]

SLIDE 3

What Led to the Success?

SLIDE 4

Model Capacity

The Key to High Performance

[Chart: # Layers per network: LeNet: 5, AlexNet (2012): 8, GoogLeNet (2014): 22, ResNet (2016): 169, Ours: 1207]

SLIDE 5

Computation Power

[Chart: training time scale: years, months, weeks, days]

Computation power accelerated training time from several years to several days!

SLIDE 6

01 Deep Learning Package

A deep learning framework that is efficient, scalable, and flexible.

02 DeepLink

A large-scale cluster platform designed for deep learning.

03 Applications

Delivers many application models.

SLIDE 7

Deep Learning is Complicated

The deep learning community developed frameworks to make life easier.

[Figure: GoogLeNet (2014) network architecture]

SLIDE 8

Deep Learning Training Frameworks

  • SenseTime deep learning training package
  • Memory efficient
  • Computation efficient
  • Both model parallel & data parallel
  • Supports huge models
  • Scalability
SLIDE 9

Memory Footprint Optimization

High-level compiler back-end optimizations applied to an intermediate representation.

Optimizations: liveness analysis, computation-graph transformations
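To illustrate how liveness analysis saves memory, here is a minimal sketch (not SenseTime's actual compiler; op and tensor names are made up): a tensor is live from the op that produces it until its last use, and tensors whose live intervals do not overlap can share one buffer slot.

```python
# Liveness-based buffer reuse on a linearized computation graph (sketch).

def live_intervals(schedule):
    """schedule: list of (output_tensor, input_tensors) in execution order."""
    first, last = {}, {}
    for t, (out, ins) in enumerate(schedule):
        first[out] = last[out] = t
        for name in ins:
            if name in first:          # ignore graph inputs we never allocate
                last[name] = t
    return {name: (first[name], last[name]) for name in first}

def assign_slots(intervals):
    """Greedy linear-scan assignment of tensors to reusable buffer slots."""
    free, active, slots, next_slot = [], [], {}, 0
    for name, (start, end) in sorted(intervals.items(), key=lambda kv: kv[1][0]):
        free.extend(s for e, s in active if e < start)   # release dead tensors
        active = [(e, s) for e, s in active if e >= start]
        if free:
            slot = free.pop()
        else:
            slot = next_slot
            next_slot += 1
        slots[name] = slot
        active.append((end, slot))
    return slots, next_slot

# A four-op chain: each op consumes only the previous op's output.
schedule = [("a", ["x"]), ("b", ["a"]), ("c", ["b"]), ("d", ["c"])]
slots, n_slots = assign_slots(live_intervals(schedule))
print(slots, n_slots)   # the chain runs in 2 slots instead of 4
```

Under this toy schedule, tensors "a" and "c" (and "b" and "d") never overlap, so four intermediate tensors fit in two buffers.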

SLIDE 10


Generated graph with mirror (re-compute) nodes

Chen T., Xu B., Zhang C., et al. Training deep nets with sublinear memory cost. arXiv preprint arXiv:1604.06174, 2016.

Memory Footprint Optimization
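The re-compute ("mirror node") idea from the cited Chen et al. paper can be sketched in a few lines: for a chain of n layers, store only every sqrt(n)-th activation during the forward pass and recompute the rest on demand, trading extra compute for O(sqrt(n)) memory. The toy doubling "layer" below is illustrative only.

```python
import math

def forward_chain(x, layers, keep_every):
    """Run the layer chain, storing only every keep_every-th activation."""
    checkpoints = {0: x}                  # layer-input index -> stored value
    for i, f in enumerate(layers):
        x = f(x)
        if (i + 1) % keep_every == 0:
            checkpoints[i + 1] = x
    return x, checkpoints

def activation_at(i, layers, checkpoints):
    """Recompute the input of layer i from the nearest earlier checkpoint."""
    j = max(k for k in checkpoints if k <= i)
    x = checkpoints[j]
    for f in layers[j:i]:
        x = f(x)
    return x

layers = [lambda v: 2 * v] * 16           # 16 identical toy "layers"
keep = int(math.sqrt(len(layers)))        # sqrt(n) spacing -> O(sqrt(n)) memory
out, ckpts = forward_chain(1.0, layers, keep)
print(out)                                # 65536.0  (2 ** 16)
print(sorted(ckpts))                      # [0, 4, 8, 12, 16]
print(activation_at(7, layers, ckpts))    # 128.0, recomputed from checkpoint 4
```

During backprop, each segment between checkpoints is recomputed exactly once, which is the sublinear-memory trade-off the slide refers to.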

SLIDE 11

Model Capacity

Memory usage efficiency, higher is better

[Chart: memory usage efficiency per framework (Ours, MxNet, TensorFlow, Chainer, Caffe, Torch) on VGG, ResNet50, ResNet152, Inception V4, ResNet269, Inception-ResNet]

SLIDE 12

Single-GPU Performance

Time per iteration in milliseconds (lower is better):

Framework    Batch-32   Batch-64   Batch-128
Caffe        497.5      1045       1965
Chainer      200        290        543
TensorFlow   178.6      315.7      587.2
Parrots      122.7      225.6      471

SLIDE 13

Communication Optimization

Supports multi-GPU and multi-node training. Three procedures: Copy, Allreduce, Copy.

Optimizations:

  • Master-slave threads to overlap communication with computation
  • GPU direct communication
  • Ring-allreduce message passing

[Diagram: GPU0-GPU3 within a node, CPU memory, and other nodes, linked by the Copy, Allreduce, Copy procedures]
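Since the slide names ring-allreduce message passing, here is a minimal single-process simulation of that pattern (illustrative only, not SenseTime's communication library): each of N workers owns a vector split into N chunks; a reduce-scatter phase followed by an allgather phase gives every worker the full sum in 2(N-1) steps, each step moving only 1/N of the data.

```python
def ring_allreduce(bufs):
    """In place: every buffer in `bufs` ends up holding the elementwise sum."""
    n = len(bufs)
    size = len(bufs[0])
    assert size % n == 0, "pad so the vector splits into n equal chunks"
    c = size // n

    def chunk(j):
        j %= n
        return slice(j * c, j * c + c)

    # Phase 1: reduce-scatter. At step t, worker r forwards chunk (r - t) to
    # its right neighbour; after n-1 steps worker r owns the fully summed
    # chunk (r + 1) % n.
    for t in range(n - 1):
        for r in range(n):
            j = chunk(r - t)
            dst = bufs[(r + 1) % n]
            dst[j] = [a + b for a, b in zip(dst[j], bufs[r][j])]

    # Phase 2: allgather. The completed chunks circulate once around the ring.
    for t in range(n - 1):
        for r in range(n):
            j = chunk(r + 1 - t)
            bufs[(r + 1) % n][j] = bufs[r][j]
    return bufs

# Four simulated GPUs, each contributing a distinct 8-element vector.
bufs = [[float(g)] * 8 for g in range(4)]
ring_allreduce(bufs)
print(bufs[0])   # [6.0, 6.0, ...]: 0 + 1 + 2 + 3 in every position
```

The bandwidth cost per worker is 2(N-1)/N of the vector size, nearly independent of N, which is why the ring pattern scales well to many GPUs.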

SLIDE 14

Scalability

[Charts: millisec/iteration and scale efficiency vs. # GPUs, for a single node (1-4 GPUs) and multiple nodes (8-32 GPUs)]
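For reference, a common way to compute the "scale efficiency" metric plotted here, assuming weak scaling (per-GPU batch size fixed, so the ideal per-iteration time stays flat as GPUs are added): efficiency(N) = T(1 GPU) / T(N GPUs). The timings below are made up for illustration; they are not the slide's measurements.

```python
# Hypothetical per-iteration timings (ms) at fixed per-GPU batch size.
timings_ms = {1: 250.0, 2: 255.0, 4: 262.0, 8: 270.0, 16: 281.0, 32: 300.0}

# Weak-scaling efficiency: ideal is 1.0 (constant time per iteration).
efficiency = {n: timings_ms[1] / t for n, t in timings_ms.items()}
for n in sorted(efficiency):
    print(f"{n:>2} GPUs: {efficiency[n]:.3f}")
```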

SLIDE 15

01 Deep Learning Package

A deep learning framework that is efficient, scalable, and flexible.

02 DeepLink

A large-scale cluster platform designed for deep learning.

03 Applications

Delivers many application models.

SLIDE 16

The Role of the Supercomputer

It is like a highway in a city: a key infrastructure of AI.

SLIDE 17

Supercomputing Centers for AI

The key infrastructure for AI research.

[Diagram: DeepLink linking DATA, COMPUTATION, and MODEL]

SLIDE 18

Challenges

  • Interconnects at multiple levels: GPUs, nodes, sub-networks
  • Distributed data: random access becomes particularly difficult
  • Scale vs. stability: failures of individual nodes/links
  • Human resources: engineers who understand both deep learning and HPC are difficult to come by
SLIDE 19

DeepLink Clusters

Designed for deep learning through software-hardware co-design: high-performance hardware plus customized middleware, maximizing their respective strengths while ensuring optimal cooperation.

  • High-speed interconnects
  • High-performance GPU computing
  • Efficient distributed storage
  • Distributed storage & cache system (optimized for small files)
  • Distributed deep learning framework
  • Task scheduling & monitoring
SLIDE 20

Platform Overview

Software platform components:

  • Heterogeneous deep learning supercomputer
  • High-speed storage system
  • Operation/maintenance/monitoring system
  • Lightweight virtualization
  • Task scheduling system
  • Distributed training software
  • Deep learning training visualization system
  • Customized communication library for deep learning
  • Computation library
  • Distributed cache system

SLIDE 21

Training Visualization

SLIDE 22

DeepLink in SenseTime

>3000 GPUs

SLIDE 23

01 Deep Learning Package

A deep learning framework that is efficient, scalable, and flexible.

02 DeepLink

A large-scale cluster platform designed for deep learning.

03 Applications

Delivers many application models.

SLIDE 24

SLIDE 25

THANK YOU