ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | - - PowerPoint PPT Presentation

accelerate deep learning
SMART_READER_LITE
LIVE PREVIEW

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | - - PowerPoint PPT Presentation

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC16 DEEP LEARNING EVERYWHERE INTERNET & CLOUD MEDICINE & BIOLOGY MEDIA & ENTERTAINMENT SECURITY & DEFENSE AUTONOMOUS MACHINES Image


slide-1
SLIDE 1

WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC’16

ACCELERATE DEEP LEARNING

slide-2
SLIDE 2

2

DEEP LEARNING EVERYWHERE

INTERNET & CLOUD

Image Classification Speech Recognition Language Translation Language Processing Sentiment Analysis Recommendation

MEDIA & ENTERTAINMENT

Video Captioning Video Search Real Time Translation

AUTONOMOUS MACHINES

Pedestrian Detection Lane Tracking Recognize Traffic Sign

SECURITY & DEFENSE

Face Detection Video Surveillance Satellite Imagery

MEDICINE & BIOLOGY

Cancer Cell Detection Diabetic Grading Drug Discovery

slide-3
SLIDE 3

3

DEEP LEARNING REVOLUTIONIZING COMPUTING

Solves Problems Previously Unsolvable

slide-4
SLIDE 4

4

A NEW COMPUTING MODEL

Traditional Computer Vision

Domain experts design feature detectors Quality = patchwork of algorithms Need CV experts and time

Deep Learning

DNN learn features from large data Quality = data & training method Needs lots of data and compute

slide-5
SLIDE 5

5

DEEP LEARNING

The Next Innovation Network?

1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 2020

VACUUM TUBES TEST EQUIPMENT MICROWAVES INTEGRATED CIRCUITS PERSONAL COMPUTERS INTERNET SMARTPHONES

DEEP LEARNING

MACHINE INTELLIGENCE

(Source: Deep Learning Gold Rush of 2015, Tomasz Malisiewicz, November 07, 2015 (adapted from http://steveblank.com/secret-history/)

slide-6
SLIDE 6

6

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 2009 2010 2011 2012 2013 2014 2015 2016

THE AI RACE IS ON

IBM Watson Achieves Breakthrough in Natural Language Processing Facebook Launches Big Sur Baidu Deep Speech 2 Beats Humans Google Launches TensorFlow Microsoft & U. Science & Tech, China Beat Humans on IQ Toyota Invests $1B in AI Labs

IMAGENET Accuracy Rate

Traditional CV Deep Learning

slide-7
SLIDE 7

7 NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.

$500B DEEP LEARNING OPPORTUNITY

Ad Service Technology Investment Media Oil and Gas Manufacturing Retail Other SOURCE: “Deep Learning for Enterprise Applications, 4Q 2015, Tractica”

“The current cutting edge of deep learning processing

platforms seems to be massively parallel GPU systems.”

— Tractica

Deep Learning Software Revenue by Industry

World Markets: 2015-2024

Deep Learning Total Revenue by Segment

World Markets: 2015-2024

2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 ($ Millions) $1,000 $2,000 $4,000 $5,000 $- $120,000 $100,000 $80,000 $60,000 $40,000 $20,000 $- 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 ($ Millions) DL-driven GPU Chip Revenue

slide-8
SLIDE 8

8

EDUCATION START-UPS

CNTK TENSORFLOW DL4J

THE ENGINE OF MODERN AI

NVIDIA DEEP LEARNING PLATFORM

*U. Washington, CMU, Stanford, TuSimple, NYU, Microsoft, U. Alberta, MIT, NYU Shanghai

VITRUVIAN SCHULTS LABORATORIES

TORCH THEANO CAFFE MATCONVNET PURINE MOCHA.JL MINERVA MXNET* CHAINER BIG SUR WATSON OPENDEEP KERAS

slide-9
SLIDE 9

9 NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.

NVIDIA DEEP LEARNING PLATFORM

TITAN X - DEVELOPERS DEEP LEARNING SDK DL FRAMEWORK (CAFFE, CNTK,TENSORFLOW, THEANO, TORCH…) TESLA - DEPLOYMENT AUTOMOTIVE - DRIVEPX EMBEDDED - JETSON

slide-10
SLIDE 10

10

DEEP LEARNING FRAMEWORKS

VISION SPEECH BEHAVIOR

Object Detection Voice Recognition Language Translation Recommendation Engines Sentiment Analysis

DEEP LEARNING

cuDNN

MATH LIBRARIES

cuBLAS cuSPARSE

MULTI-GPU

NCCL cuFFT

Mocha.jl

Image Classification

NVIDIA Deep Learning SDK

High Performance GPU-Acceleration for Deep Learning

slide-11
SLIDE 11

11

NVIDIA Deep Learning SDK

Powerful developer tools and libraries for designing and deploying GPU-accelerated deep learning applications

High Performance Deep Learning for NVIDIA GPUs Industry Vetted Deep Learning Algorithms Easily integrated into deep learning applications

developer.nvidia.com/deep-learning

slide-12
SLIDE 12

12

NVIDIA cuDNN

Building blocks for accelerating deep neural networks on GPUs

High performance deep neural network training Accelerates Deep Learning: Caffe, CNTK, Tensorflow, Theano, Torch Performance continues to improve

  • ver time

“NVIDIA has improved the speed of cuDNN

with each release while extending the interface to more operations and devices at the same time.”

— Evan Shelhamer, Lead Caffe Developer, UC Berkeley developer.nvidia.com/cudnn

AlexNet training throughput based on 20 iterations, CPU: 1x E5-2680v3 12 Core 2.5GHz.

0.0x 2.0x 4.0x 6.0x 8.0x 10.0x 12.0x 2014 2015 2016 K40 (cuDNN v1) M40 (cuDNN v3) Pascal (cuDNN v5)

slide-13
SLIDE 13

13

cuBLAS

GPU-accelerated Basic Linear Algebra Subroutines that delivers 6x to 17x faster performance than the latest MKL BLAS Accelerated Level 3 BLAS: SGEMM, SYMM, TRSM, SYRK Up to 7 TFlops Single Precision on a single M40 Multi-GPU BLAS support available in cuBLAS-XT Accelerated Linear Algebra for Deep Learning

developer.nvidia.com/cublas

slide-14
SLIDE 14

14

NCCL

A topology-aware library of accelerated collectives to improve the scalability of multi-GPU applications Patterned after MPI’s collectives: includes all-reduce, all-gather, reduce-scatter, reduce, broadcast Optimized intra-node communication Supports multi-threaded and multi- process applications Accelerating Multi-GPU Communications

github.com/NVIDIA/nccl

slide-15
SLIDE 15

15

NVIDIA DIGITS

An interactive development environment for training deep neural networks

Prepare data quickly and easily for training Visualize network behavior Maximize training speed

Making Deep Learning Accessible

developer.nvidia.com/digits

slide-16
SLIDE 16

16

What’s new in DIGITS 3?

Train neural network models with Torch support (preview) Save time by quickly iterating to identify the best model Manage multiple jobs easily to

  • ptimize use of system resources

Active open source project with valuable community contributions

Improves Deep Learning Training Productivity

Screenshot of new DIGITS

New Results Browser!

developer.nvidia.com/digits

slide-17
SLIDE 17

17

Preview DIGITS Future

Object Detection Workflows for Automotive and Defense Targeted at Autonomous Vehicles, Remote Sensing Object Detection Workflow

Come see a live demo in the GTC Exhibit Hall!

developer.nvidia.com/digits

slide-18
SLIDE 18

18

Deep Learning at GTC

slide-19
SLIDE 19

19

Deep Learning at GTC

11:00am: Accelerate Deep Learning with NVIDIA's Deep Learning Platform 12:00pm: Hangout -- The DIGITS Roadmap 1:00pm: From Workstation to Embedded: Accelerated Deep Learning on NVIDIA Jetson TX1 3:00pm: A Tutorial on More Ways to Use DIGITS 4:00pm: Hangout – cuDNN -- Features, Roadmap and Q&A

Deep Learning at NVIDIA, Monday 4/4

slide-20
SLIDE 20

20

Deep Learning at GTC

9:00am: Caffe: an Open Framework for Deep Learning 10:00am: TensorFlow: Scaling Up Machine Learning 2:00pm: Torch: A Flexible Platform for Deep Learning Research 3:00pm: Chainer: A Powerful, Flexible, and Intuitive Deep Learning Framework 4:00pm: Theano at a Glance: A Framework for Machine Learning 4:30pm: Deep Learning in Microsoft with CNTK Monday, 4/4 at 3:00pm: MXNet: Flexible Deep Learning Framework from Distributed GPU Clusters to Embedded Systems

Frameworks Track, Wednesday 4/6

slide-21
SLIDE 21

21

Deep Learning at GTC

Wednesday 4/6

1:00pm: Introduction to CNTK 2:00pm: Machine Learning Using TensorFlow 2:00pm: BIDMach Machine Learning Toolkit 3:30pm: Applied Deep Learning for Vision and Natural Language with Torch7 3:30pm: Caffe Hands-on Lab

Frameworks Hands-on Labs

Thursday 4/7

9:30am: Chainer Hands-on: Introduction To Train Deep Learning Model in Python 9:30am: Deep Learning With the Theano Python Library 1:00pm: IBM Watson Developers Lab

Monday 4/4

1:00pm: Train and Deploy Deep Learning for Vision, Natural Language and Speech Using MXNet

slide-22
SLIDE 22

22

Deep Learning at GTC

Tuesday, 4/5 1:00pm: Distributed Deep Learning at Scale, Soumith Chintala, Research Engineer, Facebook AI Research 2:00pm: Generative Adversarial Networks, Ian Goodfellow, Senior Research Scientist, Google 3:00pm: Video Classification of Live Streams on Twitter's Periscope, Nicolas Koumchatzky, Engineer, Twitter 4:00pm: Training and Deploying Deep Neural Networks for Speech Recognition, Bryan Catanzaro, Senior Researcher, Baidu Research Wednesday, 4/6, 9:30am: Deep Reinforcement Learning, Pieter Abbeel, Professor, UC Berkeley

Over 50 sessions on Deep Learning, highlights include --

slide-23
SLIDE 23

23

Deep Learning at GTC

Monday 4/4

12:00pm: The DIGITS Roadmap 4:00pm: cuDNN--Features, Roadmap and Q&A

Tuesday 4/5

12:00pm: Dreaming Big: Scaling Up Deep Dream to Operate on Multi-Hundred Megapixel Images

Hangouts

Wednesday 4/6

10:00am: Deep Learning in Image and Video 1:00pm: Deep Learning Exploits Petabytes of DigitalGlobe GIS 1:00pm: Reinforcement Learning

Thursday 4/7

1:00pm: Large Vocabulary Speech Recognition with GPUs 1:00pm: NVIDIA Deep Learning Software

slide-24
SLIDE 24

24

Deep Learning at GTC

S6515 - Listen, Attend and Spell, William Chan, PhD Candidate, Carnegie Mellon University S6745 - VQA: Visual Question Answering, Aishwarya Agrawal, PhD Student, Virginia Tech S6781 - Deep Neural Networks for Conversational Language Understanding, Kaheer Suleman, CTO, Maluuba Inc. S6321 - How Deep Learning Works for Automated Customer Service, Chenghua (Kevin) Li, Chief Scientist

  • f DNN Lab, JD.COM

H6127 - Hangout: Large Vocabulary Speech Recognition with GPUs, Yifang Xu, Senior Deep Learning Software Engineer, NVIDIA

NLP/NLU

slide-25
SLIDE 25

25

Deep Learning at GTC

S6371 - Deep Convolutional Neural Networks for Spoken Dialect Classification of Spectrogram Images Using DIGITS, Nigel Cannings, Chief Technical Officer, Intelligent Voice Limited S6383 - High Performance CTC Training for End-to-End Speech Recognition on GPU, Minmin Sun, GPU Architecture Engineer, NVIDIA S6458 - A GPU-Based Cloud Speech Recognition Server for Dialog Applications, Alexei V. Ivanov, CTO, Verbumware Inc. S6672 - Training and Deploying Deep Neural Networks for Speech Recognition, Bryan Catanzaro, Senior Researcher, Baidu Research (Highly-Rated Speaker) S6515 - Listen, Attend and Spell, William Chan, PhD Candidate, Carnegie Mellon University S6781 - Deep Neural Networks for Conversational Language Understanding, Kaheer Suleman, CTO, Maluuba Inc.

Speech

slide-26
SLIDE 26

QUESTIONS?