Pedro Mario Cruz e Silva (pcruzesilva@nvidia.com) Solution Architect Manager Enterprise Latin America Global Oil & Gas Team
IMAGE CLASSIFICATION WITH NVIDIA DIGITS Pedro Mario Cruz e Silva - - PowerPoint PPT Presentation
IMAGE CLASSIFICATION WITH NVIDIA DIGITS Pedro Mario Cruz e Silva - - PowerPoint PPT Presentation
IMAGE CLASSIFICATION WITH NVIDIA DIGITS Pedro Mario Cruz e Silva (pcruzesilva@nvidia.com) Solution Architect Manager Enterprise Latin America Global Oil & Gas Team DEEP LEARNING WITH DIGITS Hands-On Lab NVIDIA QwikLabs
2
DEEP LEARNING WITH DIGITS
“NVIDIA QwikLabs” https://nvlabs.qwiklab.com “Image Classification with DIGITS”
Hands-On Lab
3
4
5
INTRODUCTION
6
LEARNING FROM DATA
AND SOME BUZZ WORDS ARTIFICAL INTELLIGENCE MACHINE LEARNING DEEP LEARNING
Knowledge & Reason Learning Planning Communicating Perceiving Learning from data Expert systems Handcrafted features Learning from data Neural networks Computer learned features
7
TRADITIONAL COMPUTING MODEL
Algorithm Input “Label” Output
8
A NEW COMPUTING MODEL
“Label” Input Training Data Output Trained Neural Network Trained Neural Network “Label” Output Input
TRAINING INFERENCE
9
A NEW COMPUTING MODEL
Outperform experts, facts, rules with software that writes software
Deep Learning Object Detection DNN + Data + GPU Traditional Computer Vision Experts + Time Deep Learning Achieves “Superhuman” Results
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
2009 2010 2011 2012 2013 2014 2015 2016
Traditional CV Deep Learning
ImageNet
10
MNIST (MODIFIED NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY)
11
12
NVIDIA DIGITS
13
POWERING THE DEEP LEARNING ECOSYSTEM
NVIDIA SDK accelerates every major framework
COMPUTER VISION
OBJECT DETECTION IMAGE CLASSIFICATION
SPEECH & AUDIO
VOICE RECOGNITION LANGUAGE TRANSLATION
NATURAL LANGUAGE PROCESSING
RECOMMENDATION ENGINES SENTIMENT ANALYSIS
DEEP LEARNING FRAMEWORKS
Mocha.jl
NVIDIA DEEP LEARNING SDK
developer.nvidia.com/deep-learning-software
14
NVIDIA DIGITS
Interactive Deep Learning GPU Training System
developer.nvidia.com/digits
Interactive deep neural network development environment for image classification and object detection
Schedule, monitor, and manage neural network training jobs Analyze accuracy and loss in real time Track datasets, results, and trained neural networks Scale training jobs across multiple GPUs automatically
15
OBJECT DETECTION IMAGE CLASSIFICATION
DEEP LEARNING WORKFLOWS
Classify images into classes or categories Object of interest could be anywhere in the image Find instances of objects in an image Objects are identified with bounding boxes
98% Dog 2% Cat
New in DIGITS 5
Partition image into multiple regions Regions are classified at the pixel level
IMAGE SEGMENTATION
16
STEP 0 – RUN DIGITS
17
18
19
STEP 1 – CREATE DATASET
20
LOAD AND ORGANIZE DATA
21
LOAD AND ORGANIZE DATA
22
EXPLORE DB
23
24
25
26
27
STEP 2 – TRAINING A MODEL
28
TRAINING A MODEL
29
TRAINING A MODEL
30
TRAINING A MODEL
31
TRAINING A MODEL
32
33
STEP 3 – INFERENCE
34
INFERENCE
35
INFERENCE (TEST)
36
NVIDIA AI PLATFORM
37
150B XTORS | 5.3TF FP64 | 10.6TF FP32 | 21.2TF FP16 | 14MB SM RF | 4MB L2 Cache
TESLA P100
THE MOST ADVANCED HYPERSCALE DATACENTER GPU EVER BUILT
38 NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
INTRODUCING TESLA P100
New GPU Architecture to Enable the World’s Fastest Compute Node
Pascal Architecture NVLink CoWoS HBM2 Page Migration Engine
Highest Compute Performance GPU Interconnect for Maximum Scalability Unifying Compute & Memory in Single Package Simple Parallel Programming with Virtually Unlimited Memory Space
Unified Memory
CPU T esla P100
39
ANNOUNCING TESLA V100
GIANT LEAP FOR AI & HPC VOLTA WITH NEW TENSOR CORE
21B xtors | TSMC 12nm FFN | 815mm2 5,120 CUDA cores 7.5 FP64 TFLOPS | 15 FP32 TFLOPS NEW 120 Tensor TFLOPS 20MB SM RF | 16MB Cache 16GB HBM2 @ 900 GB/s 300 GB/s NVLink
40
NEW TENSOR CORE
New CUDA TensorOp instructions & data formats 4x4 matrix processing array D[FP32] = A[FP16] * B[FP16] + C[FP32] Optimized for deep learning
Activation Inputs Weights Inputs Output Results
41
TENSOR CORE
4x4x4 matrix multiply and accumulate
42
Tesla P100 vs Tesla V100
Tesla P100 (Pascal) Tesla V100 (Volta) Memory 16 GB (HBM2) 16 GB (HMB2) Memory Bandwidth 720 GB/s 900 GB/s NVLINK 160 GB/s 300 GB/s CUDA Cores (FP32) 3584 5120 CUDA Cores (FP64) 1792 2560 Tensor Cores (TC) NA 640 Peak TFLOPS/s (FP32) 10.6 15 Peak TFLOPS/s (FP64) 5.3 7.5 Peak TFLOPS/s (TC) NA 120 Power 300 W 300 W
3x
43
Tesla P100 vs Tesla V100
Tesla P100 (Pascal) Tesla V100 (Volta) Memory 16 GB (HBM2) 16 GB (HMB2) Memory Bandwidth 720 GB/s 900 GB/s NVLINK 160 GB/s 300 GB/s CUDA Cores (FP32) 3584 5120 CUDA Cores (FP64) 1792 2560 Tensor Cores (TC) NA 640 Peak TFLOPS/s (FP32) 10.6 15 Peak TFLOPS/s (FP64) 5.3 7.5 Peak TFLOPS/s (TC) NA 120 Power 300 W 300 W
50%
44
NVIDIA GPU CLOUD SIMPLIFYING AI & HPC
DEEP LEARNING HPC APPS HPC VIZ
45
46
DRAMATICALLY MORE FOR YOUR MONEY
5X Better HPC TCO for Same Throughput
160 Self-hosted Skylake CPU Servers 96 KWatts MIXED HPC WORKLOAD:
Amber, CHROMA, GTC, LAMMPS, MILC, NAMD, Quantum Expresso, SPECFEM3D
8 Accelerated Servers w/4 V100 GPUs 13 KWatts
SAME
THROUGHPUT
1/5
THE COST
1/7
THE SPACE
1/7
THE POWER
MIXED HPC WORKLOAD:
Amber, CHROMA, GTC, LAMMPS, MILC, NAMD, Quantum Espresso, SPECFEM3D
47
NVIDIA SUPPORT PROGRAMS
48
developer.nvidia.com
49
Deep Learning Fundamentals Game Development & Digital Content Finance
NVIDIA DEEP LEARNING INSTITUTE
Hands-on self-paced and instructor-led training in deep learning and accelerated computing for developers Request onsite instructor-led workshops at your
- rganization: www.nvidia.com/requestdli
Take self-paced labs online: www.nvidia.com/dlilabs Download the course catalog, view upcoming workshops, and learn about the University Ambassador Program: www.nvidia.com/dli
Intelligent Video Analytics Medical Image Analysis Autonomous Vehicles Accelerated Computing Fundamentals More industry- specific training coming soon… Genomics
50
NVIDIA HW GRANT PROGRAM
Titan X Pascal
- Robotics
- Autonomous Machines
Jetson TX2 (Dev Kit)
- Scientific Visualization
- Virtual Reality
Quadro P6000
- Scientific Computing
- HPC
- Deep Learning
51
INCEPTION PROGRAM
http://www.nvidia.com/object/inception-program.html
Pedro Mario Cruz e Silva (pcruzesilva@nvidia.com) Solution Architect Manager Enterprise Latin America Global Oil & Gas Team LinkedIn