accelerate deep learning
play

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | - PowerPoint PPT Presentation

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC16 DEEP LEARNING EVERYWHERE INTERNET & CLOUD MEDICINE & BIOLOGY MEDIA & ENTERTAINMENT SECURITY & DEFENSE AUTONOMOUS MACHINES Image


  1. ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC’16

  2. DEEP LEARNING EVERYWHERE INTERNET & CLOUD MEDICINE & BIOLOGY MEDIA & ENTERTAINMENT SECURITY & DEFENSE AUTONOMOUS MACHINES Image Classification Cancer Cell Detection Face Detection Pedestrian Detection Video Captioning Speech Recognition Diabetic Grading Video Surveillance Lane Tracking Video Search Language Translation Drug Discovery Satellite Imagery Recognize Traffic Sign Real Time Translation Language Processing Sentiment Analysis Recommendation 2

  3. DEEP LEARNING REVOLUTIONIZING COMPUTING Solves Problems Previously Unsolvable 3

  4. A NEW COMPUTING MODEL Traditional Computer Vision Deep Learning Domain experts design feature detectors DNN learn features from large data Quality = patchwork of algorithms Quality = data & training method Need CV experts and time Needs lots of data and compute 4

  5. DEEP LEARNING The Next Innovation Network? MACHINE INTELLIGENCE DEEP LEARNING SMARTPHONES INTERNET PERSONAL COMPUTERS INTEGRATED CIRCUITS MICROWAVES TEST EQUIPMENT VACUUM TUBES 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 2020 5 (Source: Deep Learning Gold Rush of 2015, Tomasz Malisiewicz, November 07, 2015 (adapted from http://steveblank.com/secret-history/)

  6. THE AI RACE IS ON IMAGENET Accuracy Rate 100% Traditional CV Deep Learning 90% 80% 70% Baidu Deep Speech 2 IBM Watson Achieves Breakthrough Facebook Beats Humans in Natural Language Processing Launches Big Sur 60% 50% 40% 30% 20% 10% Google Toyota Invests $1B Microsoft & U. Science & Tech, China Launches TensorFlow Beat Humans on IQ in AI Labs 0% 2009 2010 2011 2012 2013 2014 2015 2016 6

  7. $500B DEEP LEARNING OPPORTUNITY Deep Learning Total Revenue by Segment Deep Learning Software Revenue by Industry World Markets: 2015-2024 World Markets: 2015-2024 $120,000 $100,000 Other $80,000 Ad Service ($ Millions) Retail Technology $60,000 Manufacturing $40,000 $20,000 Oil and Gas $- 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 Investment Media $5,000 DL-driven $4,000 ($ Millions) GPU Chip Revenue $2,000 $1,000 “ The current cutting edge of deep learning processing $- platforms seems to be massively parallel GPU systems. ” 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 — Tractica 7 SOURCE: “Deep Learning for Enterprise Applications, 4Q 2015, Tractica ” NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.

  8. THE ENGINE OF MODERN AI EDUCATION BIG SUR TENSORFLOW WATSON CNTK TORCH CAFFE THEANO MATCONVNET START-UPS MOCHA.JL PURINE CHAINER DL4J KERAS OPENDEEP MINERVA MXNET* SCHULTS VITRUVIAN LABORATORIES NVIDIA DEEP LEARNING PLATFORM 8 * U. Washington, CMU, Stanford, TuSimple, NYU, Microsoft, U. Alberta, MIT, NYU Shanghai

  9. NVIDIA DEEP LEARNING PLATFORM DL FRAMEWORK (CAFFE, CNTK,TENSORFLOW, THEANO, TORCH …) DEEP LEARNING SDK TITAN X - DEVELOPERS TESLA - DEPLOYMENT AUTOMOTIVE - DRIVEPX EMBEDDED - JETSON 9 NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.

  10. NVIDIA Deep Learning SDK High Performance GPU-Acceleration for Deep Learning Recommendation Sentiment Analysis Image Classification Voice Recognition Language Translation Object Detection Engines SPEECH BEHAVIOR VISION Mocha.jl DEEP LEARNING FRAMEWORKS cuFFT cuBLAS cuSPARSE cuDNN NCCL DEEP LEARNING MULTI-GPU MATH LIBRARIES 10

  11. NVIDIA Deep Learning SDK Powerful developer tools and libraries for designing and deploying GPU-accelerated deep learning applications High Performance Deep Learning for NVIDIA GPUs Industry Vetted Deep Learning Algorithms Easily integrated into deep learning applications developer.nvidia.com/ deep-learning 11

  12. Pascal NVIDIA cuDNN 12.0x (cuDNN v5) 10.0x 8.0x M40 (cuDNN v3) 6.0x Building blocks for accelerating deep 4.0x K40 neural networks on GPUs (cuDNN v1) 2.0x High performance deep neural 0.0x network training 2014 2015 2016 AlexNet training throughput based on 20 iterations, CPU: 1x E5-2680v3 12 Core 2.5GHz. Accelerates Deep Learning: Caffe, CNTK, Tensorflow, Theano, Torch “ NVIDIA has improved the speed of cuDNN with each release while extending the Performance continues to improve interface to more operations and devices over time at the same time. ” — Evan Shelhamer, Lead Caffe Developer, UC Berkeley developer.nvidia.com/ cudnn 12

  13. cuBLAS Accelerated Linear Algebra for Deep Learning GPU-accelerated Basic Linear Algebra Subroutines that delivers 6x to 17x faster performance than the latest MKL BLAS Accelerated Level 3 BLAS: SGEMM, SYMM, TRSM, SYRK Up to 7 TFlops Single Precision on a single M40 Multi-GPU BLAS support available in cuBLAS-XT developer.nvidia.com/cublas 13

  14. NCCL Accelerating Multi-GPU Communications A topology-aware library of accelerated collectives to improve the scalability of multi-GPU applications Patterned after MPI’s collectives: includes all-reduce, all-gather, reduce-scatter, reduce, broadcast Optimized intra-node communication Supports multi-threaded and multi- process applications github.com/NVIDIA/nccl 14

  15. NVIDIA DIGITS Making Deep Learning Accessible An interactive development environment for training deep neural networks Prepare data quickly and easily for training Visualize network behavior Maximize training speed developer.nvidia.com/digits 15

  16. What’s new in DIGITS 3? Improves Deep Learning Training Productivity Train neural network models with Screenshot of new DIGITS Torch support (preview) Save time by quickly iterating to identify the best model Manage multiple jobs easily to optimize use of system resources Active open source project with New Results Browser! valuable community contributions developer.nvidia.com/digits 16

  17. Preview DIGITS Future Object Detection Workflow Object Detection Workflows for Automotive and Defense Targeted at Autonomous Vehicles, Remote Sensing Come see a live demo in the GTC Exhibit Hall! developer.nvidia.com/digits 17

  18. Deep Learning at GTC 18

  19. Deep Learning at GTC Deep Learning at NVIDIA, Monday 4/4 11:00am: Accelerate Deep Learning with NVIDIA's Deep Learning Platform 12:00pm: Hangout -- The DIGITS Roadmap 1:00pm: From Workstation to Embedded: Accelerated Deep Learning on NVIDIA Jetson TX1 3:00pm: A Tutorial on More Ways to Use DIGITS 4:00pm: Hangout – cuDNN -- Features, Roadmap and Q&A 19

  20. Deep Learning at GTC Frameworks Track, Wednesday 4/6 9:00am: Caffe: an Open Framework for Deep Learning 10:00am: TensorFlow: Scaling Up Machine Learning 2:00pm: Torch: A Flexible Platform for Deep Learning Research 3:00pm: Chainer: A Powerful, Flexible, and Intuitive Deep Learning Framework 4:00pm: Theano at a Glance: A Framework for Machine Learning 4:30pm: Deep Learning in Microsoft with CNTK Monday, 4/4 at 3:00pm: MXNet: Flexible Deep Learning Framework from Distributed GPU Clusters to Embedded Systems 20

  21. Deep Learning at GTC Frameworks Hands-on Labs Wednesday 4/6 Thursday 4/7 1:00pm: Introduction to CNTK 9:30am: Chainer Hands-on: Introduction To Train Deep Learning Model in Python 2:00pm: Machine Learning Using TensorFlow 9:30am: Deep Learning With the Theano 2:00pm: BIDMach Machine Learning Toolkit Python Library 3:30pm: Applied Deep Learning for Vision 1:00pm: IBM Watson Developers Lab and Natural Language with Torch7 Monday 4/4 3:30pm: Caffe Hands-on Lab 1:00pm: Train and Deploy Deep Learning for Vision, Natural Language and Speech Using MXNet 21

  22. Deep Learning at GTC Over 50 sessions on Deep Learning, highlights include -- Tuesday, 4/5 1:00pm: Distributed Deep Learning at Scale, Soumith Chintala , Research Engineer, Facebook AI Research 2:00pm: Generative Adversarial Networks, Ian Goodfellow , Senior Research Scientist, Google 3:00pm: Video Classification of Live Streams on Twitter's Periscope, Nicolas Koumchatzky , Engineer, Twitter 4:00pm: Training and Deploying Deep Neural Networks for Speech Recognition, Bryan Catanzaro , Senior Researcher, Baidu Research Wednesday, 4/6, 9:30am: Deep Reinforcement Learning, Pieter Abbeel , Professor, UC Berkeley 22

  23. Deep Learning at GTC Hangouts Monday 4/4 Wednesday 4/6 12:00pm: The DIGITS Roadmap 10:00am: Deep Learning in Image and Video 4:00pm: cuDNN--Features, Roadmap and Q&A 1:00pm: Deep Learning Exploits Petabytes of DigitalGlobe GIS Tuesday 4/5 1:00pm: Reinforcement Learning 12:00pm: Dreaming Big: Scaling Up Deep Dream Thursday 4/7 to Operate on Multi-Hundred Megapixel Images 1:00pm: Large Vocabulary Speech Recognition with GPUs 1:00pm: NVIDIA Deep Learning Software 23

  24. Deep Learning at GTC NLP/NLU S6515 - Listen, Attend and Spell, William Chan , PhD Candidate, Carnegie Mellon University S6745 - VQA: Visual Question Answering, Aishwarya Agrawal , PhD Student, Virginia Tech S6781 - Deep Neural Networks for Conversational Language Understanding, Kaheer Suleman , CTO, Maluuba Inc. S6321 - How Deep Learning Works for Automated Customer Service, Chenghua (Kevin) Li , Chief Scientist of DNN Lab, JD.COM H6127 - Hangout: Large Vocabulary Speech Recognition with GPUs, Yifang Xu , Senior Deep Learning Software Engineer, NVIDIA 24

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend