Deep Learning for Everybody Elliot English and the MetaMind team - - PowerPoint PPT Presentation

deep learning for
SMART_READER_LITE
LIVE PREVIEW

Deep Learning for Everybody Elliot English and the MetaMind team - - PowerPoint PPT Presentation

Deep Learning for Everybody Elliot English and the MetaMind team elliot@metamind.io Socher, Ng, Manning Socher, Manning, Ng Unstructured visual and textual data Enormous growth of images and text 1.8B images shared / day 100B


slide-1
SLIDE 1

Socher, Manning, Ng

Socher, Ng, Manning

Deep Learning for Everybody

Elliot English and the MetaMind team

elliot@metamind.io

slide-2
SLIDE 2

Socher, Manning, Ng

Socher, Ng, Manning

Unstructured visual and textual data

  • Enormous growth of images and text
  • 1.8B images shared / day
  • 100B business emails sent / day
  • They span all industries and their analysis is valuable
  • Advertising, ad-optimization based on content
  • Medicine, Radiology images: early cancer detection
  • Insurance, Satellite images: building risk analysis
  • Finance, Sentiment analysis for trading
  • Customer Relationship Management, churn prediction
  • Their analysis requires machine learning
slide-3
SLIDE 3

Socher, Manning, Ng

Socher, Ng, Manning

Machine Learning Used to Require a Ph.D.

http://xkcd.com/1425/

slide-4
SLIDE 4

Socher, Manning, Ng

Socher, Ng, Manning

Why is that?

Learning algorithm Describing your data with features a computer can understand Learning algorithm

Domain specific, requires Ph.D. level talent Can take largely

  • ff the shelf
slide-5
SLIDE 5

Socher, Manning, Ng

Socher, Ng, Manning

Parsing Named Entities Char n-grams POS Tags Coreference Taxonomy Feature Engineering is hard! – Real NLP Example

  • Task: Predict quality of a radiology report
  • Features:
slide-6
SLIDE 6

Socher, Manning, Ng

Socher, Ng, Manning

Deep Learning can replace all of these: NLP!

  • Word vectors:
  • Recursive structures
slide-7
SLIDE 7

Socher, Manning, Ng

Socher, Ng, Manning

Canny Edges Harris Corners SIFT SURF Feature Engineering is hard! – Real Vision Example

  • Task: Predict class of object in image
  • Features:

http://www.cs.ubc.ca/~lowe/papers/ijcv04.pdf http://en.wikipedia.org/wiki/Edge_detection http://docs.opencv.org/trunk/doc/py_tutorials/py_feature2d/py_featur es_harris/py_features_harris.html http://www.vision.ee.ethz.ch/~surf/eccv06.pdf

slide-8
SLIDE 8

Socher, Manning, Ng

Socher, Ng, Manning

Deep Learning can replace all of these: Vision!

  • Bottom level features from a convolutional neural network:

http://www.cs.toronto.edu/~fritz/absps/imagenet.pdf

slide-9
SLIDE 9

Socher, Manning, Ng

Socher, Ng, Manning

Deep Learning has revolutionized the industry

  • Speech recognition systems of

Google, Microsoft, Baidu all use DL

  • Google+, Microsoft and others use DL for

very accurate image classification, e.g. results for: seat belt, boston rocker, archery, shredder

  • Let’s take a look at how we’re doing on the latter by examining a popular

benchmark.

slide-10
SLIDE 10

Socher, Manning, Ng

Socher, Ng, Manning

ImageNet Large Scale Visual Recognition Challenge

NEC (SIFT Features) XRCE (Fisher Features)

65 70 75 80 85 90 95 100 Jan-10 Feb-11 Apr-12 May-13 Jun-14 Jul-15 ILSVRC Classification Task Top-5 Accuracy

slide-11
SLIDE 11

Socher, Manning, Ng

Socher, Ng, Manning

State-of-the-art rapidly improving

  • Convolutional neural networks

now the de facto standard for image classification

  • LeCun, Yann, et al. "Gradient-

based learning applied to document recognition.” (1998).

  • Krizhevsky, Alex, Ilya Sutskever,

and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks.” (2012).

  • Szegedy, Christian, et al. "Going

deeper with convolutions.” (2014).

slide-12
SLIDE 12

Socher, Manning, Ng

Socher, Ng, Manning

ImageNet Large Scale Visual Recognition Challenge

NEC (SIFT Features) XRCE (Fisher Features) SuperVision Clarifai Google Google Microsoft Baidu

65 70 75 80 85 90 95 100 Jan-10 Feb-11 Apr-12 May-13 Jun-14 Jul-15 ILSVRC Classification Task Top-5 Accuracy

  • We’re now at human

accuracy! (not really)

  • Deep learning still

limited to select companies

slide-13
SLIDE 13

Socher, Manning, Ng

Socher, Ng, Manning

ImageNet Large Scale Visual Recognition Challenge

  • We’re now at human

accuracy! (not really)

  • Deep learning still

limited to select companies

  • MetaMind makes

state-of-the-art deep learning readily usable

NEC (SIFT Features) XRCE (Fisher Features) SuperVision Clarifai Google Google Microsoft Baidu

65 70 75 80 85 90 95 100 Jan-10 Feb-11 Apr-12 May-13 Jun-14 Jul-15 ILSVRC Classification Task Top-5 Accuracy

slide-14
SLIDE 14

Socher, Manning, Ng

Socher, Ng, Manning

MetaMind: Deep learning for everybody

  • We take care of the details:

– Machine learning algorithm selection – Hyper parameter tuning – Efficient training procedures – Computational resource management

  • you don’t need to worry about owning

your own GPU machines

– Scalable inference infrastructure

  • We constantly improve your performance

100 80 60 40 20

0 2 4 6 8

Accuracy* (%)

Training time (Days)

slide-15
SLIDE 15

Socher, Manning, Ng

Socher, Ng, Manning

Demos!

slide-16
SLIDE 16

Socher, Manning, Ng

Socher, Ng, Manning

Language Demo: Twitter Sentiment

slide-17
SLIDE 17

Socher, Manning, Ng

Socher, Ng, Manning

Language Demo: Semantic Similarity

slide-18
SLIDE 18

Socher, Manning, Ng

Socher, Ng, Manning

Language Demo: Train Your Own Classifier

slide-19
SLIDE 19

Socher, Manning, Ng

Socher, Ng, Manning

Vision: General Image Classifier

slide-20
SLIDE 20

Socher, Manning, Ng

Socher, Ng, Manning

Vision Demo: Food Classifier

slide-21
SLIDE 21

Socher, Manning, Ng

Socher, Ng, Manning

Vision Demo: Train your own classifier

slide-22
SLIDE 22

Socher, Manning, Ng

Socher, Ng, Manning

Vision Classifier Use Cases

Language Classifiers Vision Classifiers

slide-23
SLIDE 23

Socher, Manning, Ng

Socher, Ng, Manning

API

  • Pre-trained classifiers:

– https://www.metamind.io/api-quick-start

  • Train your own classifier tutorial:

– https://www.metamind.io/api-tutorial-fit

slide-24
SLIDE 24

Socher, Manning, Ng

Socher, Ng, Manning

Also doing research

  • Developing new models to improve accuracy
  • Improving both training and inference speed
  • Addressing new problems involving multimodal systems
slide-25
SLIDE 25

Socher, Manning, Ng

Socher, Ng, Manning

MetaMind’s Vision

Breakthrough AI for Everybody

slide-26
SLIDE 26

Socher, Manning, Ng

Socher, Ng, Manning

slide-27
SLIDE 27

Socher, Manning, Ng

Socher, Ng, Manning

slide-28
SLIDE 28

Socher, Manning, Ng

Socher, Ng, Manning

Grounded sentence-image search

Image-Sentence Demo

slide-29
SLIDE 29

Socher, Manning, Ng

Socher, Ng, Manning

slide-30
SLIDE 30

Socher, Manning, Ng

Socher, Ng, Manning