C O U R T E S Y O F C O R T E X
VIDEO UNDERSTANDING @ TWITTER C O U R T E S Y O F C O R T E X - - PowerPoint PPT Presentation
VIDEO UNDERSTANDING @ TWITTER C O U R T E S Y O F C O R T E X - - PowerPoint PPT Presentation
VIDEO UNDERSTANDING @ TWITTER C O U R T E S Y O F C O R T E X USER PROTECTION T W I T T E R C O R T E X T W I T T E R C O R T E X CONTENT UNDERSTANDING T W I T T E R C O R T E X T W I T T E R C O R T E X T W I T T E R C O R T E X
T W I T T E R C O R T E X
USER PROTECTION
T W I T T E R C O R T E X
T W I T T E R C O R T E X
CONTENT UNDERSTANDING
T W I T T E R C O R T E X
T W I T T E R C O R T E X
CONTENT INDEXING
T W I T T E R C O R T E X
T W I T T E R C O R T E X
event poster Two dogs sitting on a couch A red car parked in front of a building
CONTENT RELEVANCE
T W I T T E R C O R T E X
T W I T T E R C O R T E X
USER PROTECTION CONTENT UNDERSTANDING CONTENT INDEXING CONTENT RELEVANCE
T W I T T E R C O R T E X
VIDEOS — THE COMBINATORIAL CHALLENGE
T W I T T E R C O R T E X
T W I T T E R C O R T E X
T W I T T E R C O R T E X
T W I T T E R C O R T E X
T W I T T E R C O R T E X
Twitter Images Vine Periscope
T W I T T E R C O R T E X
FORWARD OR BACKWARD ?
T W I T T E R C O R T E X
MOTION: FORWARD OR BACKWARD ? CLOCKWISE OR COUNTER-CLOCKWISE ?
T W I T T E R C O R T E X
MULTIMODAL
- VIDEO
- AUDIO
- TEXT
T W I T T E R C O R T E X
George Toderici and Sudheendra Vijayanarasimhan http://googleresearch.blogspot.com/2015/04/beyond-short-snippets-deep-networks- for.html 0.93933 0.43433
T W I T T E R C O R T E X
RESEARCH ANSWERS
T W I T T E R C O R T E X
TWITTER CORTEX
CNN VARIATIONS
VGG SIMONYAN AND ZISSERMAN
T W I T T E R C O R T E X
TWITTER CORTEX
Inception Net SZEGEDY AND AL.
CNN VARIATIONS
T W I T T E R C O R T E X
TWITTER CORTEX
CNN VARIATIONS
Residual Nets KAIMING HE AND AL.
T W I T T E R C O R T E X
TWITTER CORTEX
RECURRENT NEURAL NETWORKS
T W I T T E R C O R T E X
T W I T T E R C O R T E X
T W I T T E R C O R T E X
?
CORTEX SOLUTIONS
T W I T T E R C O R T E X
TAXONOMY EFFORT FAST PROTOTYPING SCALABILITY EASY DEPLOYMENT
T W I T T E R C O R T E X
TAXONOMY EFFORT FAST PROTOTYPING SCALABILITY EASY DEPLOYMENT
T W I T T E R C O R T E X
TAXONOMY EFFORT
- T W I T T E R C O R T E X
PRODUCT MANAGER
Defines a taxonomy
LABELING
MODEL DEPLOYMENT
TRAINING
TAXONOMY EFFORT
- T W I T T E R C O R T E X
PRODUCT MANAGER
Defines a taxonomy
LABELING
MODEL DEPLOYMENT
TRAINING
TAXONOMY EFFORT FAST PROTOTYPING SCALABILITY EASY DEPLOYMENT
T W I T T E R C O R T E X
CORTEX PROCESS
T W I T T E R C O R T E X
- 1. Data gathering
- 2. Labeling
- 3. Model Training
- 4. Production Service
& Traffic
CORTEX REAL PROCESS
T W I T T E R C O R T E X
PROJECT IDEA
Hey, let’s build a model that generates video depending
- n tweet contents and user’s
mood!
MODEL PROTOTYPING MODEL TRAINING L A B E L I N G D E B U G G I N G PREVIOUS SLIDE
That means you’re lucky
T W I T T E R C O R T E X
NEEDS SOFTWARE HARDWARE
VIDEO CLASSIFICATION
CPU NETWORK DATA PROCESSING
Data downloadDATABASE
Low latency storageDISTRIBUTED TRAINING
Training speed C- m
- d
- d
CONVOLUTIONAL LAYERS FULLY CONNECTED LAYERS LSTM
Variable temporal sequenceMULTI-GPU
Computation + Memory hungry Memory hungryLOOKUP TABLE
word dictionaryT W I T T E R C O R T E X
NEEDS SOFTWARE HARDWARE
VIDEO CLASSIFICATION
CPU NETWORK DATA PROCESSING
Data downloadDATABASE
Low latency storageDISTRIBUTED TRAINING
Training speed C- m
- d
- d
CONVOLUTIONAL LAYERS FULLY CONNECTED LAYERS LSTM
Variable temporal sequenceMULTI-GPU
Computation + Memory hungry Memory hungryLOOKUP TABLE
word dictionaryCORTEX 1.0
T W I T T E R C O R T E X
T W I T T E R C O R T E X
FORWARD BACKWARD ACCGRADPARAMETERS
Input Output Target
FORWARD
Loss output
BACKWARD
GradOutput Parameters GradInput GradParameters
nn.Module nn.Criterion
T W I T T E R C O R T E X
AUTOGRAD FUNCTION
Input Loss output Parameters GradInput GradParameters
torch-autograd function*
Target Some other stuff
T W I T T E R C O R T E X
CASCADED DEEP NETS FOR BINARY CLASSIFICATION —— DYNAMIC DAG
torch-autograd function
GITHUB.COM/TWITTER/TORCH-AUTOGRAD
T W I T T E R C O R T E X
TAXONOMY EFFORT FAST PROTOTYPING SCALABILITY EASY DEPLOYMENT
T W I T T E R C O R T E X
LARGE SCALE DATASET
T W I T T E R C O R T E X
T W I T T E R C O R T E X
NEEDS SOFTWARE HARDWARE
VIDEO CLASSIFICATION
CPU NETWORK DATA PROCESSING
Data downloadDATABASE
Low latency storageDISTRIBUTED TRAINING
Training speed C- m
- d
- d
CONVOLUTIONAL LAYERS FULLY CONNECTED LAYERS LSTM
Variable temporal sequenceMULTI-GPU
Computation + Memory hungry Memory hungryLOOKUP TABLE
word dictionaryT W I T T E R C O R T E X
DATA STORAGES THREADS MODEL INPUTS
HDFS BLOB STORAGE ……… Download/Decode 1
… . .
Download/Decode 2 Download/Decode 64 Download/Decode 2 Download/Decode 64 VIDEO BATCH 64*N*3*256*256
… . .
T W I T T E R C O R T E X
DATA STORAGES THREADS MODEL INPUTS
HDFS BLOB STORAGE ……… Download/Decode 1
… . .
Download/Decode 2 Download/Decode 64 Download/Decode 2 Download/Decode 64 VIDEO BATCH 64*N*3*256*256
T W I T T E R C O R T E X
UNLIMITED SAMPLING CAPABILITY
T W I T T E R C O R T E X
GITHUB.COM/TWITTER/TORCH-DATASET
LARGE SCALE TRAINING
T W I T T E R C O R T E X
T W I T T E R C O R T E X
NEEDS SOFTWARE HARDWARE
VIDEO CLASSIFICATION
CPU NETWORK DATA PROCESSING
Data downloadDATABASE
Low latency storageDISTRIBUTED TRAINING
Training speed C- m
- d
- d
CONVOLUTIONAL LAYERS FULLY CONNECTED LAYERS LSTM
Variable temporal sequenceMULTI-GPU
Computation + Memory hungry Memory hungryLOOKUP TABLE
word dictionaryGITHUB.COM/TWITTER/TORCH-DISTLEARN
T W I T T E R C O R T E X
LARGE SCALE INFERENCE
T W I T T E R C O R T E X
T W I T T E R C O R T E X
T W I T T E R C O R T E X
PREDICTIONS GPU CLUSTER
TAXONOMY EFFORT FAST PROTOTYPING SCALABILITY EASY DEPLOYMENT
T W I T T E R C O R T E X
T W I T T E R C O R T E X
ANYBODY CAN TRAIN PLAY DEPLOY
T W I T T E R C O R T E X
T W I T T E R C O R T E X
NEW PROJECT PROTOTYPING DEPLOYMENT LARGE SCALE CLUSTER
T W I T T E R C O R T E X
NEW PROJECT PROTOTYPING DEPLOYMENT LARGE SCALE CLUSTER
< 1 WEEKS
THANK YOU !!
T W I T T E R C O R T E X
SLIDES SKETCH
- Twitter Cortex mission statements:
- User protection
- Content Understanding
- Content indexing
- Content relevance
T W I T T E R C O R T E X
SLIDES SKETCH
- Focus on video: how is it challenging?
- Pixels distribution (darker, blurry)
- Combinatorial challenge (motion, streaming, size, mulitmodal)
T W I T T E R C O R T E X
SLIDES SKETCH
- Focus on video: what research tells us
- CNNs
- RNNs
- Multimodal?
T W I T T E R C O R T E X
SLIDES SKETCH
- Focus on video: Cortex industrial answer
- Smart taxonomy [organization] (fast labeling, feedback)
- Fast Prototyping [software] (torch-autograd)
- Scalability [software + hardware] (torch-dataset, torch-distlearn)
- Fast deployment [software] (embedded torch, no extra step)
T W I T T E R C O R T E X