TVM for Ads Ranking @ Facebook
Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch
TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew - - PowerPoint PPT Presentation
TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at Facebook . . . ad 1 ad 2 ad 3 ad n model 2 model 2 model 1 model 3 . . . model k batch 1 batch 2 predictions X 2 Ads Ranking at Facebook:
Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch
model 2 batch 2
2
X
predictions ad 1 ad 2 ad 3 ad n model 1 model 2 batch 1 model 3 model k . . . . . .
batches executing at the same time. In this case, weights are global and shared between threads, but activations are thread local
released at the end of each inference to avoid running out of memory
3
X
predictions ad 1 ad 2 ad 3 model 1 model 2 batch 1 model 2 batch 2
MLP: Multilayer perceptron (sequence of FC + activation function)
https://ai.facebook.com/blog/dlrm-an-advanced-open-source-deep-learning-recommendation-model/
EMB
5
Implementation
periodically
model
runtimes
Performance
Dense features + embeddings from caffe2 batch_size x graph runtime batch_size 1 graph runtime batch_size 2 graph runtime batch_size n prediction
6
Relay VM
Performance