Understanding the Downstream Instability of Word Embeddings
Megan Leszczynski, Avner May, Jian Zhang, Sen Wu, Chris Aberger, Chris Ré Stanford University
Understanding the Downstream Instability of Word Embeddings Megan - - PowerPoint PPT Presentation
Understanding the Downstream Instability of Word Embeddings Megan Leszczynski , Avner May, Jian Zhang, Sen Wu, Chris Aberger, Chris R Stanford University Motivation Re Recommend new conte tent Le Learn new words De Detect the latest
Megan Leszczynski, Avner May, Jian Zhang, Sen Wu, Chris Aberger, Chris Ré Stanford University
Re Recommend new conte tent De Detect the latest spam Le Learn new words
2
Changing distribution of popular videos New spam techniques Out-of-vocabulary words
Why retrain?
3
[1] Baylor et al. TFX: A TensorFlow-Based Production-Scale Machine Learning Platform. KDD, 2017. [2] Hazelwood et al. Applied Machine Learning at Facebook: A Datacenter Infrastructure Perspective. HPCA, 2018.
Data 1 Data 1 + ∆ Predictions 1 Predictions 2
4
[1] Cormier et al. Launch and Iterate: Reducing Prediction Churn. NeurIPS, 2016.
5
1 2 3 4
6
Embedding Server
Named Entity Recognition (NER) Question Answering Sentiment Analysis Relation Extraction
Changing Data Downstream Tasks Refresh Embeddings
0.1 0.3 0.5 …
7
8
9
Data 1 Data 1 + ∆ Predictions 1 Predictions 2
Emb 1 (") Emb 2 ( # ")
10
11
[1] May et al. On the downstream performance of compressed word embeddings. NeurIPS, 2019.
0.04
0.1
In Interval: [-0. 0.1, 1, 0. 0.1] 1] 32 32-bi bit 1-bi bit
Un Unif iform rm Qu Quantization
12
Sentiment Analysis NER
13
Sentiment Analysis NER
14
11%
Sentiment Analysis NER
15
16
Data 1 Data 1 + ∆ Predictions 1 Predictions 2
Downstream Instability
Emb 1 (") Emb 2 ( # ")
Distance (Emb1, Emb2)
Emb mb (!) )
S VT U
17
Singular Value Decomposition
[1] May et al. On the downstream performance of compressed word embeddings. NeurIPS, 2019.
18
19
20
21
[1] Hellrich & Hahn, COLING, 2016; [2] Antoniak & Mimno, TACL, 2018; [3] Wendlandt et al., NAACL-HLT, 2018; [4] Hamilton et al., ACL, 2016; [5] Yin & Shen, NeurIPS, 2018; [6] May et al., NeurIPS, 2019
22
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Eigenspace Instability (EIS) 1 - k-NN Measure Semantic Displacement PIP Loss 1 - Eigenspace Overlap Spearman Correlation
23
Memory % Disagreement
100-dim, 1-bit 25-dim, 4-bit 50-dim, 2-bit
Choices
Oracle
24
0.5 1 1.5 2 2.5 3 3.5 SST-2 MR SUBJ MPQA CoNLL-2003
EIS 1 - k-NN SD PIP 1 - EO
25
26
27