Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation
Aymé Arango, Jorge Pérez and Bárbara Poblete
UNDETECTED HATE SPEECH IN SOCIAL MEDIA
VS
ALMOST PERFECT STATE-OF-THE-ART RESULTS
94% F1 [Agrawal and Awekar] ECIR 2018
93% F1 [Badjatiya et al.] WWW 2017
92% F1 [Zeerak Waseem] NAACL 2016
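For readers less familiar with the metric behind these numbers, the following is a minimal sketch of the F1 score for a single positive class. The slides do not state which averaging (binary, macro, weighted) the cited papers report, so the function name `f1_score` and the choice of a single positive class are assumptions made only for illustration.

```python
def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

A score in the 92% to 94% range therefore requires both precision and recall to be high on the evaluated split, which is why the way that split is built matters so much.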
Including the testing set during training phase
Oversampling the data before splitting
User-biased datasets
DATASET 1 [Waseem and Hovy] NAACL 2016
Tweet | Label (Hate / Non-Hate)
Model 1 [Badjatiya et al.] 2017
PHASE 1 Feature Extraction
PHASE 2 Classification Method
Input: DATASET 1 [Waseem and Hovy] NAACL 2016
PHASE 1 Feature Extraction (input: DATASET 1): Embeddings → LSTM → Fully Connected → Softmax → Prediction
PHASE 2 Classification Method: DATASET 1 → Splitting → TRAIN / TEST
AVG(Embeddings) → GBDT → Prediction
Issue: the TEST tweets from the Splitting are also part of the PHASE 1 input (Embeddings → LSTM → Fully Connected → Softmax → Prediction), so the testing set is included during the training phase.
Model 1 [Badjatiya et al.] 2017, same Splitting (TRAIN / TEST)
New PHASE 1 Feature Extraction (TRAIN only): Embeddings → LSTM → Fully Connected → Softmax → Prediction
PHASE 2 Classification Method: AVG(Embeddings) → GBDT → Prediction
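The difference between the original and the corrected pipeline can be illustrated with a toy example. This is only a sketch under stated assumptions: a supervised per-token score (hypothetical `fit_token_scores`) stands in for the Embeddings + LSTM feature extractor, and the data is synthetic. It is not the code of Badjatiya et al. or of the presenters; it only shows why fitting a label-aware feature extractor on TRAIN plus TEST leaks the test labels into the features.

```python
import random
from collections import defaultdict

def fit_token_scores(texts, labels):
    """Toy supervised feature extractor: per token, the fraction of
    tweets containing it that are labeled hateful (label 1)."""
    counts = defaultdict(lambda: [0, 0])  # token -> [hateful count, total count]
    for text, label in zip(texts, labels):
        for tok in set(text.split()):
            counts[tok][0] += label
            counts[tok][1] += 1
    return {tok: hate / total for tok, (hate, total) in counts.items()}

def featurize(text, scores):
    """Average the learned token scores; unseen tokens get a neutral 0.5."""
    toks = text.split()
    return sum(scores.get(t, 0.5) for t in toks) / len(toks)

def split(texts, labels, test_fraction=0.2, seed=0):
    """Shuffle indices once and cut them into TRAIN and TEST."""
    idx = list(range(len(texts)))
    random.Random(seed).shuffle(idx)
    n_test = max(1, round(len(idx) * test_fraction))
    test_idx, train_idx = idx[:n_test], idx[n_test:]
    pick = lambda seq, ids: [seq[i] for i in ids]
    return (pick(texts, train_idx), pick(labels, train_idx),
            pick(texts, test_idx), pick(labels, test_idx))

# Synthetic data: every tweet has one unique token plus a shared one.
texts = [f"tok{i} filler" for i in range(10)]
labels = [i % 2 for i in range(10)]
train_x, train_y, test_x, test_y = split(texts, labels)

# Flawed PHASE 1: features are learned from TRAIN and TEST together.
leaky_scores = fit_token_scores(texts, labels)
# Corrected PHASE 1: features are learned from TRAIN only.
clean_scores = fit_token_scores(train_x, train_y)
```

With `leaky_scores`, the feature of each test tweet already encodes its own label, so any PHASE 2 classifier looks almost perfect. With `clean_scores`, the unique tokens of the test tweets are unseen and carry no label information.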
Model 2 [Agrawal and Awekar] 2018
DATASET 1 [Waseem and Hovy] NAACL 2016 → Oversampling Data → Feature Extraction + Classification Method
DATASET 1 → Oversampling → Splitting → TRAIN / TEST
Embeddings → LSTM → Fully Connected → Softmax → Prediction
Model 2 [Agrawal and Awekar] 2018, corrected order
DATASET 1 [Waseem and Hovy] NAACL 2016 → Splitting → TRAIN / TEST → Oversampling (TRAIN only)
Embeddings → LSTM → Fully Connected → Softmax → Prediction
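The effect of the order of the two steps can be sketched as follows. The example assumes oversampling by plain duplication of minority-class tweets and uses synthetic data. The helper names (`oversample`, `split`, `leaked`) are hypothetical and are not taken from Agrawal and Awekar's code.

```python
import random

def oversample(rows, seed=0):
    """Duplicate random rows of the smaller classes until all classes
    have as many rows as the largest one."""
    rng = random.Random(seed)
    by_label = {}
    for row in rows:
        by_label.setdefault(row[1], []).append(row)
    target = max(len(group) for group in by_label.values())
    out = []
    for group in by_label.values():
        out.extend(group)
        out.extend(rng.choice(group) for _ in range(target - len(group)))
    return out

def split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and return (train, test)."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_test = max(1, round(len(rows) * test_fraction))
    return rows[n_test:], rows[:n_test]

def leaked(train, test):
    """Number of test rows whose exact text also appears in train."""
    train_texts = {text for text, _ in train}
    return sum(1 for text, _ in test if text in train_texts)

# Synthetic imbalanced data: 5 hateful tweets, 35 others, all texts unique.
rows = [(f"hate {i}", 1) for i in range(5)] + [(f"other {i}", 0) for i in range(35)]

# Flawed order: oversample, then split. Copies of the same tweet
# can end up on both sides of the split.
train_a, test_a = split(oversample(rows))

# Corrected order: split first, then oversample TRAIN only.
train_b, test_b = split(rows)
train_b = oversample(train_b)
```

Calling `leaked(train_a, test_a)` and `leaked(train_b, test_b)` compares the two orders. In the corrected order the count is zero by construction, because the test tweets are set aside before any duplication happens.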
[Figure: % Tweets from the most prolific user per class. Classes shown: Hate, Non-Hate, Sexism, Racism. Values shown: 96%, 44%, 38%, 25%.]
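A statistic of this kind can be computed with a few lines of code. The sketch below assumes each tweet is available as a `(user_id, label)` pair, which is a hypothetical input format. The function name `top_user_share` is likewise only illustrative.

```python
from collections import Counter, defaultdict

def top_user_share(tweets):
    """tweets: iterable of (user_id, label) pairs.
    Returns {label: fraction of that class written by its most prolific user}."""
    per_class = defaultdict(Counter)
    for user, label in tweets:
        per_class[label][user] += 1
    return {
        label: users.most_common(1)[0][1] / sum(users.values())
        for label, users in per_class.items()
    }
```

A value close to 1 for a class means that a classifier can score well on that class by recognizing one author's writing style instead of hateful content.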
TEST TRAIN Splitting without
DATASET 1 [Waseem and Hovy] NAACL 2016
Model 1 [Badjatiya et al.] 2017
Model 2 [Agrawal and Awekar] 2018
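The slide text describing the splitting is cut off after "without". One plausible reading, given the user-bias issue, is a split in which no user contributes tweets to both TRAIN and TEST. The sketch below implements that reading as an explicit assumption. The tuple format `(user_id, text, label)` and the name `split_by_user` are hypothetical.

```python
import random

def split_by_user(tweets, test_fraction=0.2, seed=0):
    """tweets: list of (user_id, text, label) tuples.
    Whole users are assigned to TEST, so no user appears in both sets.
    Assumption: this is what the truncated slide text refers to."""
    users = sorted({user for user, _, _ in tweets})
    random.Random(seed).shuffle(users)
    n_test = max(1, round(len(users) * test_fraction))
    test_users = set(users[:n_test])
    train = [t for t in tweets if t[0] not in test_users]
    test = [t for t in tweets if t[0] in test_users]
    return train, test
```

Note that `test_fraction` here is a fraction of users, not of tweets, so the resulting TEST size depends on how prolific the selected users are.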
NEW DATASET
DATASET 1: 250 tweets per user per class
DATASET 2 [Davidson et al.] ICWSM 2017: Hateful tweets
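The "250 tweets per user per class" restriction can be sketched as a simple filter. Two assumptions are made here: that 250 is an upper bound, and that the first tweets encountered are the ones kept. The slides do not say how the retained tweets were selected, and the name `cap_per_user_per_class` is hypothetical.

```python
from collections import defaultdict

def cap_per_user_per_class(tweets, limit=250):
    """tweets: iterable of (user_id, text, label) tuples.
    Keeps at most `limit` tweets for every (user, class) pair,
    in order of appearance (an assumption, see the note above)."""
    kept = []
    seen = defaultdict(int)  # (user_id, label) -> tweets kept so far
    for user, text, label in tweets:
        if seen[(user, label)] < limit:
            seen[(user, label)] += 1
            kept.append((user, text, label))
    return kept
```

Capping each user's contribution limits how much a single prolific author can dominate a class, at the cost of discarding part of the data.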
NEW DATASET TEST TRAIN Splitting without
Model 1 [Badjatiya et al.] 2017
Model 2 [Agrawal and Awekar] 2018
TRAINING SET → TESTING SET
Model 1 [Badjatiya et al.] 2017
DATASET 1 [Waseem and Hovy] NAACL 2016 → DATASET 3 [Basile et al.] SemEval 2019
NEW DATASET → DATASET 3 [Basile et al.] SemEval 2019
Model 2 [Agrawal and Awekar] 2018
DATASET 1 [Waseem and Hovy] NAACL 2016 → DATASET 3 [Basile et al.] SemEval 2019
NEW DATASET → DATASET 3 [Basile et al.] SemEval 2019
Including the testing set during training phase
Oversampling the data before splitting
User-biased datasets