SemEval-2019 Task 4: Hyperpartisan News Detection
Johannes Maria Rishabh Emmanuel Payam David Benno Martin Kiesel1 Mestre2 Shukla2 Vincent2 Adineh1 Corney Stein1 Potthast3
Webis
Bauhaus-Universität Weimar1, Leipzig University3
2
1 @KieselJohannes
SemEval-2019 Task 4: Hyperpartisan News Detection Johannes Maria - - PowerPoint PPT Presentation
SemEval-2019 Task 4: Hyperpartisan News Detection Johannes Maria Rishabh Emmanuel Payam David Benno Martin Kiesel 1 Mestre 2 Shukla 2 Vincent 2 Adineh 1 Stein 1 Potthast 3 Corney Webis Bauhaus-Universitt Weimar 1 , Leipzig University 3
2
1 @KieselJohannes
2 @KieselJohannes
3 @KieselJohannes
4 @KieselJohannes
5 @KieselJohannes
6 @KieselJohannes
7 @KieselJohannes
8 @KieselJohannes
doi.org/10.5281/zenodo.1489920
❑ Dataset Annotated by Article: 1 273 articles. ❑ Manual annotation of each article by crowdworkers.
9 @KieselJohannes
doi.org/10.5281/zenodo.1489920
❑ Dataset Annotated by Article: 1 273 articles. ❑ Manual annotation of each article by crowdworkers.
❑ Dataset Annotated by Publisher: 754 000 articles. ❑ Manual annotation of each publisher by journalists.
10 @KieselJohannes
11 @KieselJohannes
❑ 322 registrations ❑ 184 virtual machines assigned ❑ 42 software submissions from as many teams ❑ 34 papers ❑ Ongoing submissions in
pan.webis.de/semeval19/ semeval19-web/leaderboard.html
12 @KieselJohannes
❑ Meta-learning dataset created from
❑ Higher accuracy (from 0.822) ❑ Baselines beat best single system ❑ Both participants beat the baselines ❑ They use a Random Forest and a
❑ Ongoing submissions in
13 @KieselJohannes
❑ Meta-learning dataset created from
❑ Higher accuracy (from 0.822) ❑ Baselines beat best single system ❑ Both participants beat the baselines ❑ They use a Random Forest and a
❑ Ongoing submissions in
Vernon Fenwick Bertha von Suttner Borat Sagdiyev yes no yes no yes no Howard Beale yes no Ned Leeds yes no 13 193 2 160 17 26 22 5 10 22 3 6
14 @KieselJohannes
❑ Meta-learning dataset created from
❑ Higher accuracy (from 0.822) ❑ Baselines beat best single system ❑ Both participants beat the baselines ❑ They use a Random Forest and a
❑ Ongoing submissions in
Vernon Fenwick Bertha von Suttner Borat Sagdiyev yes no yes no yes no Howard Beale yes no Ned Leeds yes no 13 193 2 160 17 26 22 5 10 22 3 6
15 @KieselJohannes
❑ 28 teams (of 42) ❑ Lower accuracy (from 0.822) ❑ Most teams focused on the other dataset ❑ Ranking very different ❑ Ongoing submissions in
pan.webis.de/semeval19/ semeval19-web/leaderboard.html
16 @KieselJohannes
❑ 28 teams (of 42) ❑ Lower accuracy (from 0.822) ❑ Most teams focused on the other dataset ❑ Ranking very different ❑ Ongoing submissions in
40 39 38 36 34 32 30 29 27 26 25 24 22 21 20 16 15 14 13 11 9 8 7 6 5 4 3 1 Rank for dataset annotated by article 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 Rank for dataset annotated by publisher
17 @KieselJohannes
❑ Two datasets, newest version downloaded ∼450 times
18 @KieselJohannes
❑ Two datasets, newest version downloaded ∼450 times ❑ Features reported to be especially efficient: embeddings, n-grams, sentiment ❑ So far, 10 teams released their code open source
19 @KieselJohannes
❑ Two datasets, newest version downloaded ∼450 times ❑ Features reported to be especially efficient: embeddings, n-grams, sentiment ❑ So far, 10 teams released their code open source ❑ Very high accuracy: 0.8 to 0.9 ❑ Submission still open!
20 @KieselJohannes
❑ Two datasets, newest version downloaded ∼450 times ❑ Features reported to be especially efficient: embeddings, n-grams, sentiment ❑ So far, 11 teams released their code open source ❑ Very high accuracy: 0.8 to 0.9 ❑ Submission still open!
21 @KieselJohannes
❑ Two datasets, newest version downloaded ∼450 times ❑ Features reported to be especially efficient: embeddings, n-grams, sentiment ❑ So far, 11 teams released their code open source ❑ Very high accuracy: 0.8 to 0.9 ❑ Submission still open!
22 @KieselJohannes
❑ Two datasets, newest version downloaded ∼450 times ❑ Features reported to be especially efficient: embeddings, n-grams, sentiment ❑ So far, 11 teams released their code open source ❑ Very high accuracy: 0.8 to 0.9 ❑ Submission still open! ❑ Challenge ahead: explainability
23 @KieselJohannes