adversarial training for satire detection controlling for
play

Adversarial Training for Satire Detection: Controlling for - PowerPoint PPT Presentation

University of Stuttgart Institute for Natural Language Processing Adversarial Training for Satire Detection: Controlling for Confounding Variables June 3rd, 2019 Robert McHardy, Heike Adel and Roman Klinger Satire & Research Goals


  1. University of Stuttgart Institute for 
 Natural Language Processing Adversarial Training for Satire Detection: Controlling for Confounding Variables June 3rd, 2019 Robert McHardy, Heike Adel and Roman Klinger

  2. Satire & Research Goals Model/Data Experiments & Results Conclusion Motivation 1: Satire or not? “ After years of �ghting there �nally is a settlement between the Gema and Youtube . It became known today , that in future every music video is allowed to be played back in Germany again, as long as the audio is removed” (translated from German) University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 2 / 12

  3. Satire & Research Goals Model/Data Experiments & Results Conclusion Motivation 2: Satire or not? “Erfurt ( dpo ) – It is an organization which operates outside of law and order, funds numerous NPD operatives and is to a not inconsiderable extent involved in the series of murders of the so-called Zwickauer Zelle. ” (translated from German) DPA is a German news agency – DPO does not exist (in this context). University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 3 / 12

  4. Outline 1 Satire, Previous Work and Research Goals 2 Model and Data 3 Experiments & Results 4 Conclusion & Availability

  5. Satire & Research Goals Model/Data Experiments & Results Conclusion Satire ● Form of art to critize in an entertaining manner ● Stylistic devices include humor, irony, sarcasm ● Goal: Mimic regular news in diction ● It’s not misinformation or desinformation (fake news): Articles typically contain satire markers (similar to irony or sarcasm) Automatic Satire Detection Automatically distinguish satirical news from regular news ⇒ Challenging task (even for humans) University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 4 / 12

  6. Satire & Research Goals Model/Data Experiments & Results Conclusion Previous Work Yang et al. 2017 , De Sarkar et al. 2018 ● Created data sets which are automatically labeled from publication source ● Potential limitation: Models might learn characteristics of publication sources instead of actual characteristics of satire ● (evaluation is not faulty, they use di�erent publication sources for validation than for training) ⇒ Bad generalization to unseen publication sources? ⇒ Interpretation of models (regarding concepts of satire) misleading? University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 5 / 12

  7. Satire & Research Goals Model/Data Experiments & Results Conclusion Our Contributions ● We propose adversarial training: Improve robustness of model against confounding variable of publication sources ● We show that adversarial training is crucial for the model to pay attention to satire instead of publication characteristics ● We publish a large German data set for satire detection. ● First dataset in German ● First dataset including publication sources ● Largest resource for satire detection so far University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 6 / 12

  8. Outline 1 Satire, Previous Work and Research Goals 2 Model and Data 3 Experiments & Results 4 Conclusion & Availability

  9. Satire & Research Goals Model/Data Experiments & Results Conclusion Model feature extractor input layer LSTM layer attention layer ∂ J s − λ ∂ J p ∂ θ f ∂ θ f satire publication detector identifier ∂ J p ∂ J s ∂ θ p ∂ θ s satire? (yes/no) publication name University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 7 / 12

  10. Satire & Research Goals Model/Data Experiments & Results Conclusion Data Collection and Selection ● Regular news: Der Spiegel, Der Standard, Die Zeit, Süddeutsche Zeitung ● Satire: Der Enthüller, Eulenspiegel, Nordd. Nach., Der Postillon, Satirepatzer, Die Tagespresse, Titanic, Welt (Satire), Der Zeitspiegel, Eine Zeitung, Zynismus24 ● Articles from January 1st, 2000 and May 1st, 2018 Average Length Publication #Articles Article Sent. Title Regular 320,219 663.45 17 .79 6.86 Satire 9,643 269.28 18.73 9.52 University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 8 / 12

  11. Satire & Research Goals Model/Data Experiments & Results Conclusion Research Question 1: Performance How does a decrease in publication classi�cation performance through adversarial training a�ect the satire classi�cation performance? � ��� � �� � � � ������� � �� � �� � �� � � ������ � �������� ���� ���� ���� �� � ���� ��� ���� ���� � ���� ���� � ���� �������� ����� University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 9 / 12

  12. Erfurt ( dpo ) - It is an organization which operates outside of law and order , funds numerous NPD operatives and is to a not inconsiderable extent involved in the series discussed , whereof the Union hopes for an ofg-putting efgect . After all , the proposal to allow family reunion only inclusive mothers-in-law is being discussed , whereof the Union hopes for an ofg-putting efgect . After all , the proposal to allow family reunion only inclusive mothers-in-law is being of murders of the so called Zwickauer Zelle . numerous NPD operatives and is to a not inconsiderable extent involved in the series of murders of the so called Zwickauer Zelle . Erfurt ( dpo ) - It is an organization which operates outside of law and order , funds Satire & Research Goals Model/Data Experiments & Results Conclusion Research Question 2: Attention Weights Is adversarial training e�ective for avoiding that the model pays most attention to the characteristics of publication source rather than actual satire? no adv adv no adv adv University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 10 / 12

  13. Satire & Research Goals Model/Data Experiments & Results Conclusion Conclusion and Availability ● Observation: Satire detection models learn characteristics of publication sources Our Contributions ● Adversarial training to control for this confounding variable ⇒ Considerable reduction of publication identi�cation performance while satire detection remains on comparable levels ⇒ Attention weights show e�ectiveness of our approach ● First German dataset for satire detection ⇒ Dataset and code available at: http://www.ims.uni-stuttgart.de/data/germansatire University of Stuttgart McHardy/Adel/Klinger June 3rd, 2019 11 / 12

  14. University of Stuttgart Institute for 
 Natural Language Processing Adversarial Training for Satire Detection: Controlling for Confounding Variables June 3rd, 2019 Robert McHardy, Heike Adel and Roman Klinger

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend