An Interpretable Joint Graphical Model for Fact-Checking from Crowds - PowerPoint PPT Presentation

An Interpretable Joint Graphical Model for Fact-Checking from Crowds An T. Nguyen 1 Aditya Kharosekar 1 Matthew Lease 1 Byron C. Wallace 2 1 University of Texas at Austin 2 Northeastern University 1

Problems Given a claim: Facebook Shut Down an AI Experiment Because Chatbots Developed Their Own Language. 2

Problems Given a claim: Facebook Shut Down an AI Experiment Because Chatbots Developed Their Own Language. and relevant article headlines: No, Facebook Did Not Panic and Shut Down an AI Program That Was Getting Dangerously Smart. source: gizmodo.com 2

Problems Given a claim: Facebook Shut Down an AI Experiment Because Chatbots Developed Their Own Language. and relevant article headlines: No, Facebook Did Not Panic and Shut Down an AI Program That Was Getting Dangerously Smart. source: gizmodo.com Predict headline stance: For Against Observing 2

Problems Given a claim: Facebook Shut Down an AI Experiment Because Chatbots Developed Their Own Language. and relevant article headlines: No, Facebook Did Not Panic and Shut Down an AI Program That Was Getting Dangerously Smart. source: gizmodo.com Predict headline stance: For Against Observing Predict claim veracity: False True Unknown 2

Problems Given a claim: Facebook Shut Down an AI Experiment Because Chatbots Developed Their Own Language. and relevant article headlines: No, Facebook Did Not Panic and Shut Down an AI Program That Was Getting Dangerously Smart. source: gizmodo.com Predict headline stance: For Against Observing Predict claim veracity: False True Unknown Our motivation: ◮ Make sense of general claims incl. scientific, historical, ... 2

Problems Given a claim: Facebook Shut Down an AI Experiment Because Chatbots Developed Their Own Language. and relevant article headlines: No, Facebook Did Not Panic and Shut Down an AI Program That Was Getting Dangerously Smart. source: gizmodo.com Predict headline stance: For Against Observing Predict claim veracity: False True Unknown Our motivation: ◮ Make sense of general claims incl. scientific, historical, ... ◮ Not just “fake news”. 2

Solutions Previous work: 3

Solutions Previous work: ◮ Predict stance from text features (Ferreira& Vlachos 2016). 3

Solutions Previous work: ◮ Predict stance from text features (Ferreira& Vlachos 2016). ◮ Predict veracity from stance+source features (Popat et al. 2017) 3

Solutions Previous work: ◮ Predict stance from text features (Ferreira& Vlachos 2016). ◮ Predict veracity from stance+source features (Popat et al. 2017) We proposed: ◮ Crowdsource stance labels. 3

Solutions Previous work: ◮ Predict stance from text features (Ferreira& Vlachos 2016). ◮ Predict veracity from stance+source features (Popat et al. 2017) We proposed: ◮ Crowdsource stance labels. ◮ Hybrid human AI ◮ Available near real-time 3

Solutions Previous work: ◮ Predict stance from text features (Ferreira& Vlachos 2016). ◮ Predict veracity from stance+source features (Popat et al. 2017) We proposed: ◮ Crowdsource stance labels. ◮ Hybrid human AI ◮ Available near real-time ◮ Joint graphical model of stance, veracity, annotators. 3

Solutions Previous work: ◮ Predict stance from text features (Ferreira& Vlachos 2016). ◮ Predict veracity from stance+source features (Popat et al. 2017) We proposed: ◮ Crowdsource stance labels. ◮ Hybrid human AI ◮ Available near real-time ◮ Joint graphical model of stance, veracity, annotators. ◮ Interaction between variables ◮ Interpretable 3

Powered by TCPDF (www.tcpdf.org) Powered by TCPDF (www.tcpdf.org) Model B U L A c lablers V S W T n claims R m sources 4

Powered by TCPDF (www.tcpdf.org) Powered by TCPDF (www.tcpdf.org) Model B U L A c lablers V S W 1. Predict Stance S ◮ Text features T T n claims R m sources Powered by TCPDF (www.tcpdf.org) 4

Powered by TCPDF (www.tcpdf.org) Powered by TCPDF (www.tcpdf.org) Model B U L A c lablers V S W 1. Predict Stance S ◮ Text features T T n claims 2. Predict Veracity V R ◮ Stance S m sources Powered by TCPDF (www.tcpdf.org) ◮ Reputation R 4

Model B 1. Predict Stance S ◮ Text features T U L A 2. Predict Veracity V c lablers ◮ Stance S V S W ◮ Reputation R 3. Stance Label L T ◮ True stance S n claims ◮ Annotator competence A R m sources Powered by TCPDF (www.tcpdf.org) Powered by TCPDF (www.tcpdf.org) 4

Inference & Learning Inference: ◮ Gibbs sampling: accurate but slow. 5

Inference & Learning Inference: ◮ Gibbs sampling: accurate but slow. ◮ Variational inference: fast but biased. 5

Inference & Learning Inference: ◮ Gibbs sampling: accurate but slow. ◮ Variational inference: fast but biased. Learning: Expectation Maximization. 5

Inference & Learning Inference: ◮ Gibbs sampling: accurate but slow. ◮ Variational inference: fast but biased. Learning: Expectation Maximization. Details in the paper. 5

Evaluation Data: Emergent (Ferreira and Vlachos 2016) ◮ 300 claims. ◮ 2595 articles with stance labels. 6

Evaluation Data: Emergent (Ferreira and Vlachos 2016) ◮ 300 claims. ◮ 2595 articles with stance labels. ◮ We collected: crowd stance labels by Mechanical Turk. 6

Evaluation Data: Emergent (Ferreira and Vlachos 2016) ◮ 300 claims. ◮ 2595 articles with stance labels. ◮ We collected: crowd stance labels by Mechanical Turk. Baseline: Separated models for stance, veracity & crowd labels. 6

Evaluation Data: Emergent (Ferreira and Vlachos 2016) ◮ 300 claims. ◮ 2595 articles with stance labels. ◮ We collected: crowd stance labels by Mechanical Turk. Baseline: Separated models for stance, veracity & crowd labels. Metric: Brier score, measures accuracy and prob. calibration. 6

Results 7

User study Interface: users enter claims, see predictions. 8

User study Interface: users enter claims, see predictions. A/B testing 8

User study Interface: users enter claims, see predictions. A/B testing ◮ A: see only veracity predictions 8

User study Interface: users enter claims, see predictions. A/B testing ◮ A: see only veracity predictions ◮ B: also see explanation (reputation, stances) 8

User study: results 9

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. 10

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. ◮ We contribute: crowdsourcing + joint modeling. 10

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. ◮ We contribute: crowdsourcing + joint modeling. Paper: experiments on Snopes. 10

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. ◮ We contribute: crowdsourcing + joint modeling. Paper: experiments on Snopes. Demo: fcweb.pythonanywhere.com 10

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. ◮ We contribute: crowdsourcing + joint modeling. Paper: experiments on Snopes. Demo: fcweb.pythonanywhere.com We share code + data 10

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. ◮ We contribute: crowdsourcing + joint modeling. Paper: experiments on Snopes. Demo: fcweb.pythonanywhere.com We share code + data Acknowledge: Crowd annotator, reviewers, NSF. 10

Conclusion Takeaway: ◮ Stance/Veracity predictions are hard. ◮ We contribute: crowdsourcing + joint modeling. Paper: experiments on Snopes. Demo: fcweb.pythonanywhere.com We share code + data Acknowledge: Crowd annotator, reviewers, NSF. Questions? 10

An Interpretable Joint Graphical Model for Fact-Checking from Crowds - PowerPoint PPT Presentation

An Interpretable Joint Graphical Model for Fact-Checking from Crowds An T. Nguyen 1 Aditya Kharosekar 1 Matthew Lease 1 Byron C. Wallace 2 1 University of Texas at Austin 2 Northeastern University 1 Problems Given a claim: Facebook Shut Down an

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Hoare Logic and Model Checking Model Checking Lecture 11: Model checking for Computation Tree

A Review of Fact-Checking, Fake News Detection and Argumentation Tariq Alhindi March 02, 2020

Recent Advances in Automated Fact Checking Immanuel Trummer Cornell University

Interpretable sets in o-minimal structures Will Johnson March 27, 2015 Will Johnson

Scalable Multi-Core Model Checking Alfons Laarman ( alfons@laarman.com ), Theory joint work with

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

CTL Chapter 6 Part 2 Overview Review CTL Model Checking CTL model Checking algorithms

Model-checking in systems biology - From Micro to Macro 1 / 62 00001 - 00:00:01 Model-checking

Mechanized Metatheory Model-Checking WMM 2006 James Cheney 9/21/06 Mechanized Metatheory

Faculty-Administrator Collaboration Team(FACT) FDP Meeting Jan 2020 Informing the Future of

Building Concurrency Primitives CS 450 : Operating Systems Michael Lee <lee@iit.edu>

Xen Crash Dumps on Solaris Nils Nieuwejaar Senior Staff Engineer Solaris Kernel Technology

First Lab Something: promoting proper laboratory learning through intelligent course

Knit: Component Composition for Systems Software Alastair Reid, Matthew Flatt, Leigh Stoller,

DONT PANIC HOW TO ENJOY & SURVIVE PHD DR. PAULI MIETTINEN, MPI INFORMATICS TEKSTI

14:332:231 DIGITAL LOGIC DESIGN Ivan Marsic, Rutgers University Electrical & Computer

and Meson Production at JLab/CLAS Hyon-Suk Jo for the CLAS collaboration IPN Orsay PANIC 2011

LINUX VULNERABILITIES, WINDOWS EXPLOITS Escalating Privileges with WSL Saar Amar Recon brx 2018

An Interpretable Joint Graphical Model for Fact-Checking from Crowds - PowerPoint PPT Presentation

An Interpretable Joint Graphical Model for Fact-Checking from Crowds An T. Nguyen 1 Aditya Kharosekar 1 Matthew Lease 1 Byron C. Wallace 2 1 University of Texas at Austin 2 Northeastern University 1 Problems Given a claim: Facebook Shut Down an

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Hoare Logic and Model Checking Model Checking Lecture 11: Model checking for Computation Tree

A Review of Fact-Checking, Fake News Detection and Argumentation Tariq Alhindi March 02, 2020

Recent Advances in Automated Fact Checking Immanuel Trummer Cornell University

Interpretable sets in o-minimal structures Will Johnson March 27, 2015 Will Johnson

Scalable Multi-Core Model Checking Alfons Laarman ( alfons@laarman.com ), Theory joint work with

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

CTL Chapter 6 Part 2 Overview Review CTL Model Checking CTL model Checking algorithms

Model-checking in systems biology - From Micro to Macro 1 / 62 00001 - 00:00:01 Model-checking

Mechanized Metatheory Model-Checking WMM 2006 James Cheney 9/21/06 Mechanized Metatheory

Faculty-Administrator Collaboration Team(FACT) FDP Meeting Jan 2020 Informing the Future of

Building Concurrency Primitives CS 450 : Operating Systems Michael Lee &lt;lee@iit.edu&gt;

Xen Crash Dumps on Solaris Nils Nieuwejaar Senior Staff Engineer Solaris Kernel Technology

First Lab Something: promoting proper laboratory learning through intelligent course

Knit: Component Composition for Systems Software Alastair Reid, Matthew Flatt, Leigh Stoller,

DONT PANIC HOW TO ENJOY &amp; SURVIVE PHD DR. PAULI MIETTINEN, MPI INFORMATICS TEKSTI

14:332:231 DIGITAL LOGIC DESIGN Ivan Marsic, Rutgers University Electrical &amp; Computer

and Meson Production at JLab/CLAS Hyon-Suk Jo for the CLAS collaboration IPN Orsay PANIC 2011

LINUX VULNERABILITIES, WINDOWS EXPLOITS Escalating Privileges with WSL Saar Amar Recon brx 2018

Building Concurrency Primitives CS 450 : Operating Systems Michael Lee <lee@iit.edu>

DONT PANIC HOW TO ENJOY & SURVIVE PHD DR. PAULI MIETTINEN, MPI INFORMATICS TEKSTI

14:332:231 DIGITAL LOGIC DESIGN Ivan Marsic, Rutgers University Electrical & Computer