Fake Cures: User-centric Modeling of Health Misinformation in Social - PowerPoint PPT Presentation

Fake Cures: User-centric Modeling of Health Misinformation in Social Media 22 Oct 2018 The 21st ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW) November 3rd-7th, 2018, New York City Amira Ghenai (Waterloo University), Yelena Mejova (ISI Foundation - Turin, Italy)

Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 2 Amira Ghenai

Topic: “cancer cure” Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 3 Amira Ghenai

Topic: “cancer cure” They are all unproven treatments Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 5 Amira Ghenai

Topic: “cancer cure” They are all unproven treatments Cancer patients! Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 7 Amira Ghenai

Problem Statement § Social media use for health management is growing § 62% of internet users in U.S. use social networking sites for health related topics § Accountability, quality and confidentiality issues § Perfect medium for propagating possible medical misinformation § Serious threat to public health Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 8 Amira Ghenai

Proposed Solution “Fake cancer treatments” topic § Method: user modeling § Aim: determine characteristics of users propagating unverified “cures” of cancer on Twitter § Benefits: allow public health officials to § Detect potential sources of misinformation § Monitor social media communications § Identify current limitations and improve them § Detect new misinformation before it causes harm Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 9 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 12 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Control Group causes symptoms 1. awareness Preventions General cancer topics § Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 14 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Control Group causes symptoms 1. awareness Preventions General cancer topics § We use Paul and Dredze [1] dataset § 144 million tweets related to health topics § Dataset time period between 01 August 2011 - 28 February 2013 § Cancer topic has 676,236 users who posted 969,259 tweets § [1] Michael J Paul and Mark Dredze. 2014. Discovering health topics in social media using topic models. PloS one 9,8 (2014), e103408. Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 15 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Rumor Group 2. Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 16 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Rumor Group 2. § 139 total unproven cancer treatments from 3 different sources § Validated by trained oncologist Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 17 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Rumor Group 2. § 139 total unproven cancer treatments from 3 different sources § Validated by trained oncologist § Collect tweets about treatments: § Same time period as control group § Hand craft query & query expansion § 39,675 users with 215,109 tweets Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 18 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Topic* Expanded Query Example Tweet Soursop (Soursop:OR:Graviola:OR:guyabano: “[...] University show that the OR:guanabana:OR:"Annona:muricat soursop fruit kills cancer cells a":OR:"Annona:crassiflora":OR:"Gua effectively, particularly prostate nabanus:muricatus":OR:"Annona:bo cancer cells, pancreas and lung .” nplandiana":OR:"Annona:cearensis": OR:"Annona:muricata"):AND:cancer Ginger ginger:AND:cancer “ Can ginger help cure ovarian cancer ? Since 2007, the University of [...] has been studying GINGER ... <url>” Antineoplaston (antineoplaston:OR:burzynski):AND: “ RT Dr. Burzynski He has the cancer cure for cancer , the FDA want to shut him down <url>” * The topics (along with the keyword queries) are available at https://tinyurl.com/y78mkg6s Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 19 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Rumor Control 215,109 tweets 969,259 tweets 39,675 users 676,236 users Humanizr [2] 39,514 users 675,621 users [2] James McCorriston, David Jurgens, and Derek Ruths. 2015. Organizations Are Users Too: Characterizing and Detecting the Presence of Organizations on Twitter. In ICWSM. 650–653. Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 22 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Rumor Control 215,109 tweets 969,259 tweets 39,675 users 676,236 users Humanizr [2] 39,514 users 675,621 users Name Lexicon 24,441 users 469,494 users [2] James McCorriston, David Jurgens, and Derek Ruths. 2015. Organizations Are Users Too: Characterizing and Detecting the Presence of Organizations on Twitter. In ICWSM. 650–653. Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 23 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Rumor Control 215,109 tweets 969,259 tweets 39,675 users 676,236 users Humanizr [2] 39,514 users 675,621 users Name Lexicon 24,441 users 469,494 users Tweet Rate Filter 17,978 users 324,590 users [2] James McCorriston, David Jurgens, and Derek Ruths. 2015. Organizations Are Users Too: Characterizing and Detecting the Presence of Organizations on Twitter. In ICWSM. 650–653. Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 24 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement § We check whether every tweet is relevant to the topic of interest, we define users as follows: § Rumor group - users who claim a cure is helpful for treating cancer and not users who talk about other topics such as prevention or debunking § Control group - users who post at least once about cancer symptoms, awareness, prevention, cause or personal experience etc. but not about a cancer cure § To make our users follow these definitions, we use: § Crowdsourcing & Classification – machine learning Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 25 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Crowdsourcing 1. Sample the tweets (4,000 tweets from rumor and control groups) a) Label the sampled tweets: b) Rumor group - whether the tweet is about: Control group - whether the tweet is about: i. cancer treatment helps with i. cancer , and has personal (or treating cancer friend/family) experience ii. cancer treatment does not help ii. about cancer treatment with treating cancer (debunks the iii. other cancer-related information claim) (symptoms, awareness, prevention, iii. cancer treatment prevents cancer causes, etc.) iv. No potential cancer remedy iv. No information about cancer (Note: participants did not access the veracity of the tweets!) 184 CrowdFlower annotators contributed to the task c) A minimum of three labels collected per tweet d) Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 26 Amira Ghenai

Data Collection Rumor/Control Relevance User Selection data collection Refinement Classification 2. § We train several classifiers on the labeled tweets using 1,2,3-grams as features § We train the classifiers on the labeled tweets, which we then apply to the rest to characterize each user’s behavior § For every label in every group, we build a binary logistic regression classifier Ø Example: from the crowdsourcing task of rumor group: 2,564 were cancer cure tweets and 1,587 were not. We build the classifier and apply it to the rest of (non-labeled) rumor tweets which results in 12,685 tweets about cancer cure and 7,872 not § 7,221 rumor user and 433,883 control users Fake Cures: User-centric Modeling of Health Misinformation in Social Media PAGE 27 Amira Ghenai

Fake Cures: User-centric Modeling of Health Misinformation in Social - PowerPoint PPT Presentation

Fake Cures: User-centric Modeling of Health Misinformation in Social Media 22 Oct 2018 The 21st ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW) November 3rd-7th, 2018, New York City Amira Ghenai (Waterloo

So what is Fake News Fake news is a type of hoax or deliberate spread of misinformation: News

Overview of the 21 st Century CURES Act The CURES Act is designed to improve the quality of

You Wont Believe It: Exploring the Advertising Ecosystem of Fake News Websites Catherine Han

Fake News Do Now: How would you define fake news? Why is fake news so common?

Section 12006 of the 21 st Century CURES Act Section 12006 of the 21 st Century CURES Act

NCATS Cures Acceleration Network Review Board UPDATE JANUARY 14, 2016 Cures Acceleration Network

Quantum Cryptography 1. Fake Quantum Theory. 2. Simple Quantum Protocols. 3. More Fake Quantum

11-830 Computational Ethics for NLP Lecture 13: Fake News and Influencing Elections Fake News

Health Misinformation in Search and Social Media Amira Ghenai University of Waterloo Digital

FAKE Messages CA D K Tejwani The Chamber of Tax Consultants | April 07, 2020 2 Misinformation

Summary User-centric Social Social Multimedia Multimedia Computing From Users: user-perceptive

The spread of misinformation in social media Filippo Menczer Center for Complex Networks and

Misinformation as a Source of NIH Collaboratory Complication for Clinical Trials Grand Rounds

Journalism and Misinformation Supply, Demand, Scale Dan Gillmor Situation Too much

Studying Misinformation effect on the Episodic and Semantic memory INSTRUCTOR PROF. AMITABH

Improved Practical Efficiency for Misinformation Prevention in Social Networks Michael Simpson

Principal Component Analysis Proseminar Data Mining Tobias Holl 1 1 Technische Universitt

Transhumanism & Cyberculture CIU211 - Monica M - 1005213 Key Terms: transhumanism,

Coronavirus and The Economic Value of Human Life (Or - Is the lockdown worth it?) Julian Jessop

Poverty, democratic governance and poverty reduction strategies Paul Spicker Robert Gordon

1 BELLWORK9/16/13 Introduction to Progressivism Imagine: The year is 1900. You have always

1 7 th Grade PSI Structure and Function & Information Processing 20151107 www.njctl.org

Microcompartments, Nanoparticles and the BioBattery Alix Blackshaw, Lisza Bruder, Mackenzie

Scientific Reasoning Rubric Scientific Reasoning Rubric Revision Process at MC3 Revision Process

Sambuz

Useful Links

Newsletter

Mail Us

Fake Cures: User-centric Modeling of Health Misinformation in Social - PowerPoint PPT Presentation

Fake Cures: User-centric Modeling of Health Misinformation in Social Media 22 Oct 2018 The 21st ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW) November 3rd-7th, 2018, New York City Amira Ghenai (Waterloo

So what is Fake News Fake news is a type of hoax or deliberate spread of misinformation: News

Overview of the 21 st Century CURES Act The CURES Act is designed to improve the quality of

You Wont Believe It: Exploring the Advertising Ecosystem of Fake News Websites Catherine Han

Fake News Do Now: How would you define fake news? Why is fake news so common?

Section 12006 of the 21 st Century CURES Act Section 12006 of the 21 st Century CURES Act

NCATS Cures Acceleration Network Review Board UPDATE JANUARY 14, 2016 Cures Acceleration Network

Quantum Cryptography 1. Fake Quantum Theory. 2. Simple Quantum Protocols. 3. More Fake Quantum

11-830 Computational Ethics for NLP Lecture 13: Fake News and Influencing Elections Fake News

Health Misinformation in Search and Social Media Amira Ghenai University of Waterloo Digital

FAKE Messages CA D K Tejwani The Chamber of Tax Consultants | April 07, 2020 2 Misinformation

Summary User-centric Social Social Multimedia Multimedia Computing From Users: user-perceptive

The spread of misinformation in social media Filippo Menczer Center for Complex Networks and

Misinformation as a Source of NIH Collaboratory Complication for Clinical Trials Grand Rounds

Journalism and Misinformation Supply, Demand, Scale Dan Gillmor Situation Too much

Studying Misinformation effect on the Episodic and Semantic memory INSTRUCTOR PROF. AMITABH

Improved Practical Efficiency for Misinformation Prevention in Social Networks Michael Simpson

Principal Component Analysis Proseminar Data Mining Tobias Holl 1 1 Technische Universitt

Transhumanism &amp; Cyberculture CIU211 - Monica M - 1005213 Key Terms: transhumanism,

Coronavirus and The Economic Value of Human Life (Or - Is the lockdown worth it?) Julian Jessop

Poverty, democratic governance and poverty reduction strategies Paul Spicker Robert Gordon

1 BELLWORK9/16/13 Introduction to Progressivism Imagine: The year is 1900. You have always

1 7 th Grade PSI Structure and Function &amp; Information Processing 20151107 www.njctl.org

Microcompartments, Nanoparticles and the BioBattery Alix Blackshaw, Lisza Bruder, Mackenzie

Scientific Reasoning Rubric Scientific Reasoning Rubric Revision Process at MC3 Revision Process

Sambuz

Useful Links

Newsletter

Mail Us

Transhumanism & Cyberculture CIU211 - Monica M - 1005213 Key Terms: transhumanism,

1 7 th Grade PSI Structure and Function & Information Processing 20151107 www.njctl.org