Fools’ Gold:
Understanding the Linguistic Features of Deception and Humour Through April Fools’ Hoaxes
e.dearden@lancaster.ac.uk
Ed Dearden
Fools Gold: Understanding the Linguistic Features of Deception and - - PowerPoint PPT Presentation
Fools Gold: Understanding the Linguistic Features of Deception and Humour Through April Fools Hoaxes Ed Dearden e.dearden@lancaster.ac.uk Hell Planet Why do we care about April Fools? False Information But where does April
Understanding the Linguistic Features of Deception and Humour Through April Fools’ Hoaxes
e.dearden@lancaster.ac.uk
Ed Dearden
Is the author trying to deceive me?
Not Deceive? Deceive?
Vagueness Imagination Humour Deception Details Formality Complexity
Vagueness CLAWS Ambiguity Superlatives Exaggeration Comparative Adverbs Degree Adverbs Vague Degree USAS Ambiguity Wordnet Ambiguity
Details Time Related Spatial Terms Dates Numbers Proper Nouns Sense Terms Motion Terms
Imagination Imaginative Conjunctions Articles Adjectives Imaginative Determiners Prepositions Informative Verbs Imaginative Verbs
Deception Negative Emotional Terms Negations First Person Pronouns
Humour Positive Emotion Body Contextual Imbalance Profanity Alliteration Relationships Head Contextual Imbalance
Formality Associated Press Title Guidelines Spelling Errors Associated Press Date Guidelines Associated Press Number Guidelines
Complexity Average Sentence Length Lexical Diversity Lexical Density Function Words Readability Body Punctuation Head Punctuation
Corpus Create feature matrix Feature Selection Classification Analysis
Feature 1 … Feature N Class 0.111 … 0.552 AF … … … … 0.444 … 0.654 NAF
Which features are most informative? Can we learn to automatically differentiate? What do the results mean?
Compl plexity ty
lity
l Diversity
Details ls
lated Term rms
Im Imagination
Decepti tion
n Prono nouns uns
Formali lity ty
Vag agueness
rbs
Feature 1 … Feature N Class 0.111 … 0.552 AF … … … … 0.444 … 0.654 NAF Artjcle Predictjon Truth 1 AF AF 2 NAF AF … … … n-1 NAF NAF n AF NAF
Classification Accuracies for all Feature Sets
Hoax Set: 74% Bag-of-Words: 80% Complexity: 71% + Detail
1. One classifier trained on Fake News. 2. Second Classifier trained on April Fools’ and tested on Fake News.
Classification Accuracies for Fake News
Hoax Set: 76.9% Bag-of-Words: 77.7% Complexity: 78.1% + Detail
Classification Accuracies for Fake News
Hoax Set: 64.5% Bag-of-Words: 49.4% Complexity: 65.7% + Detail Complexity: 75.7%
Created a a ne new c corp rpus us of April F ril Fools ls’ h hoax axes.
detection to classify hoaxes with moderate success.
seem to be the most important.
Created a new corpus of April Fools’ hoaxes. Us
Used f d features f from d m deceptio tion, humo umour, an r, and ir d irony ny de detectio tion t n to c clas assif ify h hoax axes w with ith mo mode derate s suc uccess.
Showed that features relating to complexity and detail
seem to be the most important.
detection to classify hoaxes with moderate success.
Showed d that f t featu atures r rela lating ting t to compl mplexit ity an and d d detail ail seem t m to be be the mo most t impo important. ant.
und th that s at simil imilar ar featur atures ar are us useful in l in ide identif tifyin ying Apr April il Fools ls’ an ’ and d Fake Ne News.
for both AF Hoaxes and Fake News.
Fools’ and Fake News.
Some o
these featur tures ma manif nifest th t thems mselv lves s simil imilarly arly for bo both th AF H AF Hoax axes and and F Fak ake N News.
Thanks for listening!