Automatically identifying changes in the semantic orientation of - PowerPoint PPT Presentation

Automatically identifying changes in the semantic orientation of words Paul Cook and Suzanne Stevenson University of Toronto

Amelioration and pejoration ● Changes in a word's meaning to have a more positive or negative evaluation ● Historical examples – Amelioration: Urbane – Pejoration: Hussy ● Contemporary examples – Amelioration: Pimp – Pejoration: Gay 2

Challenges ● Natural language processing – Many systems for sentiment analysis require appropriate and up-to-date polarity lexicons ● Lexicography – Identify new word senses and changes in established senses to keep dictionaries current 3

Inferring semantic orientation ● Semantic orientation from association with known positive and negative words – T urney and Littman's (2003) SO-PMI ● A difference in polarity between corpora of differing time periods indicates amelioration or pejoration 4

General Inquirer Dictionary ● Lexicon intended for text analysis – Some entries mark positive or negative outlook ● Seed words: All words labelled positive or negative (but not both) ● 1621 positive seeds, 1989 negative seeds – T urney and Littman: 7 positive seeds, 7 negative seeds 5

Corpora ● Three corpora of British English from differing time periods. Corpus Size Time period (millions of words) Lampeter 1 1640-1740 CLMETEV 15 1710-1920 BNC 100 Late 20 th c. 6

Inferring polarity ● Verify that our method for inferring polarity works well on small corpora ● Leave-one-out experiment – Classify each seed word with frequency greater than 5 using all others as seeds – Performance metric: Accuracy over all words, and only words with calculated polarity in top 25% 7

Inferring polarity: Results Corpus Accuracy: Accuracy: All top-25% Lampeter 75 88 CLMETEV 80 92 BNC 82 94 ● Most frequent class baseline: 55% 8

Historical data ● Small dataset of ameliorations and pejorations – T aken from texts on semantic change, dictionaries, and Shakespearean plays – Underwent change in (roughly) 18 th c. – 6 ameliorations, 2 pejorations ● Compare calculated change in polarity (Lampeter to CLMETEV) to change indicated by resources 9

Historical data: Results Expression Change identified Calculated from resources change in polarity ambition amelioration 0.52 eager amelioration 0.97 fond amelioration 0.07 luxury amelioration 1.49 nice amelioration 2.84 succeed amelioration -0.75 artful pejoration -1.71 plainness pejoration -0.61 10

Artificial data ● Suppose good in one corpus and bad in another were in fact the same word – Similar to WSD evaluations using artificial words – Requires choosing pairs of words ● Instead compare average polarity of all positive words in one corpus to that of all negative words in another 11

Artificial data: Results Polarity in lexicon Average polarity in corpus Lampeter CLMETEV BNC Positive 0.58 0.50 0.40 Negative -0.74 -0.67 -0.76 12

Hunting new senses ● Hypothesis: Words with largest change in polarity between two corpora have undergone amelioration or pejoration ● Identify candidate ameliorations and pejorations – 10 largest increases/decreases in polarity from CLMETEV to BNC 13

Usage extraction ● For each candidate extract 10 random usages (or as many as are available) from each corpus – Extract the sentence containing each usage ● Randomly pair each usage from CLMETEV with a usage from BNC 14

Usage annotation ● Use Amazon Mechanical T urk to obtain judgements ● Present turkers with pairs of usages ● T urkers judge which usage is more positive/negative (or if usages are equally positive) ● 10 independent judgements per pair 15

Hunting new senses: Results Candidate type Proportion of judgements for corpus of more positive usage CLMETEV BNC Neither (earlier) (later) Ameliorations 0.28 0.34 0.37 Pejorations 0.36 0.27 0.36 16

Noisy seed words ● Seed words may undergo amelioration and pejoration! ● Randomly change polarity of n% of positive and negative seeds – E.g., good is negative, bad is positive ● Repeat experiment on inferring synchronic polarity 17

Noisy seed words: Results 18

Conclusions ● First computational study focusing on amelioration and pejoration – Encouraging results identifying historical and artificial ameliorations and pejorations ● Future work: – More extensive evaluation – Methods for identifying semantic change and dialectal variation in word usage 19

Thank you ● We thank the following organizations for financially supporting this research – The Natural Sciences and Engineering Research Council of Canada – The University of T oronto – The Dictionary Society of North America 20

Automatically identifying changes in the semantic orientation of - PowerPoint PPT Presentation

Automatically identifying changes in the semantic orientation of words Paul Cook and Suzanne Stevenson University of Toronto Amelioration and pejoration Changes in a word's meaning to have a more positive or negative evaluation

Automatically Identifying Automatically Identifying and Georeferencing Georeferencing and

Semantic Taxonomies Semantic Class Learning from the Web Long-term goal: automatically create

Semantic change : a words meaning changes independently of its form Evidence for semantic

Creating Semantic Mashups: Bridging Web 2.0 and the Semantic Web Jamie Taylor, Colin Evans, Toby

: on the Semantic Web : on the Semantic Web Building a Semantic Prototype for Danish Building a

Semantic Processing Augmenting CFGs Currying Quantifier scope Semantic Grammars L445 / L545

Align, Disambiguate, and Walk A Unified Approach for Measuring Semantic Similarity Semantic

Semantic Similarity MultiJEDI ERC 259234 Semantic Similarity Semantic Similarity Mostly

Module 13 Introduction to Semantic Technology, Ontologies and the Semantic Web Module 13 Outline

Automatically Documenting Program Changes Ray Buse Wes Weimer De ltaDoc 13 May 2011 diffs

Automatically Constructing Semantic Web Services from Online Sources Craig A. Knoblock Jos

Identifying, Finding and Encoding Semantic Relations Christiane Fellbaum Princeton University

One Page Everywhere Fluid, Responsive Design with Semantic.gs The Semantic Grid System Grid

RDF, RDFS and OWL: Graph Data Models for the Semantic Web Semantic Web: The Idea Semantic

Motivation Bootstrapping Semantic Lexicons A semantic lexicon contains semantic category Ex:

Semantic Web 2008 Se a t c eb 008 Semantic Web ca. 2008 S ti W b 2008 Semantic Web

On the Nature and Amelioration of Gender Dysphoria Alley Stoughton Kansas State University

CEE 370 Environmental Engineering Principles Lecture #27 Water Treatment I: Introduction,

CEE 370 Environmental Engineering Principles Lecture #27 Water Treatment I: Introduction,

MQTT Protocol for Real Time GNSS Data and Correction Distribution Precise Positioning Precise

Range Expressions Ameliorate Depictive Sincerity Violations Elizabeth Coppock Assistant

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

DIOGENE A Plant Breeding Software Users Students (Master, Thesis) Confirmed

CLE - Release during COVID July 22, 2020 SIFI | Louisiana and Georgia Louisiana | ICE detention