Unsupervised Methods for NLP WSD

  1. Unsupervised Methods for NLP WSD
     Samuel Brody
     Department of Biomedical Informatics, Columbia University
     samuel.brody@dbmi.columbia.edu
     October 8, 2009

  2. Outline
     1 Introduction - Unsupervised NLP
       The Competition - Supervised Methods
       Colleagues - Human Knowledge
       Unsupervised Learning
     2 Word Sense Disambiguation (WSD)
       Unsupervised Labeling
       Bayesian Sense Induction
     3 Work in Progress - Aspect & Sentiment in Reviews
     4 Conclusion

  5. The Competition - Supervised Machine Learning
     Supervised methods are used for many NLP tasks (parsing, relation extraction, WSD).
     Why?
     + high accuracy with sufficient annotation
     + a full collection of powerful and easy-to-use tools (e.g., SVM, kNN, Maximum Entropy)

  6. The Competition - Supervised Machine Learning
     Why not?
     – annotation is expensive
     – doesn't transfer well between domains and tasks
     – is it a good model for human learning?
       do humans perform singular-value decomposition?
       discriminative rather than generative
       concepts come from the annotation rather than the data

  8. Colleagues - Knowledge Bases
     Many "unsupervised" approaches make use of manually compiled knowledge bases:
     Dictionaries
     Thesauri
     FrameNet
     PropBank

  9. The Problem with Knowledge
     WordNet senses for bank:
     1 river bank
     2 financial institution
     3 bank of earth
     ...
     9 bank building
     10 a flight maneuver
     – lack of coverage
     – no domain/task specificity
     – over-representation of marginal cases
     – based on a specific theory

  10. Colleagues - Scientific Theory
      Linguistic Theory
      Psychology
      Neurology
      Formal Logic

  11. Ignorance = Bliss?
      "Whenever I fire a linguist our system performance improves" - Fred Jelinek
      Why? (see "Some Of My Best Friends Are Linguists" - Fred Jelinek)
      strict models do not allow for "grey" areas
      attempts to cover rare cases lead to excessive complexity
      models do not scale to practical cases

  13. Unsupervised Learning
      Unsupervised techniques offer many tools and insights:
      EM - classification / generalization
      Automatic Alignment - corpus statistics, information theory
      Bayesian Models, LDA - probabilistic view, minimal assumptions
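A toy sketch of the last entry, LDA inducing latent structure from unlabeled text; the talk does not prescribe a toolkit, so scikit-learn, the four-sentence corpus, and the choice of two topics are all illustrative assumptions here.

```python
# A toy sketch (not from the talk): LDA discovering latent topics from
# raw, unlabeled text. Corpus and topic count are assumptions.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "the bank raised interest rates on loans and deposits",
    "the river bank was flooded after heavy rain",
    "depositors moved their money to another bank branch",
    "fishermen sat on the grassy bank of the river",
]

# Bag-of-words counts; no labels are used anywhere.
vectorizer = CountVectorizer(stop_words="english")
counts = vectorizer.fit_transform(docs)

# Two latent topics, loosely the two uses of "bank" in this toy data.
lda = LatentDirichletAllocation(n_components=2, random_state=0)
lda.fit(counts)

vocab = vectorizer.get_feature_names_out()
for k, weights in enumerate(lda.components_):
    top = [vocab[i] for i in weights.argsort()[-5:][::-1]]
    print(f"topic {k}: {top}")
```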

  14. Competition & Colleagues
      We can still benefit from:
      insights and tools from supervised learning
      careful use of knowledge bases
      aspects of scientific theory

  17. Good Senses Make Good Neighbors:
      Exploiting Distributional Similarity for Unsupervised WSD
      Brody and Lapata (2008)

  18. Motivation
      Supervised WSD
      The most accurate WSD systems to date are supervised: they rely on sense-labeled training data to train standard classifiers.
      – Acquiring sufficient labeled data is very expensive.
      – Limits use in new domains and languages.
      – Makes supervised WSD infeasible for many applications.
      Unsupervised WSD
      + Independent of labeled data.
      + Most promising solution for large-scale use.
      – Much less accurate than supervised methods.

  19. Solution
      The Idea: Automatic Labeling
      go directly to the data
      replace manual annotation
      retain the use of supervised classifiers

  20. Prev. Approach - Linguistic Knowledge
      Synonyms from a Lexical Resource (Leacock et al., 1998; Mihalcea, 2002)
      Obtain synonymous / related words for each sense.
      Search a large corpus / the web for the synonyms.
      Find good sense indicators from the retrieved contexts.
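A minimal sketch of the first two steps, assuming WordNet via NLTK as the lexical resource; using raw lemma names as the "related words" is an illustrative simplification of what the cited systems do.

```python
# Step 1 of the approach above: pull related words for each sense of a
# target from WordNet. Requires a one-time nltk.download('wordnet').
from nltk.corpus import wordnet as wn

target = "sense"
for synset in wn.synsets(target, pos=wn.NOUN):
    related = [l.name() for l in synset.lemmas() if l.name() != target]
    print(synset.name(), "-", synset.definition())
    print("   related words:", related)

# Step 2 would then search a large corpus for these related words and
# treat their contexts as evidence for the corresponding sense.
```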

  21. Example
      WordNet senses for the word "sense":
      1 A general conscious awareness (e.g., a sense of security)
      2 The meaning of a word (e.g., The dictionary gave several senses for the word)
      3 Sound practical judgment (e.g., I can't see the sense in doing it now)
      4 A natural appreciation or ability (e.g., keen musical sense)

  22. Using WordNet
      Semantic Neighbors from WordNet
      Neighbors of awareness: sentience, sensation, sensitivity, sensitiveness, sensibility, modality, module, knowingness, ...
      Neighbors of meaning: signified, acceptation, signification, significance, meaning, import, symbolization, symbolisation, ...
      Neighbors of judgment: gumption, logic, sagacity, judgment, judgement, discernment, prudence, judiciousness, eye, ...
      Neighbors of ability: hold, grasp, appreciation
      few exact synonyms
      many related words
      neighbors are not "substitutable"
      neighbors are themselves polysemous

  23. Neighbor Polysemy
      Monosemous Semantic Neighbors
      Neighbors of awareness: cognisance, self-awareness
      Neighbors of meaning: signified, signification, nuance, moral, intention
      greatly reduced number of neighbors
      no monosemous neighbors for the last two senses
      neighbors may be rare
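The monosemy filter is simple to state: keep only neighbors with a single WordNet sense, so that their occurrences can be labeled unambiguously. A sketch, with a hypothetical starting neighbor list:

```python
# Keep only neighbors that are monosemous in WordNet. The starting
# neighbor list below is hypothetical, for illustration only.
from nltk.corpus import wordnet as wn

neighbors = ["sensation", "sensitivity", "self-awareness", "cognisance"]

def is_monosemous(word: str) -> bool:
    """True if the word has a single WordNet sense across all parts of speech."""
    return len(wn.synsets(word)) == 1

print([w for w in neighbors if is_monosemous(w)])
```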

  24. Our Approach
      Distributional Neighbors
      Extension of McCarthy et al. (2004).
      Based on distributional similarity: words are related if used in similar contexts.
      Uses semantic similarity to associate neighbors with senses.
      Method Advantages
      + relates directly to context cues
      + domain specific
      + polysemy restricted by similarity
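A toy sketch of the underlying idea: represent each word by counts of the words around it and compare the vectors by cosine. The three-sentence corpus and the window size are assumptions; the actual system derives neighbors from syntactic contexts in a large parsed corpus, so this shows the idea rather than the method.

```python
# Distributional similarity in miniature: words are neighbors if their
# co-occurrence vectors point in similar directions.
from collections import Counter, defaultdict
from math import sqrt

corpus = [
    "a growing sense of awareness and feeling swept the crowd".split(),
    "the awareness and feeling of panic grew into a sensation".split(),
    "the meaning and significance of the word changed".split(),
]

WINDOW = 2  # assumed context-window size
vectors = defaultdict(Counter)
for sent in corpus:
    for i, w in enumerate(sent):
        for j in range(max(0, i - WINDOW), min(len(sent), i + WINDOW + 1)):
            if j != i:
                vectors[w][sent[j]] += 1

def cosine(u, v):
    dot = sum(u[k] * v[k] for k in u)
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0

# Rank candidate neighbors of "awareness" by contextual similarity.
target = "awareness"
scores = {w: cosine(vectors[target], vectors[w]) for w in vectors if w != target}
for w, s in sorted(scores.items(), key=lambda kv: -kv[1])[:5]:
    print(f"{w:12s} {s:.2f}")
```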

  25. Using Statistics
      Distributional Neighbors
      Neighbors of awareness: awareness, feeling, instinct, enthusiasm, sensation, vision, tradition, consciousness, anger, panic, loyalty
      Neighbors of meaning: emotion, belief, meaning, manner, necessity, tension, motivation
      No neighbors for the last two senses: they are not prevalent in the corpus (corroborated by the test data).

  26. Associating Neighbors and Senses
      Neighbors from a lexical resource are already associated with senses; distributional neighbors are not.
      Use semantic similarity on the knowledge base (WordNet::Similarity, Pedersen et al., 2004).
      Choose the target sense most similar to any sense of the neighbor.
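A sketch of this association step. The talk cites WordNet::Similarity, a Perl package; as a stand-in, the version below uses NLTK's path similarity, an analogous WordNet-based measure, and the word pair is chosen for illustration.

```python
# For a distributional neighbor, pick the target sense most similar to
# ANY sense of that neighbor. Path similarity substitutes here for the
# WordNet::Similarity measures used in the original work.
from nltk.corpus import wordnet as wn

def associate(target, neighbor):
    """Return the target synset most similar to any synset of the neighbor."""
    best, best_score = None, -1.0
    for t in wn.synsets(target, pos=wn.NOUN):
        for n in wn.synsets(neighbor, pos=wn.NOUN):
            score = t.path_similarity(n) or 0.0
            if score > best_score:
                best, best_score = t, score
    return best, best_score

synset, score = associate("sense", "feeling")
print(synset.name(), "-", synset.definition(), f"(score {score:.2f})")
```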

  27. Methodology
      1 Acquire "neighbors" - words related to (a sense of) the target
      2 Extract instances of the neighbors from a large corpus
      3 Label the instances with the associated sense
      4 Use the labeled data to train a supervised classifier
      "... an attempt to state the meaning of a word"
      becomes
      "... an attempt to state the sense (s#2) of a word."
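A compressed sketch of steps 1-4: occurrences of a sense's neighbors become pseudo-labeled training examples for the target word, and a standard classifier is trained on them. The mini-corpus, the neighbor-to-sense table, and the choice of classifier are all assumptions for illustration.

```python
# Pseudo-labeled WSD training in miniature, following the four steps
# above. Everything hard-coded here is a stand-in for corpus-scale data.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Step 1: neighbors already associated with senses of "sense".
neighbors = {"awareness": "s#1", "meaning": "s#2", "judgment": "s#3"}

# Steps 2-3: sentences containing a neighbor become labeled instances,
# with the neighbor's context treated as if it were the target's.
corpus = [
    "a heightened awareness of danger filled the room",
    "an attempt to state the meaning of a word",
    "the dictionary lists the meaning of each entry",
    "she showed sound judgment in the negotiation",
]
X, y = [], []
for sent in corpus:
    for word, sense in neighbors.items():
        if word in sent.split():
            X.append(sent.replace(word, "sense"))
            y.append(sense)

# Step 4: train an off-the-shelf supervised classifier on the result.
clf = make_pipeline(CountVectorizer(), LogisticRegression(max_iter=1000))
clf.fit(X, y)
print(clf.predict(["I can't see the sense in doing it now"]))
```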
