Topic Models for Word Sense Disambiguation and Token-based Idiom - PowerPoint PPT Presentation

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Topic Models for Word Sense Disambiguation and Token-based Idiom Detection Linlin Li, Benjamin Roth and Caroline Sporleder Cluster of Excellence, MMCI Saarland University, Germany ACL 2010

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion What is Sense Disambiguation? Words

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion What is Sense Disambiguation? Words bank?

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion What is Sense Disambiguation? Phrases

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion What is Sense Disambiguation? Phrases spill the beans?

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Overview context( c ) Target? SDM

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Overview context( c ) sense paraphrase 1 sense paraphrase 2 Target? sense paraphrase i sense paraphrase n SDM

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Overview context( c ) Target? p(s|c) sense paraphrase i SDM

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion A Topic Model PLSA (Hofmann, 1999) � p ( w | d ) = p ( z | d ) p ( w | z ) z A generative model, decompose the conditional probability word-document distribution p(w|d) into a word-topic distribution p(w|z) and a topic-document distribution p(z|d) Each semantic topic z is represented as a distribution over words p ( w | z ) Each document d is represented as a distribution over semantic topics p ( z | d ) Bayesian version, LDA (Blei et al., 2003) Gibbs Sampling (Griffiths and Steyvers, 2004)

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Latent Topics for Sense Disambiguation Basic Idea Find the sense which maximizes the conditional probability of senses given a context s = arg max p ( s i | c ) s i This conditional probability is decomposed by incorporating a hidden variable z

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Latent Topics for Sense Disambiguation Basic Idea Find the sense which maximizes the conditional probability of senses given a context s = arg max p ( s i | c ) s i This conditional probability is decomposed by incorporating a hidden variable z More about the sense disambiguation model... A sense ( s i ) is represented as a sense paraphrase that captures (some aspect of) the meaning of the sense.

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Latent Topics for Sense Disambiguation Basic Idea Find the sense which maximizes the conditional probability of senses given a context s = arg max p ( s i | c ) s i This conditional probability is decomposed by incorporating a hidden variable z More about the sense disambiguation model... A sense ( s i ) is represented as a sense paraphrase that captures (some aspect of) the meaning of the sense. These paraphrases can be taken from existing resource such as WordNet (WSD tasks) or supplied by users (idiom task)

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Latent Topics for Sense Disambiguation Basic Idea Find the sense which maximizes the conditional probability of senses given a context s = arg max p ( s i | c ) s i This conditional probability is decomposed by incorporating a hidden variable z More about the sense disambiguation model... A sense ( s i ) is represented as a sense paraphrase that captures (some aspect of) the meaning of the sense. These paraphrases can be taken from existing resource such as WordNet (WSD tasks) or supplied by users (idiom task) We proposed three models of how to incorporate the topic hidden variable

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Model I Contexts and senses paraphrases are both treated as documents s = arg max p ( ds i | dc ) ds i

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Model I Contexts and senses paraphrases are both treated as documents s = arg max p ( ds i | dc ) ds i Assume ds is conditionally independent of dc , given z � p ( ds | dc ) = p ( z | dc ) p ( ds | z ) z

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Model I Contexts and senses paraphrases are both treated as documents s = arg max p ( ds i | dc ) ds i Assume ds is conditionally independent of dc , given z � p ( ds | dc ) = p ( z | dc ) p ( ds | z ) z No direct estimation of p ( ds | z ) p ( z | dc ) p ( z | ds ) � p ( ds | dc ) = p ( ds ) p ( z ) z

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Model I Use prior sense information p ( s ) to approximate p ( ds ) p ( z | dc ) p ( z | ds ) � p ( ds | dc ) ≈ p ( s ) p ( z ) z The sense distribution in real corpus is often highly skewed (McCarthy, 2009) p ( s ) can be taken from existing resource (e.g., sense frequency given in WordNet) Assume topic distribution is uniform � p ( ds | dc ) ∝ p ( s ) p ( z | dc ) p ( z | ds ) z

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Inference The test set and sense paraphrase set are relatively small. Estimate topics from a very large corpus (a Wikipedia dump), with broad thematic diversity and vocabulary coverage. Represent sense paraphrase documents and context documents by topics p ( z | dc ) , p ( z | ds ) .

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Model II In case no prior sense information is available � p ( ds | dc ) ∝ p(s) p ( z | dc ) p ( z | ds ) z Vector-space model on inferred topic frequency statistics v ( z | d ) Maximizing the cosine value of two document vectors cos ( ds , dc ) arg max cos ( v ( z | dc ) , v ( z | ds i )) ds i

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion The Sense Disambiguation Model Model III Sometimes, a sense paraphrase is chracterized only by one typical, strongly connected word Consider sense paraphrase ds as a collection of conditionally independent words, given context documents � p ( ds | dc ) = p ( w i | dc ) w i ∈ ds Take the maximum instead of the product "rock the boat" → {"break the norm", "cause trouble"} p("break the norm, cause trouble"|dc), very strong requirement p("norm"|dc) OR p("trouble"|dc) ⇒ idiomatic sense Model III: � { max p ( w i | z ) p ( z | dc ) } arg max w i ∈ qs j qs j z

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Data Coarse-grained WSD SemEval-2007 Task-07 benchmark dataset (Navigli et al., 2009) Sense categories were obtained by clustering senses from WordNet 2.1 sense inventory (Navigli, 2006) Fine-grained WSD SemEval-2007 Task-17 dataset (Pradhan et al., 2009) The sense inventory is from WordNet 2.1 Idiom Sense Disambiguation The idiom dataset (Sporleder and Li, 2009) 3964 instances of 17 potential English idiomatic expressions, manually annotated as literal or idiomatic

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Sense Paraphrases WSD Tasks The word forms, glosses and example sentences of the sense synset the reference synsets (excluding hypernym) Idiom Task Paraphrases the nonliteral meaning from several online idiom dictionaries e.g., rock the boat → {"break the norm", "cause trouble"} For the literal sense, we use 2-3 manually selected words e.g., break the ice → {"ice", "water", "snow"}

Topic Models for Word Sense Disambiguation and Token-based Idiom - PowerPoint PPT Presentation

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Topic Models for Word Sense Disambiguation and Token-based Idiom Detection Linlin Li, Benjamin Roth and Caroline Sporleder Cluster of Excellence, MMCI

Word Sense Word Sense Word Sense Disambiguation Disambiguation Disambiguation Presented by

Word Sense Disambiguation Word Sense Disambiguation (WSD) Given A

Word Meaning & Word Sense Disambiguation CMSC 723 / LING 723 / INST 725 M ARINE C ARPUAT

Word Sense Disambiguation WORD SENSE DISAMBIGUATION Homonymy and Polysemy As we have seen,

WSD Word Sense Disambiguation: Determine from context (or otherwise) what Word Sense

Virtual Student Orientation Information for Families SLIDESMANIA.COM TOPIC TOPIC TOPIC TOPIC

ConnectHome ConnectHome Topic 2 Topic 2 Nation Webinar Nation Webinar Topic 3 Topic 3 Topic

Semantics Avalanche: Word Sense Disambiguation, Dependency Parsing, Semantic Role Labeling/Verb

Final Projects Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison

Similarity-based Word Sense Disambiguation Yael Karov Shimon Edelman Weizmann Institute MIT

Word Sense Disambiguation Unsupervised WSD Modern WSD L645 / B659 (Some material from Jurafsky

Word Sense Disambiguation for Ontological Document Classification Speaker: Georgiana Ifrim

Semantics Avalanche: Word Sense Disambiguation, Dependency Parsing, Semantic Role

Natural Language Processing: Word Sense Disambiguation Roman Kern <rkern@tugraz.at>

Data-driven sense induction for disambiguation and lexical selection in translation Marianna

PIV Token Issuance PIV Token Issuance Ketan Mehta Mehta_Ketan@bah.com October 6, 2004 1

Combining Probabilistic and Translation- Based Models for Information Retrieval based on Word

Experiments on Active Learning for Croatian Word Sense Disambiguation c and Jan Domagoj

Online: Unit Testjng Michael Meeks <michael.meeks@collabora.com> mmeeks / irc.freenode.net

Lexical Semantics & WSD Ling571 Deep Processing Techniques for NLP February 24, 2016

INF4080 2020 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lnning 2 (Mostly Text)

Computational Semantics and Pragmatics Autumn 2011 Raquel Fernndez Institute for Logic,

Identifying Generic Expressions Nils Reiter and Anette Frank Department of Computational

Word Senses Polysemy: many meanings The book uses aspect in these senses Informal

Topic Models for Word Sense Disambiguation and Token-based Idiom - PowerPoint PPT Presentation

Introduction The Sense Disambiguation Model Experimental Setup Experiments Conclusion Topic Models for Word Sense Disambiguation and Token-based Idiom Detection Linlin Li, Benjamin Roth and Caroline Sporleder Cluster of Excellence, MMCI

Word Sense Word Sense Word Sense Disambiguation Disambiguation Disambiguation Presented by

Word Sense Disambiguation Word Sense Disambiguation (WSD) Given A

Word Meaning &amp; Word Sense Disambiguation CMSC 723 / LING 723 / INST 725 M ARINE C ARPUAT

Word Sense Disambiguation WORD SENSE DISAMBIGUATION Homonymy and Polysemy As we have seen,

WSD Word Sense Disambiguation: Determine from context (or otherwise) what Word Sense

Virtual Student Orientation Information for Families SLIDESMANIA.COM TOPIC TOPIC TOPIC TOPIC

ConnectHome ConnectHome Topic 2 Topic 2 Nation Webinar Nation Webinar Topic 3 Topic 3 Topic

Semantics Avalanche: Word Sense Disambiguation, Dependency Parsing, Semantic Role Labeling/Verb

Final Projects Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison

Similarity-based Word Sense Disambiguation Yael Karov Shimon Edelman Weizmann Institute MIT

Word Sense Disambiguation Unsupervised WSD Modern WSD L645 / B659 (Some material from Jurafsky

Word Sense Disambiguation for Ontological Document Classification Speaker: Georgiana Ifrim

Semantics Avalanche: Word Sense Disambiguation, Dependency Parsing, Semantic Role

Natural Language Processing: Word Sense Disambiguation Roman Kern &lt;rkern@tugraz.at&gt;

Data-driven sense induction for disambiguation and lexical selection in translation Marianna

PIV Token Issuance PIV Token Issuance Ketan Mehta Mehta_Ketan@bah.com October 6, 2004 1

Combining Probabilistic and Translation- Based Models for Information Retrieval based on Word

Experiments on Active Learning for Croatian Word Sense Disambiguation c and Jan Domagoj

Online: Unit Testjng Michael Meeks &lt;michael.meeks@collabora.com&gt; mmeeks / irc.freenode.net

Lexical Semantics &amp; WSD Ling571 Deep Processing Techniques for NLP February 24, 2016

INF4080 2020 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lnning 2 (Mostly Text)

Computational Semantics and Pragmatics Autumn 2011 Raquel Fernndez Institute for Logic,

Identifying Generic Expressions Nils Reiter and Anette Frank Department of Computational

Word Senses Polysemy: many meanings The book uses aspect in these senses Informal

Word Meaning & Word Sense Disambiguation CMSC 723 / LING 723 / INST 725 M ARINE C ARPUAT

Natural Language Processing: Word Sense Disambiguation Roman Kern <rkern@tugraz.at>

Online: Unit Testjng Michael Meeks <michael.meeks@collabora.com> mmeeks / irc.freenode.net

Lexical Semantics & WSD Ling571 Deep Processing Techniques for NLP February 24, 2016