SLIDE 1

CS 6956: Deep Learning for NLP

Word Embeddings

SLIDE 2

Overview

  • Representing meaning
  • Word embeddings: Early work
  • Word embeddings via language models
  • Word2vec and Glove
  • Evaluating embeddings
  • Design choices and open questions



SLIDE 4

The evaluation problem

  • Suppose we have a way to convert words to vectors

– Pick your favorite method

  • The (sometimes unstated) implication here is that these vectors represent the meaning of words

  • How can we verify this claim?

Thoughts?


SLIDE 5

Using word embeddings

Once we have word embeddings, what can we do with them? Several possibilities:

1. Measure word similarities and distances
   Eg: Cosine similarity of two words A and B, with vectors $\mathbf{b}$ and $\mathbf{c}$:
   $\dfrac{\mathbf{b}^\top \mathbf{c}}{\lVert \mathbf{b} \rVert \, \lVert \mathbf{c} \rVert}$
   Other similarity functions are possible.

2. Use this to find similar words or most dissimilar words
   Eg: Find the odd word among the following: cat, tiger, dog, table
   (Both uses are sketched in code below.)
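A minimal sketch of both uses, assuming only NumPy and a placeholder dict `emb` of random vectors; with real trained embeddings, the odd word out should be "table":

```python
# Minimal sketch: cosine similarity and odd-one-out over a toy embedding dict.
# Random vectors stand in for real embeddings, so the outputs here are arbitrary.
import numpy as np

rng = np.random.default_rng(0)
emb = {w: rng.standard_normal(50) for w in ["cat", "tiger", "dog", "table"]}

def cosine(a, b):
    """Cosine similarity: (a . b) / (||a|| ||b||)."""
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

def odd_one_out(words):
    """Return the word with the lowest average similarity to the others."""
    def avg_sim(w):
        return np.mean([cosine(emb[w], emb[v]) for v in words if v != w])
    return min(words, key=avg_sim)

print(cosine(emb["cat"], emb["dog"]))
print(odd_one_out(["cat", "tiger", "dog", "table"]))
```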

SLIDE 6

Using word embeddings

Once we have word embeddings, what can we do with them? Several possibilities:

3. Document or short snippet similarities
   Question: If we have word vectors, how do we represent documents in the same vector space?
   Several answers; most common: average or add the word embeddings.
   This gives natural definitions for document similarities. (Sketched in code below.)
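A minimal sketch of the averaging approach, again with a placeholder `emb` of random vectors; real use would load pretrained vectors instead:

```python
# Minimal sketch: represent a text as the average of its word vectors and
# compare documents by cosine similarity. Out-of-vocabulary words are skipped.
import numpy as np

DIM = 50
rng = np.random.default_rng(0)
emb = {w: rng.standard_normal(DIM) for w in "the cat sat on a mat dogs chase cats".split()}

def doc_vector(text):
    vecs = [emb[w] for w in text.lower().split() if w in emb]
    return np.mean(vecs, axis=0) if vecs else np.zeros(DIM)

def doc_similarity(t1, t2):
    a, b = doc_vector(t1), doc_vector(t2)
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

print(doc_similarity("the cat sat on a mat", "dogs chase cats"))
```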

SLIDE 7

Two broad families of evaluations

1. Intrinsic evaluation: Evaluate the representation directly, without training another model
   – Typically simple tasks where success or failure is (almost) entirely a function of the representation
   – Easy to compute, but doesn't say much about the embeddings as features

2. Extrinsic evaluation: Evaluate the impact of the representation on another task
   – Typically, a neural network
   – Can be more practically useful, but slow, and depends on the quality of the model for the task being tested

(An extrinsic evaluation is sketched in code below.)
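To make the contrast concrete, here is a minimal, hypothetical sketch of an extrinsic evaluation: the embeddings are averaged into document features, a separate classifier is trained on a downstream task, and what gets reported is that classifier's accuracy. The texts, labels, and random vectors below are toy placeholders, and a logistic regression stands in for the downstream model (often a neural network in practice).

```python
# Toy extrinsic evaluation: embeddings -> document features -> separate classifier.
# Everything here (vocabulary, texts, labels, random vectors) is a placeholder.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
vocab = "good great fine awful bad terrible movie film".split()
emb = {w: rng.standard_normal(50) for w in vocab}

def doc_vector(text):
    vecs = [emb[w] for w in text.lower().split() if w in emb]
    return np.mean(vecs, axis=0) if vecs else np.zeros(50)

texts = ["good movie", "great film", "fine movie",
         "awful film", "bad movie", "terrible film"] * 10
labels = [1, 1, 1, 0, 0, 0] * 10

X = np.stack([doc_vector(t) for t in texts])
X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.5, random_state=0)

clf = LogisticRegression().fit(X_tr, y_tr)
print("downstream accuracy:", clf.score(X_te, y_te))
```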


SLIDE 10

Word Analogies

Given an incomplete analogy of the form a : b :: c : ?, find the word that best fits.

The famous example: King : Queen :: Man : ?

(Intrinsic evaluation example)


SLIDE 12

Word Analogies

Given word embeddings, one way to answer the question "a : b :: c : ?" is

$d = \arg\max_{x} \dfrac{(\mathbf{y}_b - \mathbf{y}_a + \mathbf{y}_c)^\top \mathbf{y}_x}{\lVert \mathbf{y}_b - \mathbf{y}_a + \mathbf{y}_c \rVert}$

That is, if the answer is the word d, then we have $\mathbf{y}_b - \mathbf{y}_a \approx \mathbf{y}_d - \mathbf{y}_c$

(Intrinsic evaluation example. This is not the only way to answer the question: instead of this additive method, we could do something multiplicative. The additive method is sketched in code below.)
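A minimal sketch of the additive method, using a tiny vocabulary and placeholder random vectors; with real embeddings (e.g., word2vec or GloVe), King : Queen :: Man : ? should return "woman":

```python
# Answer a : b :: c : ? by maximizing (y_b - y_a + y_c)^T y_x over candidate words x.
# Vectors are length-normalized, so the score behaves like a cosine similarity.
import numpy as np

rng = np.random.default_rng(0)
vocab = ["king", "queen", "man", "woman", "table"]

def unit(v):
    return v / np.linalg.norm(v)

emb = {w: unit(rng.standard_normal(50)) for w in vocab}  # placeholder embeddings

def analogy(a, b, c):
    target = emb[b] - emb[a] + emb[c]
    candidates = [w for w in vocab if w not in (a, b, c)]  # exclude the query words
    return max(candidates, key=lambda x: target @ emb[x] / np.linalg.norm(target))

print(analogy("king", "queen", "man"))
```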

SLIDE 13

Word analogies data sets

Several standard datasets exist for word analogies

– Some capture syntactic patterns

  • give : giving :: take : ?

– Some capture semantic patterns

  • queen : king :: tigress : ?

– Some require world knowledge

  • Utah : Salt Lake City :: Iowa : ?


SLIDE 14

General trends

  • More data helps with analogy evaluations
  • Skipgram and Glove are typically competitive and top the charts in general
    – But even sparse PMI vectors over the entire vocabulary are not bad! (Sketched in code below.)
  • Very low and very high dimensional vectors don't work
    – Need a sweet spot for best results
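For reference, a minimal sketch of the sparse PMI baseline mentioned above: count co-occurrences within a small window, convert to positive PMI, and treat each row as that word's vector. The toy corpus is a placeholder; a realistic run would use a large corpus and keep the matrix sparse.

```python
# Build a (positive) PMI co-occurrence matrix over a toy corpus; each row of the
# resulting matrix is a high-dimensional vector representing one word.
import numpy as np

corpus = "the cat sat on the mat the dog sat on the rug".split()
window = 2
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}

counts = np.zeros((len(vocab), len(vocab)))
for i, w in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if i != j:
            counts[idx[w], idx[corpus[j]]] += 1

total = counts.sum()
p_joint = counts / total                 # joint probability of (word, context)
p_word = counts.sum(axis=1) / total      # marginal probability of each word
with np.errstate(divide="ignore", invalid="ignore"):
    pmi = np.log(p_joint / np.outer(p_word, p_word))
ppmi = np.where(np.isfinite(pmi), np.maximum(pmi, 0), 0.0)  # clip negatives, drop -inf

print(ppmi[idx["cat"]])  # the PPMI vector for "cat"
```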

SLIDE 15

Word similarity evaluation

  • Another intrinsic evaluation
  • Pairs of words are hand-annotated with similarity scores
  • The goal of the embeddings is to reproduce these scores
    – Or perhaps more reasonably, similar clusterings or rankings as the scores
  • Standard software libraries exist for evaluating embeddings in this fashion (sketched in code below)
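A minimal sketch of such an evaluation, comparing model cosine similarities to human scores with Spearman rank correlation; the pairs, scores, and random vectors are toy placeholders (real evaluations use datasets such as WordSim-353 or SimLex-999):

```python
# Compare model similarities against human similarity judgments using
# Spearman rank correlation. Pairs, scores, and vectors here are toy placeholders.
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
pairs = [("cat", "dog", 7.5), ("cat", "tiger", 7.0),
         ("cup", "table", 2.0), ("king", "queen", 8.0)]
words = {w for a, b, _ in pairs for w in (a, b)}
emb = {w: rng.standard_normal(50) for w in words}

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

model_scores = [cosine(emb[a], emb[b]) for a, b, _ in pairs]
human_scores = [s for _, _, s in pairs]

rho, _ = spearmanr(model_scores, human_scores)
print("Spearman correlation:", rho)
```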