GloVe: Global Vectors for Word Representation Fengyang Zhang, - - PowerPoint PPT Presentation

glove
SMART_READER_LITE
LIVE PREVIEW

GloVe: Global Vectors for Word Representation Fengyang Zhang, - - PowerPoint PPT Presentation

GloVe: Global Vectors for Word Representation Fengyang Zhang, Yutong Wang Presentation Overview 2. GloVe 1. What Model 3. Demo 4. Result is GloVe? Inference What is GloVe? Word Embedding: Word embeddings are in fact a class of


slide-1
SLIDE 1

GloVe:

Global Vectors for Word Representation

Fengyang Zhang, Yutong Wang

slide-2
SLIDE 2

Presentation Overview

  • 1. What

is GloVe?

  • 3. Demo
  • 2. GloVe

Model Inference

  • 4. Result
slide-3
SLIDE 3

What is GloVe?

Word Embedding:

Word embeddings are in fact a class of techniques where individual words are represented as real-valued vectors in a predefined vector space.

slide-4
SLIDE 4

One-hot Word Embedding:

Simple and easy

Relationship

slide-5
SLIDE 5

GloVe

GloVe is essentially a log-bilinear model with a weighted least-squares objective. It is an unsupervised learning algorithm for obtaining vector representations for words, trained on the non-zero entries of a global word-word co-occurrence matrix.

slide-6
SLIDE 6

GloVe: Encoding meaning in vector differences.

Crucial insight: Ratio of co-occurence probabilities can encode meaning. K = solid K = gas K = water K = fashion P(k|ice) large small large small P(k|steam) small large large small P(k|ice)/P(k|steam) large small ~1 ~1

slide-7
SLIDE 7

Inference

The appropriate starting point for word vector learning should be with ratios of co-occurrence probabilities rather than the probabilities themselves.

slide-8
SLIDE 8

Inference

Log-bilinear model: Vector differences:

Think: a = “ice”, b = “steam”

slide-9
SLIDE 9

Inference

  • X - cooccurrence matrix
  • w - word vectors
  • b - bias
  • ŵ - context word vectors
slide-10
SLIDE 10

Demo

slide-11
SLIDE 11

Result

slide-12
SLIDE 12

Result

Word analogy tasks

slide-13
SLIDE 13

Result

Word similarity tasks

slide-14
SLIDE 14

Any questions ?

Thanks!