Word Embeddings Luke Zettlemoyer (Slides adapted from Danqi Chen, - - PowerPoint PPT Presentation



slide-1
SLIDE 1

CSEP 517 Natural Language Processing

Word Embeddings

Luke Zettlemoyer

(Slides adapted from Danqi Chen, Greg Durrett, Chris Manning, Dan Jurafsky)

slide-2
SLIDE 2

How to represent words?

N-gram language models

It is 76 F and ___.

P(w ∣ it is 76 F and) = [0.0001, 0.1, 0, 0, 0.002, …, 0.3, …, 0]

(labeled entries: red, sunny)

Text classification

I like this movie. 👍

I don’t like this movie. 👎

w(1) = [0, 1, 0, 0, 0, …, 1, …, 1]

w(2) = [0, 1, 0, 1, 0, …, 1, …, 1] (the extra 1 is the entry for “don’t”)

P(y = 1 ∣ x) = σ(θ⊺w + b)
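
A minimal sketch of the text-classification setup on this slide: each sentence becomes a 0/1 bag-of-words vector w, and P(y = 1 ∣ x) = σ(θ⊺w + b). The five-word vocabulary, the weights `theta`, and the bias `b` are made-up values for illustration only; they are not from the slides.

```python
import math

def bag_of_words(tokens, vocab):
    """Return a 0/1 vector with a 1 for every vocabulary word present in tokens."""
    present = set(tokens)
    return [1 if word in present else 0 for word in vocab]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, theta, b):
    """P(y = 1 | x) = sigmoid(theta . w + b)."""
    return sigmoid(sum(t * x for t, x in zip(theta, w)) + b)

# Hypothetical vocabulary and hand-picked weights (assumptions, not learned).
vocab = ["i", "like", "this", "movie", "don't"]
theta = [0.0, 2.0, 0.0, 0.0, -4.0]
b = 0.0

w1 = bag_of_words(["i", "like", "this", "movie"], vocab)
w2 = bag_of_words(["i", "don't", "like", "this", "movie"], vocab)
p1 = predict(w1, theta, b)
p2 = predict(w2, theta, b)
print(w1, p1)
print(w2, p2)
```

The two vectors differ in a single position, so the whole difference between the predictions rests on the weight of that one word.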

slide-3
SLIDE 3

Representing words as discrete symbols

In traditional NLP, we regard words as discrete symbols: hotel, conference, motel (a localist representation).

Words can be represented by one-hot vectors:

  • One 1, the rest 0’s

Vector dimension = number of words in vocabulary (e.g., 500,000)

hotel = [0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0]

motel = [0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0]

Challenge: How to compute similarity of two words?
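
To make the challenge concrete, here is a small sketch showing that any two distinct one-hot vectors have dot product 0, so the representation carries no notion of similarity. The helper names `one_hot` and `dot` are illustrative; the indices 11 and 3 match the 16-dimensional hotel and motel vectors above.

```python
def one_hot(index, size):
    """Return a list of `size` zeros with a single 1 at position `index`."""
    vec = [0] * size
    vec[index] = 1
    return vec

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

hotel = one_hot(11, 16)
motel = one_hot(3, 16)

print(dot(hotel, motel))  # distinct words: always 0
print(dot(hotel, hotel))  # a word with itself: 1
```

Every pair of different words is equally (dis)similar under this encoding, which is why hotel and motel look unrelated.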

slide-4
SLIDE 4

Representing words by their context

Distributional hypothesis: words that occur in similar contexts tend to have similar meanings

J. R. Firth, 1957

  • “You shall know a word by the company it keeps”
  • One of the most successful ideas of modern statistical NLP!

These context words will represent banking.

slide-5
SLIDE 5

Distributional hypothesis

“tejuino”

C1: A bottle of ___ is on the table.

C2: Everybody likes ___.

C3: Don’t have ___ before you drive.

C4: We make ___ out of corn.

slide-6
SLIDE 6

Distributional hypothesis

            C1  C2  C3  C4
tejuino      1   1   1   1
loud         0   0   0   0
motor-oil    1   0   0   0
tortillas    0   1   0   1
choices      0   1   0   0
wine         1   1   1   0

C1: A bottle of ___ is on the table.

C2: Everybody likes ___.

C3: Don’t have ___ before you drive.

C4: We make ___ out of corn.

“words that occur in similar contexts tend to have similar meanings”

slide-7
SLIDE 7

Words as vectors

  • We’ll build a new model of meaning focusing on similarity
  • Each word is a vector
  • Similar words are “nearby in space”
  • A first solution: we can just use context vectors to represent the meaning of words!
  • word-word co-occurrence matrix:

slide-8
SLIDE 8

Words as vectors

$$\cos(u, v) = \frac{u \cdot v}{\|u\| \, \|v\|}$$

$$\cos(u, v) = \frac{\sum_{i=1}^{V} u_i v_i}{\sqrt{\sum_{i=1}^{V} u_i^2} \, \sqrt{\sum_{i=1}^{V} v_i^2}}$$

What is the range of cos(·)?
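
A small sketch of cosine similarity applied to the tejuino example. The 0/1 context vectors are one reading of the C1 to C4 table from the earlier slide (which word fits which sentence), so treat them as an assumption; returning 0.0 for an all-zero vector is also a convention chosen here, since the formula is undefined in that case.

```python
import math

def cosine(u, v):
    """cos(u, v) = (u . v) / (|u| |v|); returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Context vectors over (C1, C2, C3, C4); assumed reading of the slide's table.
contexts = {
    "tejuino":   [1, 1, 1, 1],
    "loud":      [0, 0, 0, 0],
    "motor-oil": [1, 0, 0, 0],
    "tortillas": [0, 1, 0, 1],
    "choices":   [0, 1, 0, 0],
    "wine":      [1, 1, 1, 0],
}

similarities = {
    word: cosine(contexts["tejuino"], vec)
    for word, vec in contexts.items()
    if word != "tejuino"
}
for word, sim in sorted(similarities.items(), key=lambda kv: -kv[1]):
    print(f"{word:10s} {sim:.3f}")
```

In general the cosine lies in [-1, 1]; for count vectors, whose entries are all non-negative, it lies in [0, 1].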

slide-9
SLIDE 9

Words as vectors

Problem: not all counts are equal, words can randomly co-occur

  • Solution: re-weight by how likely it is for the two words to co-occur by simple chance
  • PPMI = Positive Pointwise Mutual Information
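
The slide names PPMI without spelling it out. A minimal sketch, assuming the usual definition PPMI(w, c) = max(0, log2(P(w, c) / (P(w) P(c)))) with probabilities estimated from raw co-occurrence counts; the 2×2 count matrix is a made-up toy example.

```python
import math

def ppmi(counts):
    """counts[i][j] = co-occurrence count of word i with context j."""
    total = sum(sum(row) for row in counts)
    row_sums = [sum(row) for row in counts]
    col_sums = [sum(col) for col in zip(*counts)]
    result = []
    for i, row in enumerate(counts):
        out = []
        for j, n in enumerate(row):
            if n == 0:
                out.append(0.0)  # PMI would be -infinity; PPMI clips it to 0
                continue
            # P(w, c) / (P(w) P(c)) simplifies to n * total / (row_sum * col_sum)
            pmi = math.log2(n * total / (row_sums[i] * col_sums[j]))
            out.append(max(0.0, pmi))
        result.append(out)
    return result

toy_counts = [[4, 0],
              [2, 2]]
weights = ppmi(toy_counts)
for row in weights:
    print([round(x, 3) for x in row])
```

Pairs that co-occur less often than chance would predict (here, word 1 with context 0) are clipped to 0, while pairs that co-occur more than chance get a positive weight.
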
slide-10
SLIDE 10

Sparse vs dense vectors

  • Still, the vectors we get from the word-word co-occurrence matrix are sparse (most are 0’s) & long (vocabulary size)
  • Alternative: we want to represent words as short (50-300 dimensional) & dense (real-valued) vectors
  • The focus of this lecture
  • The basis of all the modern NLP systems

slide-11
SLIDE 11

Dense vectors

$\text{employees} = [0.286,\ 0.792,\ -0.177,\ -0.107,\ 10.109,\ -0.542,\ 0.349,\ 0.271,\ 0.487]^\top$
slide-12
SLIDE 12

Why dense vectors?

  • Short vectors are easier to use as features in ML systems
  • Dense vectors may generalize better than storing explicit counts
  • They do better at capturing synonymy
  • w1 co-occurs with “car”, w2 co-occurs with “automobile”

  • Different methods for getting dense vectors:
  • Singular value decomposition (SVD)
  • word2vec and friends: “learn” the vectors!
slide-13
SLIDE 13

Word2vec and friends

(Mikolov et al, 2013): Distributed Representations of Words and Phrases and their Compositionality

slide-14
SLIDE 14

Word2vec

  • Input: a large text corpus, V, d
      • V: a pre-defined vocabulary
      • d: dimension of word vectors (e.g. 300)
      • Text corpora:
          • Wikipedia + Gigaword 5: 6B
          • Twitter: 27B
          • Common Crawl: 840B
  • Output: $f : V \to \mathbb{R}^d$

$v_{\text{cat}} = [-0.224,\ 0.130,\ -0.290,\ 0.276]^\top$

$v_{\text{dog}} = [-0.124,\ 0.430,\ -0.200,\ 0.329]^\top$

$v_{\text{the}} = [0.234,\ 0.266,\ 0.239,\ -0.199]^\top$

$v_{\text{language}} = [0.290,\ -0.441,\ 0.762,\ 0.982]^\top$
slide-15
SLIDE 15

Word2vec

word = “sweden”

slide-16
SLIDE 16

Word2vec

Continuous Bag of Words (CBOW)

Skip-grams

slide-17
SLIDE 17

Skip-gram

  • The idea: we want to use words to predict their context words
  • Context: a fixed window of size 2m
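
A short sketch of how the (center word, context word) training pairs could be enumerated for a window of m words on each side (2m in total). The function name `skipgram_pairs` and the example sentence are illustrative, and dropping window positions that fall outside the text is an assumption the slide does not spell out.

```python
def skipgram_pairs(tokens, m):
    """Return (center, context) pairs using up to m words on each side."""
    pairs = []
    for t, center in enumerate(tokens):
        for j in range(-m, m + 1):
            if j == 0 or not (0 <= t + j < len(tokens)):
                continue  # skip the center itself and out-of-range positions
            pairs.append((center, tokens[t + j]))
    return pairs

tokens = "we make tejuino out of corn".split()
pairs = skipgram_pairs(tokens, m=2)
for center, context in pairs:
    if center == "tejuino":
        print(center, "->", context)
```

Words near the start or end of the text simply get fewer context words under this convention.
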
slide-18
SLIDE 18

Skip-gram

slide-19
SLIDE 19

Skip-gram: objective function

  • For each position $t = 1, 2, \dots, T$, predict context words within context size m, given center word $w_t$:

$$L(\theta) = \prod_{t=1}^{T} \prod_{-m \le j \le m,\ j \ne 0} P(w_{t+j} \mid w_t; \theta)$$

where $\theta$ denotes all the parameters to be optimized.

  • The objective function $J(\theta)$ is the (average) negative log likelihood:

$$J(\theta) = -\frac{1}{T} \log L(\theta) = -\frac{1}{T} \sum_{t=1}^{T} \sum_{-m \le j \le m,\ j \ne 0} \log P(w_{t+j} \mid w_t; \theta)$$
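
A minimal sketch of the double sum in J(θ), written so that any model of P(context ∣ center) can be plugged in. The uniform model below is only a placeholder to exercise the function, not the word2vec model, and skipping positions outside the text is an assumption.

```python
import math

def average_nll(tokens, m, prob):
    """J = -(1/T) * sum over t and over -m <= j <= m, j != 0 of log P(w_{t+j} | w_t).

    prob(context, center) must return a probability in (0, 1].
    """
    T = len(tokens)
    total = 0.0
    for t in range(T):
        for j in range(-m, m + 1):
            if j == 0 or not (0 <= t + j < T):
                continue  # skip the center itself and out-of-range positions
            total += math.log(prob(tokens[t + j], tokens[t]))
    return -total / T

tokens = ["we", "make", "tejuino", "out"]
vocab_size = len(set(tokens))

def uniform(context, center):
    """Placeholder model: every context word is equally likely."""
    return 1.0 / vocab_size

loss = average_nll(tokens, 1, uniform)
print(loss)
```

Note that the sum is divided by the number of positions T, not by the number of (center, context) pairs, which matches the 1/T factor in the formula.
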
slide-20
SLIDE 20

How to define ?

P(wt+j ∣ wt; θ)

  • We have two sets of vectors for each word in the vocabulary

ui ∈ Rd

<latexit sha1_base64="Qsgo7bHXmdt/5AiowyqkJ/9E+0=">ACBnicbVBNS8NAEJ3Ur1q/oh5FWCyCp5KIoMeiF49V7Ae0MWy2m3bpZhN2N0IJOXnxr3jxoIhXf4M3/42btgdtfTDweG+GmXlBwpnSjvNtlZaWV1bXyuVjc2t7R17d6+l4lQS2iQxj2UnwIpyJmhTM81pJ5EURwGn7WB0VfjtByoVi8WdHifUi/BAsJARrI3k24e9COthEGZp7jPUYwJNhSC7ze/7vl1as4EaJG4M1KFGRq+/dXrxySNqNCEY6W6rpNoL8NSM8JpXumliaYjPCAdg0VOKLKyZv5OjYKH0UxtKU0Gi/p7IcKTUOApMZ3GjmvcK8T+vm+rwsuYSFJNBZkuClOdIyKTFCfSUo0HxuCiWTmVkSGWGKiTXIVE4I7/IiaZ3WXKfm3pxV65ezOMpwAEdwAi6cQx2uoQFNIPAIz/AKb9aT9WK9Wx/T1pI1m9mHP7A+fwB1FZkZ</latexit><latexit sha1_base64="Qsgo7bHXmdt/5AiowyqkJ/9E+0=">ACBnicbVBNS8NAEJ3Ur1q/oh5FWCyCp5KIoMeiF49V7Ae0MWy2m3bpZhN2N0IJOXnxr3jxoIhXf4M3/42btgdtfTDweG+GmXlBwpnSjvNtlZaWV1bXyuVjc2t7R17d6+l4lQS2iQxj2UnwIpyJmhTM81pJ5EURwGn7WB0VfjtByoVi8WdHifUi/BAsJARrI3k24e9COthEGZp7jPUYwJNhSC7ze/7vl1as4EaJG4M1KFGRq+/dXrxySNqNCEY6W6rpNoL8NSM8JpXumliaYjPCAdg0VOKLKyZv5OjYKH0UxtKU0Gi/p7IcKTUOApMZ3GjmvcK8T+vm+rwsuYSFJNBZkuClOdIyKTFCfSUo0HxuCiWTmVkSGWGKiTXIVE4I7/IiaZ3WXKfm3pxV65ezOMpwAEdwAi6cQx2uoQFNIPAIz/AKb9aT9WK9Wx/T1pI1m9mHP7A+fwB1FZkZ</latexit><latexit sha1_base64="Qsgo7bHXmdt/5AiowyqkJ/9E+0=">ACBnicbVBNS8NAEJ3Ur1q/oh5FWCyCp5KIoMeiF49V7Ae0MWy2m3bpZhN2N0IJOXnxr3jxoIhXf4M3/42btgdtfTDweG+GmXlBwpnSjvNtlZaWV1bXyuVjc2t7R17d6+l4lQS2iQxj2UnwIpyJmhTM81pJ5EURwGn7WB0VfjtByoVi8WdHifUi/BAsJARrI3k24e9COthEGZp7jPUYwJNhSC7ze/7vl1as4EaJG4M1KFGRq+/dXrxySNqNCEY6W6rpNoL8NSM8JpXumliaYjPCAdg0VOKLKyZv5OjYKH0UxtKU0Gi/p7IcKTUOApMZ3GjmvcK8T+vm+rwsuYSFJNBZkuClOdIyKTFCfSUo0HxuCiWTmVkSGWGKiTXIVE4I7/IiaZ3WXKfm3pxV65ezOMpwAEdwAi6cQx2uoQFNIPAIz/AKb9aT9WK9Wx/T1pI1m9mHP7A+fwB1FZkZ</latexit><latexit 
sha1_base64="Qsgo7bHXmdt/5AiowyqkJ/9E+0=">ACBnicbVBNS8NAEJ3Ur1q/oh5FWCyCp5KIoMeiF49V7Ae0MWy2m3bpZhN2N0IJOXnxr3jxoIhXf4M3/42btgdtfTDweG+GmXlBwpnSjvNtlZaWV1bXyuVjc2t7R17d6+l4lQS2iQxj2UnwIpyJmhTM81pJ5EURwGn7WB0VfjtByoVi8WdHifUi/BAsJARrI3k24e9COthEGZp7jPUYwJNhSC7ze/7vl1as4EaJG4M1KFGRq+/dXrxySNqNCEY6W6rpNoL8NSM8JpXumliaYjPCAdg0VOKLKyZv5OjYKH0UxtKU0Gi/p7IcKTUOApMZ3GjmvcK8T+vm+rwsuYSFJNBZkuClOdIyKTFCfSUo0HxuCiWTmVkSGWGKiTXIVE4I7/IiaZ3WXKfm3pxV65ezOMpwAEdwAi6cQx2uoQFNIPAIz/AKb9aT9WK9Wx/T1pI1m9mHP7A+fwB1FZkZ</latexit>

$u_i \in \mathbb{R}^d$: embedding for target word i

$v_{i'} \in \mathbb{R}^d$: embedding for context word i'

Q: Why two sets of vectors?
  • Use the inner product $u_i \cdot v_{i'}$ to measure how likely word i is to appear with context word i'; the larger, the better.

$$P(w_{t+j} \mid w_t) = \frac{\exp(u_{w_t} \cdot v_{w_{t+j}})}{\sum_{k \in V} \exp(u_{w_t} \cdot v_k)}$$

This is the “softmax” we learned last time!

$\theta = \{\{u_k\}, \{v_k\}\}$ are all the parameters in this model!

Any issues?
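A minimal sketch of the skip-gram softmax in plain Python may help make the question concrete. The embeddings are made-up toy numbers, and the names `dot`, `context_probs`, `U`, and `V` are illustrative rather than taken from the slides. Note that the normalizer adds one term per vocabulary word.

```python
import math

def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def context_probs(u_target, V_ctx):
    """P(k | target) for every vocabulary word k: softmax over u_target . v_k."""
    scores = [dot(u_target, v_k) for v_k in V_ctx]
    top = max(scores)                       # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    z = sum(exps)                           # normalizer: one term per vocabulary word
    return [e / z for e in exps]

# Toy setup (assumed values): 4-word vocabulary, embedding dimension d = 2.
U = [[0.5, 0.1], [0.0, 1.0], [-0.3, 0.2], [1.0, 1.0]]   # target embeddings u_k
V = [[0.2, 0.4], [1.0, 0.0], [0.0, -1.0], [0.5, 0.5]]   # context embeddings v_k

# Distribution over context words given target word 3.
probs = context_probs(U[3], V)
```

With a realistic vocabulary (the deck mentions sizes such as 500,000), every call to `context_probs` touches every context vector, so each training example costs time proportional to the vocabulary size.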

slide-21
SLIDE 21

How to train the model

Calculating all the gradients together!

Q: How many parameters are there in total?

$$J(\theta) = -\frac{1}{T} \sum_{t=1}^{T} \sum_{-m \le j \le m,\, j \ne 0} \log P(w_{t+j} \mid w_t; \theta)$$

$$\nabla_\theta J(\theta) = \,?$$

We can apply stochastic gradient descent (SGD)!

$$\theta^{(t+1)} = \theta^{(t)} - \eta \nabla_\theta J(\theta)$$

$$\theta = \{\{u_k\}, \{v_k\}\}$$
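As a rough illustration of the update rule, the sketch below takes one SGD step on the loss of a single (target, context) pair, i.e. one term $-\log P(w_{t+j} \mid w_t; \theta)$ of the objective rather than the full average. The gradients in the comments follow from differentiating the softmax; the function names, toy embeddings, and learning rate are assumptions made for this example.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def pair_loss(U, V, t, c):
    """-log P(c | t) for one (target t, context c) pair."""
    return -math.log(softmax([dot(U[t], v) for v in V])[c])

def sgd_step(U, V, t, c, lr):
    """One in-place SGD update on the loss of a single (t, c) pair."""
    p = softmax([dot(U[t], v) for v in V])
    u_old = list(U[t])
    d = len(u_old)
    # d loss / d u_t = sum_k p_k v_k - v_c
    grad_u = [sum(p[k] * V[k][i] for k in range(len(V))) - V[c][i]
              for i in range(d)]
    # d loss / d v_k = (p_k - 1[k == c]) u_t, for every vocabulary word k
    for k in range(len(V)):
        coeff = p[k] - (1.0 if k == c else 0.0)
        for i in range(d):
            V[k][i] -= lr * coeff * u_old[i]
    for i in range(d):
        U[t][i] -= lr * grad_u[i]

def num_parameters(vocab_size, dim):
    """Two embedding tables (u_k and v_k), each of shape vocab_size x dim."""
    return 2 * vocab_size * dim

# Toy setup (assumed values): 3-word vocabulary, d = 2.
U = [[0.5, 0.1], [0.0, 1.0], [-0.3, 0.2]]
V = [[0.2, 0.4], [1.0, 0.0], [0.0, -1.0]]

before = pair_loss(U, V, t=0, c=2)
sgd_step(U, V, t=0, c=2, lr=0.1)
after = pair_loss(U, V, t=0, c=2)   # should be smaller than `before`
```

`num_parameters` answers the counting question under the assumption that every word has exactly one $u_k$ and one $v_k$ of dimension d, giving 2 · |V| · d parameters. The update of every $v_k$ in `sgd_step` also shows that a single training pair touches the whole context table.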
slide-22
SLIDE 22

Skip-gram with negative sampling (SGNS)

Idea: recast problem as binary classification!

  • Target word is positive example
  • All words not in context are negative

$$P(D = 1 \mid t, c) = \sigma(u_t \cdot v_c)$$

$$\sigma(x) = \frac{1}{1 + \exp(-x)}$$

To compute loss, pick K random words as negative examples:

$$J(\theta) = -P(D = 1 \mid t, c) - \frac{1}{K} \sum_{i=1}^{K} P(D = 0 \mid t_i, c)$$
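The negative-sampling objective can be sketched in a few lines of plain Python. This follows the slide's form term by term, with $P(D = 0 \mid t_i, c) = 1 - \sigma(u_{t_i} \cdot v_c)$. The uniform sampling of negatives, the toy embeddings, and all function names are assumptions made for illustration.

```python
import math
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def sgns_loss(u_t, v_c, negative_vectors):
    """J = -P(D=1 | t, c) - (1/K) * sum_i P(D=0 | t_i, c) for one (t, c) pair."""
    p_pos = sigmoid(dot(u_t, v_c))                                      # P(D = 1 | t, c)
    p_negs = [1.0 - sigmoid(dot(u_n, v_c)) for u_n in negative_vectors] # P(D = 0 | t_i, c)
    return -p_pos - sum(p_negs) / len(p_negs)

# Toy setup (assumed values): 5-word vocabulary, d = 2.
U = [[0.5, 0.1], [0.0, 1.0], [-0.3, 0.2], [1.0, 1.0], [-1.0, 0.5]]
V = [[0.2, 0.4], [1.0, 0.0], [0.0, -1.0], [0.5, 0.5], [0.3, -0.2]]

t, c, K = 0, 1, 2
rng = random.Random(0)                               # fixed seed for repeatability
candidates = [k for k in range(len(U)) if k != t]    # assumption: sample uniformly, excluding t
neg_ids = rng.sample(candidates, K)
loss = sgns_loss(U[t], V[c], [U[i] for i in neg_ids])
```

Each evaluation touches only K + 1 word vectors instead of the whole vocabulary. The original word2vec formulation applies a logarithm to each of these probability terms; the sketch keeps the form written on the slide.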
slide-23
SLIDE 23

Continuous Bag of Words (CBOW)

L(θ) =

T

Y

t=1

P (wt | {wt+j}, m  j  m, j 6= 0)

<latexit sha1_base64="3+l6Abc63xGDhSVFpwKTAlCK8fU=">ACRHicbZDLThsxFIY9lFKacknbZTdWo0pBhWgGVWo3SKjdNFkAgxWHk8ZxJDLZnap8pikbzcGx4AHY8QTdtEJsUZ3LgtuRLH/6/3N07D8plHQYhlfBwrPF50svl82Xq2srq03X785cHlpBfRErnJ7lHAHShroUQFR4UFrhMFh8npt4l/+Ausk7nZx3EBA82HRmZScPRS3Oz/aDMcAfINukNZYfM0rnAnqo+r/brqMgUZts9ipEzLlLqzLsfT2pWb9ItTb39k57MLr05IeMpZFYOR7hRx81W2AmnR9DNIcWmVc3bl6yNBelBoNCcef6UVjgoOIWpVBQN1jpoODilA+h79FwDW5QTUOo6QevpDTLrT8G6VS9O1Fx7dxYJ75Tcxy5h95EfMrl5h9GVTSFCWCEbNFWako5nSKE2lBYFq7IELK/1bqRhxywX63Bs+hOjhlx/DwXYnCjvR3qfW7td5HMvkHXlP2iQin8ku+U6pEcEOSe/yV/yL7gI/gTXwc2sdSGYz7wl9yq4/Q8lTK+5</latexit><latexit sha1_base64="3+l6Abc63xGDhSVFpwKTAlCK8fU=">ACRHicbZDLThsxFIY9lFKacknbZTdWo0pBhWgGVWo3SKjdNFkAgxWHk8ZxJDLZnap8pikbzcGx4AHY8QTdtEJsUZ3LgtuRLH/6/3N07D8plHQYhlfBwrPF50svl82Xq2srq03X785cHlpBfRErnJ7lHAHShroUQFR4UFrhMFh8npt4l/+Ausk7nZx3EBA82HRmZScPRS3Oz/aDMcAfINukNZYfM0rnAnqo+r/brqMgUZts9ipEzLlLqzLsfT2pWb9ItTb39k57MLr05IeMpZFYOR7hRx81W2AmnR9DNIcWmVc3bl6yNBelBoNCcef6UVjgoOIWpVBQN1jpoODilA+h79FwDW5QTUOo6QevpDTLrT8G6VS9O1Fx7dxYJ75Tcxy5h95EfMrl5h9GVTSFCWCEbNFWako5nSKE2lBYFq7IELK/1bqRhxywX63Bs+hOjhlx/DwXYnCjvR3qfW7td5HMvkHXlP2iQin8ku+U6pEcEOSe/yV/yL7gI/gTXwc2sdSGYz7wl9yq4/Q8lTK+5</latexit><latexit sha1_base64="3+l6Abc63xGDhSVFpwKTAlCK8fU=">ACRHicbZDLThsxFIY9lFKacknbZTdWo0pBhWgGVWo3SKjdNFkAgxWHk8ZxJDLZnap8pikbzcGx4AHY8QTdtEJsUZ3LgtuRLH/6/3N07D8plHQYhlfBwrPF50svl82Xq2srq03X785cHlpBfRErnJ7lHAHShroUQFR4UFrhMFh8npt4l/+Ausk7nZx3EBA82HRmZScPRS3Oz/aDMcAfINukNZYfM0rnAnqo+r/brqMgUZts9ipEzLlLqzLsfT2pWb9ItTb39k57MLr05IeMpZFYOR7hRx81W2AmnR9DNIcWmVc3bl6yNBelBoNCcef6UVjgoOIWpVBQN1jpoODilA+h79FwDW5QTUOo6QevpDTLrT8G6VS9O1Fx7dxYJ75Tcxy5h95EfMrl5h9GVTSFCWCEbNFWako5nSKE2lBYFq7IELK/1bqRhxywX63Bs+hOjhlx/DwXYnCjvR3qfW7td5HMvkHXlP2iQin8ku+U6pEcEOSe/yV/yL7gI/gTXwc2sdSGYz7wl9yq4/Q8lTK+5</latexit><latexit 
sha1_base64="3+l6Abc63xGDhSVFpwKTAlCK8fU=">ACRHicbZDLThsxFIY9lFKacknbZTdWo0pBhWgGVWo3SKjdNFkAgxWHk8ZxJDLZnap8pikbzcGx4AHY8QTdtEJsUZ3LgtuRLH/6/3N07D8plHQYhlfBwrPF50svl82Xq2srq03X785cHlpBfRErnJ7lHAHShroUQFR4UFrhMFh8npt4l/+Ausk7nZx3EBA82HRmZScPRS3Oz/aDMcAfINukNZYfM0rnAnqo+r/brqMgUZts9ipEzLlLqzLsfT2pWb9ItTb39k57MLr05IeMpZFYOR7hRx81W2AmnR9DNIcWmVc3bl6yNBelBoNCcef6UVjgoOIWpVBQN1jpoODilA+h79FwDW5QTUOo6QevpDTLrT8G6VS9O1Fx7dxYJ75Tcxy5h95EfMrl5h9GVTSFCWCEbNFWako5nSKE2lBYFq7IELK/1bqRhxywX63Bs+hOjhlx/DwXYnCjvR3qfW7td5HMvkHXlP2iQin8ku+U6pEcEOSe/yV/yL7gI/gTXwc2sdSGYz7wl9yq4/Q8lTK+5</latexit>

\bar{v}_t = \frac{1}{2m} \sum_{-m \le j \le m,\ j \ne 0} v_{t+j}
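
The averaged context vector can be sketched in a few lines of NumPy. This is a minimal illustration, not the reference implementation: the function name, the matrix V (one row per vocabulary word), and the choice to reject positions whose window does not fit inside the sentence are all assumptions made here.

```python
import numpy as np

def cbow_context_vector(V, token_ids, t, m):
    """Average the vectors of the 2m words around position t, skipping t itself.

    V         : (vocab_size, d) array of word vectors (hypothetical name)
    token_ids : list of vocabulary indices for one sentence
    t, m      : center position and window radius
    """
    # Assumption: only positions with a full window are handled,
    # so the divisor is exactly 2m as in the formula.
    if t - m < 0 or t + m >= len(token_ids):
        raise ValueError("window does not fit inside the sentence")
    context_ids = [token_ids[t + j] for j in range(-m, m + 1) if j != 0]
    return V[context_ids].sum(axis=0) / (2 * m)
```

In practice, windows near sentence boundaries are usually truncated rather than rejected, in which case the divisor becomes the number of context words actually present.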
slide-24
SLIDE 24

GloVe: Global Vectors

(Pennington et al., 2014): GloVe: Global Vectors for Word Representation

  • Use the global co-occurrence statistics X_{i,j}: how often word j appears in the context of word i
  • Faster to train
  • Scalable to very large corpora
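
As a rough sketch of what the co-occurrence statistics are, the snippet below counts how often each pair of words appears within a fixed window. The function name, the window size, and the use of plain unweighted counts are assumptions for illustration; GloVe itself down-weights more distant pairs.

```python
from collections import Counter

def cooccurrence_counts(sentences, window=2):
    """Return X, where X[(i, j)] counts how often word j occurs
    within `window` tokens of word i (symmetric, unweighted)."""
    X = Counter()
    for tokens in sentences:
        for t, center in enumerate(tokens):
            lo = max(0, t - window)
            hi = min(len(tokens), t + window + 1)
            for k in range(lo, hi):
                if k != t:  # a word is not its own context
                    X[(center, tokens[k])] += 1
    return X
```

Because the counts are collected in one pass over the corpus, training afterwards only touches the non-zero entries of X rather than every token occurrence.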
slide-25
SLIDE 25

GloVe: Global Vectors

(Pennington et al., 2014): GloVe: Global Vectors for Word Representation
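
For reference, the weighted least-squares objective from Pennington et al. (2014) is written out below. The symbols follow that paper rather than these slides and are therefore assumptions relative to the deck: w_i and \tilde{w}_j are word and context vectors, b_i and \tilde{b}_j are bias terms, V is the vocabulary size, and f is a weighting function that damps very frequent pairs.

```latex
J = \sum_{i,j=1}^{V} f(X_{i,j}) \left( w_i^{\top} \tilde{w}_j + b_i + \tilde{b}_j - \log X_{i,j} \right)^2,
\qquad
f(x) =
\begin{cases}
  (x / x_{\max})^{\alpha} & \text{if } x < x_{\max} \\
  1 & \text{otherwise}
\end{cases}
```

The paper reports x_max = 100 and α = 3/4. Since f(0) = 0, pairs that never co-occur contribute nothing to the sum.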

slide-26
SLIDE 26

FastText: Sub-Word Embeddings

(Bojanowski et al., 2017): Enriching Word Vectors with Subword Information

  • Similar to skip-gram, but breaks each word into character n-grams with n = 3 to 6

Example: where, written as <where> with the boundary symbols < and >
  3-grams: <wh, whe, her, ere, re>
  4-grams: <whe, wher, here, ere>
  5-grams: <wher, where, here>
  6-grams: <where, where>

  • Replace u_i \cdot v_j by \sum_{g \in \text{n-grams}(w_i)} u_g \cdot v_j
  • More to come! Contextualized word embeddings
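
The n-gram decomposition and the summed score can be sketched as follows. The function names and the dictionary of n-gram vectors are hypothetical, and n-grams without a vector are simply skipped here; this is an illustration of the idea, not fastText's actual code.

```python
import numpy as np

def char_ngrams(word, n_min=3, n_max=6):
    """Character n-grams of `word` after adding the boundary symbols < and >."""
    padded = f"<{word}>"
    return [padded[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(padded) - n + 1)]

def subword_score(word, context_vec, ngram_vecs):
    """Sum of u_g . v_j over the n-grams g of `word`.

    ngram_vecs : dict mapping n-gram string -> vector (hypothetical store);
                 n-grams missing from it are skipped (an assumption).
    """
    return sum(float(ngram_vecs[g] @ context_vec)
               for g in char_ngrams(word) if g in ngram_vecs)
```

Calling char_ngrams("where") reproduces the fourteen n-grams listed on the slide. The released fastText implementation does more than this (for example, it also keeps a vector for the whole word); those parts are omitted.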
slide-27
SLIDE 27

Trained word embeddings available

  • word2vec: https://code.google.com/archive/p/word2vec/
  • GloVe: https://nlp.stanford.edu/projects/glove/
  • FastText: https://fasttext.cc/

Differ in algorithms, text corpora, dimensions, cased/uncased…
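
A minimal sketch of reading such vectors, assuming the plain-text layout used by the GloVe downloads: one word per line followed by its space-separated components. The function name is hypothetical. Other releases may differ (word2vec text files, for instance, start with a header line), so check the format of the file you actually download.

```python
import numpy as np

def load_text_embeddings(lines):
    """Parse lines of the form 'word v1 v2 ... vd' into {word: vector}.

    `lines` can be any iterable of strings, e.g. an open text file.
    """
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:  # skip blank or malformed lines
            continue
        word, values = parts[0], parts[1:]
        vectors[word] = np.asarray([float(x) for x in values], dtype=np.float32)
    return vectors
```

With a file on disk this would be used as `with open(path, encoding="utf-8") as f: emb = load_text_embeddings(f)`, where `path` is whichever file you downloaded.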

slide-28
SLIDE 28

Evaluating Word Embeddings

slide-29
SLIDE 29

Extrinsic vs intrinsic evaluation

Extrinsic evaluation

  • Plug the word embeddings into a real NLP system and see whether this improves performance
  • Could take a long time, but still the most important evaluation metric

[Figure: each word of “I don’t like this movie” is mapped to its embedding, (0.31, −0.28), (0.01, −0.91), (1.87, 0.03), (−3.17, −0.18), (1.23, 1.59), and the vectors are fed to an ML model that predicts a label]

Intrinsic evaluation

  • Evaluate on a specific/intermediate subtask
  • Fast to compute
  • Not clear if it really helps the downstream task
slide-30
SLIDE 30

Intrinsic evaluation

Word similarity

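Example dataset: WordSim-353
353 pairs of words with human similarity judgments

http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/

Cosine similarity: \cos(u_i, u_j) = \frac{u_i \cdot u_j}{\lVert u_i \rVert_2 \, \lVert u_j \rVert_2}

Metric: Spearman rank correlation between the model’s cosine similarities and the human judgments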
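
The evaluation loop can be sketched as below: score each word pair by cosine similarity, then compare the model's ranking with the human ranking. The function names and the (word1, word2, score) tuple layout are assumptions, and the rank computation assumes there are no tied values; a library routine that averages tied ranks would be the safer choice on real data.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def spearman(x, y):
    """Spearman correlation as the Pearson correlation of ranks.
    Assumption: no ties in x or y."""
    rx = np.argsort(np.argsort(x))
    ry = np.argsort(np.argsort(y))
    return float(np.corrcoef(rx, ry)[0, 1])

def evaluate_similarity(emb, pairs):
    """emb: {word: vector}; pairs: iterable of (word1, word2, human_score)."""
    model_scores, human_scores = [], []
    for w1, w2, score in pairs:
        if w1 in emb and w2 in emb:  # skip out-of-vocabulary pairs
            model_scores.append(cosine(emb[w1], emb[w2]))
            human_scores.append(score)
    return spearman(model_scores, human_scores)
```

Spearman correlation only compares rankings, so the model's similarities do not need to be on the same scale as the human scores.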

slide-31
SLIDE 31

Intrinsic evaluation

Word similarity

slide-32
SLIDE 32

Intrinsic evaluation

Word analogy

man : woman :: king : ?

For an analogy a : b :: c : ?, return the word i that maximizes

\arg\max_i \cos(u_i, u_b - u_a + u_c)

semantic: Chicago : Illinois :: Philadelphia : ?

syntactic: bad : worst :: cool : ?

More examples at http://download.tensorflow.org/data/questions-words.txt
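
A small sketch of answering an analogy with the arg max over cos(u_i, u_b − u_a + u_c). The function name is hypothetical, and excluding the three query words from the candidates is an assumption (a common convention, since otherwise one of the inputs often wins).

```python
import numpy as np

def solve_analogy(emb, a, b, c):
    """Return the word i maximizing cos(u_i, u_b - u_a + u_c).

    emb: {word: vector}. Query words a, b, c are excluded (assumption).
    """
    target = emb[b] - emb[a] + emb[c]
    target = target / np.linalg.norm(target)
    best_word, best_score = None, -np.inf
    for word, vec in emb.items():
        if word in (a, b, c):
            continue
        score = float(vec @ target / np.linalg.norm(vec))
        if score > best_score:
            best_word, best_score = word, score
    return best_word
```

Accuracy on an analogy file is then the fraction of lines for which the returned word matches the expected fourth word.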

slide-33
SLIDE 33

What can go wrong with word embeddings?

  • What’s wrong with learning a word’s “meaning” from its usage?

  • What data are we learning from?
  • What are we going to learn from this data?
slide-34
SLIDE 34

What do we mean by bias?

  • Identify the she − he axis in word vector space and project words onto this axis
  • Nearest neighbor of (b − a + c)

Bolukbasi et al. (2016); Manzini et al. (2019)
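
Projecting a word onto the she − he axis can be sketched as a single dot product with the normalized difference vector. The function name and the convention that positive values lean toward "she" are assumptions for illustration.

```python
import numpy as np

def gender_projection(emb, word, pos="she", neg="he"):
    """Scalar projection of `word` onto the unit vector along emb[pos] - emb[neg].

    Positive values lean toward `pos`, negative toward `neg` (a convention
    chosen here, not taken from the slides).
    """
    axis = emb[pos] - emb[neg]
    axis = axis / np.linalg.norm(axis)
    return float(emb[word] @ axis)
```

Sorting occupation words by this value is one simple way to see which ones the embedding places closer to either end of the axis.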

slide-35
SLIDE 35

Debiasing

Bolukbasi et al. (2016)

  • Identify the gender subspace using gendered words
  • Project words onto this subspace
  • Subtract those projections from the original word

[Figure: the vectors for she, he, woman, man, and homemaker, with the debiased vector homemaker′]
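
The project-and-subtract step can be sketched for the simplest case, where the gender subspace is a single direction. This is an assumption made for illustration: the function name is hypothetical, and the procedure in Bolukbasi et al. (2016) estimates the subspace from several word pairs and includes further steps not shown here.

```python
import numpy as np

def remove_direction(vec, direction):
    """Subtract from `vec` its projection onto `direction`.

    Assumption: the bias subspace is one-dimensional. The result is
    orthogonal to `direction`.
    """
    g = direction / np.linalg.norm(direction)
    return vec - (vec @ g) * g
```

For example, with `direction = emb["she"] - emb["he"]`, the output for `emb["homemaker"]` plays the role of homemaker′ in the figure, where `emb` is a hypothetical word-to-vector dictionary.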

slide-36
SLIDE 36

Hardness of Debiasing

Gonen and Goldberg (2019)

  • Not that effective: the male and female words are still clustered together
  • Bias pervades the word embedding space and isn’t just a local property of a few words