Vector Space Models
- Prof. Sameer Singh
CS 295: STATISTICAL NLP WINTER 2017
January 19, 2017
Based on slides from Jacob Eisenstein, Noah Smith, Mohit Bansal, Richard Socher, and everyone else they copied from.
Vector Space Models Prof. Sameer Singh CS 295: STATISTICAL NLP - - PowerPoint PPT Presentation
Vector Space Models Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 19, 2017 Based on slides from Jacob Eisenstein, Noah Smith, Mohit Bansal, Richard Socher, and everyone else they copied from. Outline Latent Semantic Analysis
January 19, 2017
Based on slides from Jacob Eisenstein, Noah Smith, Mohit Bansal, Richard Socher, and everyone else they copied from.
CS 295: STATISTICAL NLP (WINTER 2017) 2
CS 295: STATISTICAL NLP (WINTER 2017) 3
CS 295: STATISTICAL NLP (WINTER 2017) 4
c1: Human machine interface for ABC computer applications c2: A survey of user opinion of computer system response time c3: The EPS user interface management system c4: System and human system engineering testing of EPS c5: Relation of user perceived response time to error measurement m1: The generation of random, binary, ordered trees m2: The intersection graph of paths in trees m3: Graph minors IV: Widths of trees and well-quasi-ordering m4: Graph minors: A survey
From http://lsa.colorado.edu/papers/dp1.LSAintro.pdf
CS 295: STATISTICAL NLP (WINTER 2017) 5 c1 c2 c3 c4 c5 m1 m2 m3 m4
human interface computer user system response time EPS survey trees graph minors
CS 295: STATISTICAL NLP (WINTER 2017) 6
c1: Human machine interface for ABC computer applications c2: A survey of user opinion of computer system response time m4: Graph minors: A survey
CS 295: STATISTICAL NLP (WINTER 2017) 7
c1 c2 c3 c4 c5 m1 m2 m3 m4
c1 c2 c3 c4 c5 m1 m2 m3 m4
CS 295: STATISTICAL NLP (WINTER 2017) 8
CS 295: STATISTICAL NLP (WINTER 2017) 9
CS 295: STATISTICAL NLP (WINTER 2017) 10
human interface computer user system response time EPS survey trees graph minors
c1 c2 c3 c4 c5 m1 m2 m3 m4
CS 295: STATISTICAL NLP (WINTER 2017) 11
CS 295: STATISTICAL NLP (WINTER 2017) 12
human interface computer user system response time EPS survey trees graph minors
CS 295: STATISTICAL NLP (WINTER 2017) 13
CS 295: STATISTICAL NLP (WINTER 2017) 14
CS 295: STATISTICAL NLP (WINTER 2017) 15
A bottle of tezguino is on the table. Everybody likes tezguino. Tezguino makes you drunk. We make tezguino out of corn.
What does tezguino mean? Loud, motor oil, tortillas, choices, wine You shall know a word by the company keeps. (Firth, 1957)
CS 295: STATISTICAL NLP (WINTER 2017) 16
C1: A bottle of ______ is on the table. C2: Everybody likes ______. C3: _____ makes you drunk. C4: We make _____ out of corn.
tezguino loud motor oil tortillas choices wine C1 C2 C3 C4
CS 295: STATISTICAL NLP (WINTER 2017) 17
Can be anything you want!
A bottle of tezguino is on the table. Tezguino makes you drunk. … I had a fancy bottle of wine and got drunk last night! The terrible wine is on the table.
CS 295: STATISTICAL NLP (WINTER 2017) 18
Can be anything you want!
A bottle of tezguino is on the table. Tezguino makes you drunk. … I had a fancy bottle of wine and got drunk last night! The terrible wine is on the table.
tezguino wine C1 C2 C3 C4
CS 295: STATISTICAL NLP (WINTER 2017) 19
Can be anything you want!
A bottle of tezguino is on the table. Tezguino makes you drunk. … I had a fancy bottle of wine and got drunk last night! The terrible wine is on the table.
tezguino wine
bottle-of is-of makes-you and-got the-terrible is-on
CS 295: STATISTICAL NLP (WINTER 2017) 20
Can be anything you want!
A bottle of tezguino is on the table. Tezguino makes you drunk. … I had a fancy bottle of wine and got drunk last night! The terrible wine is on the table.
tezguino wine
bottle table you drunk fancy night terrible
CS 295: STATISTICAL NLP (WINTER 2017) 21
Can be anything you want!
A bottle of tezguino is on the table. Tezguino makes you drunk. … I had a fancy bottle of wine and got drunk last night! The terrible wine is on the table.
tezguino table bottle drunk wine
D1 D2 D3 D4
CS 295: STATISTICAL NLP (WINTER 2017) 22
Raw counts are not good
PMI(w,c)
CS 295: STATISTICAL NLP (WINTER 2017) 23
CS 295: STATISTICAL NLP (WINTER 2017) 24
CS 295: STATISTICAL NLP (WINTER 2017) 25
CS 295: STATISTICAL NLP (WINTER 2017) 26
CS 295: STATISTICAL NLP (WINTER 2017) 27
http://www.cs.cmu.edu/~ark/TweetNLP/cluster_viewer.html
CS 295: STATISTICAL NLP (WINTER 2017) 28
CS 295: STATISTICAL NLP (WINTER 2017) 29
CS 295: STATISTICAL NLP (WINTER 2017) 30
Computational Complexity
“One shot”
CS 295: STATISTICAL NLP (WINTER 2017) 31
CS 295: STATISTICAL NLP (WINTER 2017) 32
A bottle of tezguino is on the table. u v
CS 295: STATISTICAL NLP (WINTER 2017) 33
CS 295: STATISTICAL NLP (WINTER 2017) 34
CS 295: STATISTICAL NLP (WINTER 2017) 35 https://siddhant7.github.io/Vector-Representation-of-Words/
CS 295: STATISTICAL NLP (WINTER 2017) 36 https://siddhant7.github.io/Vector-Representation-of-Words/
King - male + female queen male : female :: King : queen
CS 295: STATISTICAL NLP (WINTER 2017) 37 https://siddhant7.github.io/Vector-Representation-of-Words/
swimming – walking + walked swam walking : walked :: swimming : swam
CS 295: STATISTICAL NLP (WINTER 2017) 38 https://siddhant7.github.io/Vector-Representation-of-Words/
Country – Capital + Spain Madrid
CS 295: STATISTICAL NLP (WINTER 2017) 39
Homework
Project