Distributed Representations of Sentences and Documents
Authors: QUOC LE, TOMAS MIKOLOV Presenters: Marjan Delpisheh, Nahid Alimohammadi
1
Sentences and Documents Authors: QUOC LE, TOMAS MIKOLOV Presenters: - - PowerPoint PPT Presentation
Distributed Representations of Sentences and Documents Authors: QUOC LE, TOMAS MIKOLOV Presenters: Marjan Delpisheh, Nahid Alimohammadi 1 Outline Objective of the paper Related works Algorithms Limitations and advantages
Authors: QUOC LE, TOMAS MIKOLOV Presenters: Marjan Delpisheh, Nahid Alimohammadi
1
2
3
BoW = {“good":2,“movie":2,“not":2,“a":1,“did":1,“like":1}; Text vectorization:
words are equally distant!!
4
2-gram frequency Good movie 2 Not a 1 A good 1 Did not 1 Not like 1 5
6
7
𝑑𝑏𝑢= (0, 0, 1)
𝑒𝑝= (0, 1, 0)
𝑏𝑗𝑠𝑞𝑚𝑏𝑜𝑓= (1, 0, 0)
8
9
10
11
word in a sentence.
12
13
1, 𝑋 2, 𝑋 3, … , 𝑋𝑈
14
15
16
17
18
19
20
21
randomly sampled from the paragraph in the output
random word from the text window and form a classification task given the Paragraph Vector
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36