Text as data
Econ 2148, fall 2019
Maximilian Kasy
Department of Economics, Harvard University
Agenda
One big contribution of machine learning methods to econometrics is that they make new forms of data
Representing text as data
Text regression
Generative language models
Latent Dirichlet allocation