Machine Learning 2 DS 4420 - Spring 2020 Topic Modeling 1 Byron C. - - PowerPoint PPT Presentation

machine learning 2
SMART_READER_LITE
LIVE PREVIEW

Machine Learning 2 DS 4420 - Spring 2020 Topic Modeling 1 Byron C. - - PowerPoint PPT Presentation

Machine Learning 2 DS 4420 - Spring 2020 Topic Modeling 1 Byron C. Wallace Last time: Clustering > Mixture Models > Expectation Maximization (EM) Today: Topic models Mixture models Assume we are given data, ,


slide-1
SLIDE 1

Machine Learning 2

DS 4420 - Spring 2020

Topic Modeling 1

Byron C. Wallace

slide-2
SLIDE 2

Last time: Clustering —> Mixture Models —> Expectation Maximization (EM)

slide-3
SLIDE 3

Today: Topic models

slide-4
SLIDE 4

Data:

Assume we are given data, , consisting of fully unsupervised ex- amples in dimensions: D = {(i)}N

i=1 where (i) ∈ RM

Model:

pθ,φ(, z) = pθ(|z)pφ(z) pθ,φ() =

K

  • z=1

pθ(|z)pφ(z)

Joint: Marginal:

Generative Story: z ∼ Multinomial(φ) ∼ pθ(·|z) (Marginal) Log-likelihood:

(θ) =

N

  • i=1

pθ,φ((i)) =

N

  • i=1
  • K
  • z=1

pθ((i)|z)pφ(z)

Mixture models

Slide credit: Matt Gormley and Eric Xing (CMU)

slide-5
SLIDE 5

Data:

Assume we are given data, , consisting of fully unsupervised ex- amples in dimensions: D = {(i)}N

i=1 where (i) ∈ RM

Model:

pθ,φ(, z) = pθ(|z)pφ(z) pθ,φ() =

K

  • z=1

pθ(|z)pφ(z)

Joint: Marginal:

Generative Story: z ∼ Multinomial(φ) ∼ pθ(·|z) (Marginal) Log-likelihood:

(θ) =

N

  • i=1

pθ,φ((i)) =

N

  • i=1
  • K
  • z=1

pθ((i)|z)pφ(z)

Mixture models

Slide credit: Matt Gormley and Eric Xing (CMU)

slide-6
SLIDE 6

Data:

Assume we are given data, , consisting of fully unsupervised ex- amples in dimensions: D = {(i)}N

i=1 where (i) ∈ RM

Model:

pθ,φ(, z) = pθ(|z)pφ(z) pθ,φ() =

K

  • z=1

pθ(|z)pφ(z)

Joint: Marginal:

Generative Story: z ∼ Multinomial(φ) ∼ pθ(·|z) (Marginal) Log-likelihood:

(θ) =

N

  • i=1

pθ,φ((i)) =

N

  • i=1
  • K
  • z=1

pθ((i)|z)pφ(z)

Mixture models

Slide credit: Matt Gormley and Eric Xing (CMU)

slide-7
SLIDE 7

Data:

Assume we are given data, , consisting of fully unsupervised ex- amples in dimensions: D = {(i)}N

i=1 where (i) ∈ RM

Model:

pθ,φ(, z) = pθ(|z)pφ(z) pθ,φ() =

K

  • z=1

pθ(|z)pφ(z)

Joint: Marginal:

Generative Story: z ∼ Multinomial(φ) ∼ pθ(·|z) (Marginal) Log-likelihood:

(θ) =

N

  • i=1

pθ,φ((i)) =

N

  • i=1
  • K
  • z=1

pθ((i)|z)pφ(z)

Mixture models

Slide credit: Matt Gormley and Eric Xing (CMU)

slide-8
SLIDE 8

Naive Bayes

p(c|w1:N, π, θ) ∝ p(c|π)

N

Y

n=1

p(wn|θc)

p(D|θ1:C, π) =

D

Y

d=1

p(cd|π)

N

Y

n=1

p(wn|θcd)

!

The model

slide-9
SLIDE 9

Slide credit: Matt Gormley and Eric Xing (CMU)

Initialize parameters randomly while not converged

1. E-Step: Create one training example for each possible value of the latent variables Weight each example according to model’s confidence

Treat parameters as observed

2. M-Step: Set the parameters to the values that maximizes likelihood

Treat pseudo-counts from above as observed

(Soft) EM

slide-10
SLIDE 10

And for NB

For

soft

EM

expected

  • f

times

t

  • ccurs

in

C

Pct

e

PCZi

L Count

tin

Xi

F

Plz

c

Ix il

a

Total Token Count

in Xi

slide-11
SLIDE 11

TOPIC MODELS

Some content borrowed from:
 David Blei
 (Columbia)

slide-12
SLIDE 12

Topic Models: Motivation

  • Suppose we have a giant dataset (“corpus”) of text, e.g., all of the

NYTimes or all emails from a company

❖ Cannot read all documents ❖ But want to get a sense of what they contain

slide-13
SLIDE 13

Topic Models: Motivation

  • Topic models are a way of uncovering, well,

“topics” (themes) in a set of documents

  • Topic models are unsupervised
  • Can be viewed as a type of clustering, so follows

naturally from prior lectures; will come back to this.

slide-14
SLIDE 14

Topic Models: Motivation

  • Topic models are a way of uncovering, well,

“topics” (themes) in a set of documents

  • Topic models are unsupervised
  • Can be viewed as a type of clustering, so follows

naturally from prior lectures; will come back to this.

slide-15
SLIDE 15

Topic Models: Motivation

  • Topic models are a way of uncovering, well,

“topics” (themes) in a set of documents

  • Topic models are unsupervised
  • Can be viewed as a sort of soft clustering of

documents into topics.

slide-16
SLIDE 16

the i that easter “number” is proteins ishtar in satan the a to the

  • f

the espn which to have hockey and i with a

  • f

if but this metaphorical “number” english as evil you and run there fact is

Example from Wallach, 2006

Topic 1 Topic 2 Topic 3 Topic 4

slide-17
SLIDE 17

Key outputs

  • Topics Distributions over words; we hope these are

somehow thematically coherent

  • Document-topics Probabilistic assignments of

topics to documents

slide-18
SLIDE 18

https://en.wikipedia.org/wiki/Enron_scandal

Example: Enron emails

https://www.cs.cmu.edu/~enron/

slide-19
SLIDE 19

Example: Enron emails

Topic Terms 3 trading financial trade product price 6 gas capacity deal pipeline contract 9 state california davis power utilities 14 ferc issue order party case 22 group meeting team process plan Example from Boyd-Graber, Hu and Mimno, 2017

https://en.wikipedia.org/wiki/Enron_scandal

slide-20
SLIDE 20

Document-topic probabilities

Yesterday, SDG&E filed a motion for adoption of an electric procurement cost recovery mechanism and for an order short- ening time for parties to file comments on the mechanism. The attached email from SDG&E contains the motion, an executive summary, and a detailed summary of their proposals and rec-

  • mmendations governing procurement of the net short energy

requirements for SDG&E’s customers. The utility requests a 15-day comment period, which means comments would have to be filed by September 10 (September 8 is a Saturday). Reply comments would be filed 10 days later. Topic Probability 9 0.42 11 0.05 8 0.05

Example from Boyd-Graber, Hu and Mimno, 2017

slide-21
SLIDE 21

Topics as Matrix Factorization

  • One can view topics as a kind of matrix factorization

M × V M × K K × V ≈ ×

Topic Assignment Topics Dataset

Figure from Boyd-Graber, Hu and Mimno, 2017

slide-22
SLIDE 22
  • One can view topics as a kind of matrix factorization

M × V M × K K × V ≈ ×

Topic Assignment Topics Dataset

  • We will try and take a more probabilistic view, but

useful to keep this in mind

Figure from Boyd-Graber, Hu and Mimno, 2017

Topics as Matrix Factorization

slide-23
SLIDE 23

Probabilistic Word Mixtures

Topics: Words: Idea: Model text as a mixture over words (ignore order)

gene 0.04 dna 0.02 genetic 0.01 .,, life 0.02 evolve 0.01

  • rganism 0.01

.,, brain 0.04 neuron 0.02 nerve 0.01 ... data 0.02 number 0.02 computer 0.01 .,,

slide-24
SLIDE 24

Topic Modeling

Idea: Model corpus of documents with shared topics

gene 0.04 dna 0.02 genetic 0.01 .,, life 0.02 evolve 0.01

  • rganism 0.01

.,, brain 0.04 neuron 0.02 nerve 0.01 ... data 0.02 number 0.02 computer 0.01 .,,

assignments

Topics (shared) Words in Document (mixture over topics) Topic Proportions (document-specific)

slide-25
SLIDE 25

Topic Modeling

  • Each topic is a distribution over words
  • Each document is a mixture over topics
  • Each word is drawn from one topic distribution

gene 0.04 dna 0.02 genetic 0.01 .,, life 0.02 evolve 0.01

  • rganism 0.01

.,, brain 0.04 neuron 0.02 nerve 0.01 ... data 0.02 number 0.02 computer 0.01 .,,

assignments

Topics (shared) Words in Document (mixture over topics) Topic Proportions (document-specific)

slide-26
SLIDE 26

Topic Modeling

  • Each topic is a distribution over words
  • Each document is a mixture over topics
  • Each word is drawn from one topic distribution

gene 0.04 dna 0.02 genetic 0.01 .,, life 0.02 evolve 0.01

  • rganism 0.01

.,, brain 0.04 neuron 0.02 nerve 0.01 ... data 0.02 number 0.02 computer 0.01 .,,

assignments

Topics (shared) Words in Document (mixture over topics) Topic Proportions (document-specific)

slide-27
SLIDE 27

Topic Modeling

  • Each topic is a distribution over words
  • Each document is a mixture over topics
  • Each word is drawn from one topic distribution

gene 0.04 dna 0.02 genetic 0.01 .,, life 0.02 evolve 0.01

  • rganism 0.01

.,, brain 0.04 neuron 0.02 nerve 0.01 ... data 0.02 number 0.02 computer 0.01 .,,

assignments

Topics (shared) Words in Document (mixture over topics) Topic Proportions (document-specific)

slide-28
SLIDE 28

xdn | zdn=k ∼ Discrete(βk)

<latexit sha1_base64="OX8/ryM68neOZUArDd/DTpCJlCE=">AGHicfZTNbtQwEIDd0oWy/LVw5JKylyKtqSt2l6QqkJFj6Xqn1SvVo4zu2t4gTbabe1/Co8Ak/AkRvihpA4wLPg7EZiE0c4UjSa+cYz4xk7zGImle/Wli8t9S6/2D5YfvR4ydPn62sPj+XaS4onNE0TsVlSCTEjMOZYiqGy0wAScIYLsLx28J+cQ1CspSfqtsMegkZcjZglCir6q8cTfoaUxHpyHDj4YRF3l1Fs/YGr3ljD0uWeFjBROl3TFIBCsw6vtY4BEVMf/y6v9LxN/zp8lwhKIXOPpqt4/7q0hcpTRPgCsaEymvAj9TPU2EYjQG08a5hIzQMRnClRU5SUD29LRk41Wsp0FPD1KugNOKmyaJTIgaOcoClUtHdnAIKphS2VPF7tEINmQV73CxLTbOIKBPf5pZjoK4xyMPnl/YLTf3dnqBpu7poYIiEoi2PO79qsDQwHAS2Rvuxvs7LlMloshn+QX2BFNgI43NA0SQiPNL4Gaq7s+WDgMhdQFGKbluhOYIx4Blqfab2Np43Toy2gzGfoO3/xNSx2zmsKNRCtw5017TXnYN9bMKwGhUz52avmSN7C5w+YuJBxI1DOExpiQSRan3KlnMEdP52TgBo3nmLHxZaxvdMRcXbMRs14NmIOe1LrzIkpxmWeIGKYENtnGYgiEpFcelumBrFLGFK6tJuXC/G/+9l7fVgh6Y6lMU/DPWhcUgaxtPBrJ6dO6H21apyRZUN2FBUsVnjGsCsBpYHXJD2vQvqr5srnG9uBFb+sN3ZPyhfvmX0Er1C6yhAu2gfHaFjdIYo+ox+oN/oT+tT62vrW+v7DF1cKH1eoMpq/fwLOsw/og=</latexit><latexit sha1_base64="OX8/ryM68neOZUArDd/DTpCJlCE=">AGHicfZTNbtQwEIDd0oWy/LVw5JKylyKtqSt2l6QqkJFj6Xqn1SvVo4zu2t4gTbabe1/Co8Ak/AkRvihpA4wLPg7EZiE0c4UjSa+cYz4xk7zGImle/Wli8t9S6/2D5YfvR4ydPn62sPj+XaS4onNE0TsVlSCTEjMOZYiqGy0wAScIYLsLx28J+cQ1CspSfqtsMegkZcjZglCir6q8cTfoaUxHpyHDj4YRF3l1Fs/YGr3ljD0uWeFjBROl3TFIBCsw6vtY4BEVMf/y6v9LxN/zp8lwhKIXOPpqt4/7q0hcpTRPgCsaEymvAj9TPU2EYjQG08a5hIzQMRnClRU5SUD29LRk41Wsp0FPD1KugNOKmyaJTIgaOcoClUtHdnAIKphS2VPF7tEINmQV73CxLTbOIKBPf5pZjoK4xyMPnl/YLTf3dnqBpu7poYIiEoi2PO79qsDQwHAS2Rvuxvs7LlMloshn+QX2BFNgI43NA0SQiPNL4Gaq7s+WDgMhdQFGKbluhOYIx4Blqfab2Np43Toy2gzGfoO3/xNSx2zmsKNRCtw5017TXnYN9bMKwGhUz52avmSN7C5w+YuJBxI1DOExpiQSRan3KlnMEdP52TgBo3nmLHxZaxvdMRcXbMRs14NmIOe1LrzIkpxmWeIGKYENtnGYgiEpFcelumBrFLGFK6tJuXC/G/+9l7fVgh6Y6lMU/DPWhcUgaxtPBrJ6dO6H21apyRZUN2FBUsVnjGsCsBpYHXJD2vQvqr5srnG9uBFb+sN3ZPyhfvmX0Er1C6yhAu2gfHaFjdIYo+ox+oN/oT+tT62vrW+v7DF1cKH1eoMpq/fwLOsw/og=</latexit><latexit sha1_base64="OX8/ryM68neOZUArDd/DTpCJlCE=">AGHicfZTNbtQwEIDd0oWy/LVw5JKylyKtqSt2l6QqkJFj6Xqn1SvVo4zu2t4gTbabe1/Co8Ak/AkRvihpA4wLPg7EZiE0c4UjSa+cYz4xk7zGImle/Wli8t9S6/2D5YfvR4ydPn62sPj+XaS4onNE0TsVlSCTEjMOZYiqGy0wAScIYLsLx28J+cQ1CspSfqtsMegkZcjZglCir6q8cTfoaUxHpyHDj4YRF3l1Fs/YGr3ljD0uWeFjBROl3TFIBCsw6vtY4BEVMf/y6v9LxN/zp8lwhKIXOPpqt4/7q0hcpTRPgCsaEymvAj9TPU2EYjQG08a5hIzQMRnClRU5SUD29LRk41Wsp0FPD1KugNOKmyaJTIgaOcoClUtHdnAIKphS2VPF7tEINmQV73CxLTbOIKBPf5pZjoK4xyMPnl/YLTf3dnqBpu7poYIiEoi2PO79qsDQwHAS2Rvuxvs7LlMloshn+QX2BFNgI43NA0SQiPNL4Gaq7s+WDgMhdQFGKbluhOYIx4Blqfab2Np43Toy2gzGfoO3/xNSx2zmsKNRCtw5017TXnYN9bMKwGhUz52avmSN7C5w+YuJBxI1DOExpiQSRan3KlnMEdP52TgBo3nmLHxZaxvdMRcXbMRs14NmIOe1LrzIkpxmWeIGKYENtnGYgiEpFcelumBrFLGFK6tJuXC/G/+9l7fVgh6Y6lMU/DPWhcUgaxtPBrJ6dO6H21apyRZUN2FBUsVnjGsCsBpYHXJD2vQvqr5srnG9uBFb+sN3ZPyhfvmX0Er1C6yhAu2gfHaFjdIYo+ox+oN/oT+tT62vrW+v7DF1cKH1eoMpq/fwLOsw/og=</latexit><latexit sha1_base64="ZEqi/4sMBZVC8LRe6l9DxkYkYM=">AGHicfZTNbtQwEIBd6EJZ/lo4ctmylyKtqRUbS9IVaGix1L1T6pXK8eZ3bXWcYLtNtafhUegSfgyA1xQ0gc4FlwtpHYxBGOFI1mvHMeMaOMs6UDoJfC3fuLrbu3V960H746PGTp8srz05VmksKJzTlqTyPiALOBJxopjmcZxJIEnE4iyZvC/vZJUjFUnGsrzPoJ2Qk2JBRop1qsHwHRhMZWxiK2wHJyzu3FQ0q2/wamfSwYolHaxhqs07pqgEDXYNXxocgSZ2MHk1WO4G68FsdXwhLIUuKtfhYGXxC45TmicgNOVEqYswyHTfEKkZ5WDbOFeQETohI7hwoiAJqL6ZlWw7Fetx2DfDVGgQtOJmSKISosesoBVUvHLjDIathS2TfFLjEoNhJVryix7TaOYeiOf5aZiSOegzVH7/esCXpbr3vhxratIRLikgh3gp76sBIAogS2dnshVs7PpPlMuPwDwoKrMhGgoArmiYJEbHBl0DthTsfDELlEopCXNMS0w2tR58izqfmb2N541Ta9xgzCfo+j+1dex6DisKdC1B907XjYR+bMKzHxcz52etmuQNbO6xuQ9JD5L1DKExJmSK8VR49Qzn6NmcDP2gfI4pe1xsyd2djom3YzZuxrMx89ijWmeObDEu8wSRo4S4PuM0A0l0KotLd8X0mLOEaWVKu/W9mPi/l7PXg+3b6lAW/ygy+9YjacRng1k9O39C3atV5YoqG7CRrGK3jWsAsxpYHnBuvcurL9uvnC6sR46+cNmd3evfPmW0Av0Eq2hEG2jXSADtEJougz+oF+oz+tT62vrW+t7fonYXS5zmqrNbPv+juP2I=</latexit>

zdn ∼ Discrete(θd)

<latexit sha1_base64="O1i9JMBjmHNm9M2eTNvBxn+6U6w=">AGB3icfZTNbtQwEIDd0oWy/HQLRy4r9lKkVZW0VdtjVajgWKr+SfVq5Tizu1YdJ9hO/yw/AI/AU3BDHJAQN3gE3gZnN4JNHOFI0cjzjWfGM54o40zpIPi9sHhvqX/wfLD9qPHT56udFafnao0lxROaMpTeR4RBZwJONFMczjPJAk4nAWXb4u9GdXIBVLxbG+zWCQkLFgI0aJdlvDzubd0GAqYxNbYbtYsaSLNdxo84YpKkGDXcN6Apr8xeyrYacXrAfT1fWFsBR6e2i2DoerS9wnNI8AaEpJ0pdhEGmB4ZIzSgH28a5gozQSzKGCycKkoAamGl2tlvRHocDM0qFBkErZoYkKiF64m0WsKru0olzDLqtwcmOKUGBQbi6pVlNh2G8cwcjc9jczEc/BmqO3+9YE/e3NfrixY2uIhLgkwt2g746MJYAokR2t/rh9q7PZLnMOPyDgIropEg4JqmSUJEbPAVUHvh7geDULmEIhGDo8T0QmutB89QZzPVt/G8sYaV+75APGVubF17HYOKxJ10K0H3TWdedhH5qwWes1RK+baZI3sLnH5j4kPUjWI4RGn5ApxlPh5TOao6d9MvKd8jmrHFxJHfPNybeidmkGc8mzGOPapU5skW7zBNEjhPi6ozTDCTRqSwe3TXTE84SpUp9da3YuL/Vk5fd3Zgq01Z/KPIHFiPpBGfNmb17vwOdbOoyhVZNmBjWcVmhWsAsxpYXnBunkX1qebL5xurIdOfr/V29svJ98yeoFeojUoh20h96hQ3SCKPqEvqOf6FfrY+tz60vr6wxdXChtnqPKav34A4dxOBg=</latexit><latexit sha1_base64="O1i9JMBjmHNm9M2eTNvBxn+6U6w=">AGB3icfZTNbtQwEIDd0oWy/HQLRy4r9lKkVZW0VdtjVajgWKr+SfVq5Tizu1YdJ9hO/yw/AI/AU3BDHJAQN3gE3gZnN4JNHOFI0cjzjWfGM54o40zpIPi9sHhvqX/wfLD9qPHT56udFafnao0lxROaMpTeR4RBZwJONFMczjPJAk4nAWXb4u9GdXIBVLxbG+zWCQkLFgI0aJdlvDzubd0GAqYxNbYbtYsaSLNdxo84YpKkGDXcN6Apr8xeyrYacXrAfT1fWFsBR6e2i2DoerS9wnNI8AaEpJ0pdhEGmB4ZIzSgH28a5gozQSzKGCycKkoAamGl2tlvRHocDM0qFBkErZoYkKiF64m0WsKru0olzDLqtwcmOKUGBQbi6pVlNh2G8cwcjc9jczEc/BmqO3+9YE/e3NfrixY2uIhLgkwt2g746MJYAokR2t/rh9q7PZLnMOPyDgIropEg4JqmSUJEbPAVUHvh7geDULmEIhGDo8T0QmutB89QZzPVt/G8sYaV+75APGVubF17HYOKxJ10K0H3TWdedhH5qwWes1RK+baZI3sLnH5j4kPUjWI4RGn5ApxlPh5TOao6d9MvKd8jmrHFxJHfPNybeidmkGc8mzGOPapU5skW7zBNEjhPi6ozTDCTRqSwe3TXTE84SpUp9da3YuL/Vk5fd3Zgq01Z/KPIHFiPpBGfNmb17vwOdbOoyhVZNmBjWcVmhWsAsxpYXnBunkX1qebL5xurIdOfr/V29svJ98yeoFeojUoh20h96hQ3SCKPqEvqOf6FfrY+tz60vr6wxdXChtnqPKav34A4dxOBg=</latexit><latexit sha1_base64="O1i9JMBjmHNm9M2eTNvBxn+6U6w=">AGB3icfZTNbtQwEIDd0oWy/HQLRy4r9lKkVZW0VdtjVajgWKr+SfVq5Tizu1YdJ9hO/yw/AI/AU3BDHJAQN3gE3gZnN4JNHOFI0cjzjWfGM54o40zpIPi9sHhvqX/wfLD9qPHT56udFafnao0lxROaMpTeR4RBZwJONFMczjPJAk4nAWXb4u9GdXIBVLxbG+zWCQkLFgI0aJdlvDzubd0GAqYxNbYbtYsaSLNdxo84YpKkGDXcN6Apr8xeyrYacXrAfT1fWFsBR6e2i2DoerS9wnNI8AaEpJ0pdhEGmB4ZIzSgH28a5gozQSzKGCycKkoAamGl2tlvRHocDM0qFBkErZoYkKiF64m0WsKru0olzDLqtwcmOKUGBQbi6pVlNh2G8cwcjc9jczEc/BmqO3+9YE/e3NfrixY2uIhLgkwt2g746MJYAokR2t/rh9q7PZLnMOPyDgIropEg4JqmSUJEbPAVUHvh7geDULmEIhGDo8T0QmutB89QZzPVt/G8sYaV+75APGVubF17HYOKxJ10K0H3TWdedhH5qwWes1RK+baZI3sLnH5j4kPUjWI4RGn5ApxlPh5TOao6d9MvKd8jmrHFxJHfPNybeidmkGc8mzGOPapU5skW7zBNEjhPi6ozTDCTRqSwe3TXTE84SpUp9da3YuL/Vk5fd3Zgq01Z/KPIHFiPpBGfNmb17vwOdbOoyhVZNmBjWcVmhWsAsxpYXnBunkX1qebL5xurIdOfr/V29svJ98yeoFeojUoh20h96hQ3SCKPqEvqOf6FfrY+tz60vr6wxdXChtnqPKav34A4dxOBg=</latexit><latexit sha1_base64="wK/6vFJ0W2rB0VpbrgTLA0uYUTE=">AGB3icfZTNbtQwEIDd0oWy/HQLRy4r9lKkVZW0VdtjVajgWKr+Sc1q5Tizu1YdJ9hO/yw/AI/AU3BDHJAQN3gE3gZ7N4JNHOFI0cjzjWfGM54Z1SqIPi9sHhvqX/wfLD9qPHT56udFafncqsEAROSMYycR5jCYxyOFUMTjPBeA0ZnAWX752+rMrEJm/Fjd5jBI8ZjTESVY2a1hZ/NuqCMiEp0YbrqRpGk3UnCj9BsqiQAFZi1SE1D4L2ZeDTu9YD2Yrq4vhKXQ+U6HK4ufYuSjBQpcEUYlvIiDHI10FgoShiYdlRIyDG5xGO4sCLHKciBnmZnuhXtcTjQo4wr4KRipnEqU6wm3qaDZXWXTKxjEFW35eZAu1MSkHTMq1ZxatrtKIGRvelpZDqJWQFGH73dNzrob2/2w40dU0MEJCUR7gZ9+9WBsQDgJbK71Q+3d30mL0TO4B8UOMxFI4DNcnSFPNER1dAzIW9nwi4LAS4RHQUp7oXGmM8eIZam6m+Hc0rb4y25Z4PMLrSN6aO3c5hLlEL3XrQXdNZdx72oQmbtV5D9KqZxkUDW3hs4UPCg0Q9Qmj0CbmkLONePqM5etonI98pm2PKGrsjmX2+CfZOzCfNeD6hHntUq8yRce0yT2AxTrGtc5TlILDKhHt01RNGE2pkrUG9+K8v9bWX3d2YGpNqX7x7E+MB5JYjZtzOrd+R1qZ1GVc1k2YGNRxWaFawDzGlhesCPtvAvr080XTjfWQyu/3+rt7ZeTbxm9QC/RGgrRDtpD79AhOkEfULf0U/0q/Wx9bn1pfV1hi4ulDbPUW1fvwBNaI32A=</latexit>

gene 0.04 dna 0.02 genetic 0.01 .,, life 0.02 evolve 0.01

  • rganism 0.01

.,, brain 0.04 neuron 0.02 nerve 0.01 ... data 0.02 number 0.02 computer 0.01 .,,

assignments

Topics (shared) Words in Document (mixture over topics) Topic Proportions (document-specific)

Each document has Different topic proportions

Topic Modeling

slide-29
SLIDE 29

LDA’s view of a document

Slide credit: William Cohen (CMU)

slide-30
SLIDE 30

Example: Discovering scientific topics

slide-31
SLIDE 31

Example Inference

human evolution disease computer genome evolutionary host models dna species bacteria information genetic

  • rganisms

diseases data genes life resistance computers sequence

  • rigin

bacterial system gene biology new network molecular groups strains systems sequencing phylogenetic control model map living infectious parallel information diversity malaria methods genetics group parasite networks mapping new parasites software project two united new sequences common tuberculosis simulations

slide-32
SLIDE 32

Example Inference

1 8 16 26 36 46 56 66 76 86 96 Topics Probability 0.0 0.1 0.2 0.3 0.4

slide-33
SLIDE 33

Example Inference

problem model selection species problems rate male forest mathematical constant males ecology number distribution females fish new time sex ecological mathematics number species conservation university size female diversity two values evolution population first value populations natural numbers average population ecosystems work rates sexual populations time data behavior endangered mathematicians density evolutionary tropical chaos measured genetic forests chaotic models reproductive ecosystem

slide-34
SLIDE 34

From Griffiths and Steyvers, PNAS 2004

slide-35
SLIDE 35

From Naive Bayes to Topic Models (board)

slide-36
SLIDE 36

Likelihood

log(p(x d | β,θ d)) = X

n

log(p(x dn | β,θ d)) X ✓Y ◆

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>

X = X

n

log ✓Y

v

p(x dn = v | β,θ d)I[x dn=v] ◆ X

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>

X ✓Y ◆ = X

n,v

I[x dn = v] log(p(x dn = v | β,θ d)) Ç å

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>

X = X

n,v

I[x dn = v] log ÇX

k

p(x dn = v, zdn=k | β,θ d) å Ç å

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>

X ÇX å = X

n,v

I[x dn = v] log ÇX

k

p(zdn=k | θ d) p(x dn = v | zdn=k,β) å Ç å

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>

X ÇX å = X

n,v

I[x dn = v] log ÇX

k

θ d,k β k,v å

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>

X ÇX å = X logθβ

<latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="LPiWkWvUWNYI5mpxmr9IeDTaWv4=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQglDwB6QKia0ob2wjptUd5WTuKnVxMkcp1wsfzO+yL7N7DSDJo7YVoqVdbx75z/OT7OiZuEOGW2/W1ufqHx4OGjxWbr8ZOnz54vLb84TeOMeujEi8OYnrswRSEm6IRhFqLzhCIYuSE6c8fv1P7ZBNEUx+SYXSWoH8GA4CH2IJOmwfLCDQjwAQhGrI1M1kDXkA5mPBLIQbcFyaIsG8Cj/rKCFzEoBCWNLhbmCj3DLw101AcTBi63Xe2YLpFk0IOZ9ocmfxAZAhq+LDhIa+wM+ETU6e5NfV/rCD3u6f1/UZcCJfVqeRNYPyn+d5L6S/EpP64/GiWY5Lbr27as7oFVc/x/8ivnkqdym0mNsHXv0dYEM8dWtaZ/kv9Yz4/7liwKaHqyVhW7IlronAtxF78SsRqo1Roste0NO39MfeEUi3bHmD5Hg+XGDfBjL4sQYV4I07Tn2Anrc0gZ9kIkWiBLUQK9MQxQTy4JjFDa5/mcEWZp9jp82FMGCJeyY3DKI0gG2lGBadlqzeSwoiWZQtjn6soPkpxQMpebiRLBz4aypmXZ8Z9N8yQ4N3+4Lb1vYby9ncERWEIr8gnF3bkr8qEFCESIHsblnO9q7OJBlNQnQH2QpT2VBE0IUXRxEkqkXIEz15PgCRNKNIFSKbFvG2I2TbqvAUlT75fgvMbl4KzkEpwelrGBXM5gqVEJXGnRdF+taw7WYT/uoJY9q6dhVsNmGpvpENUgWs0Q1WqiJMVhTLR6hjN0fk+Gumg4wxQ9ViFD+SH1oRYxGdXjyQhrbLfSma56b0sEpEZ9BnCAKWUzVS3eB2SjEWYpL/aF7oXJ/V5yvyp2IMqXUv27Lj8QGimHT34xy2en31A5k8qcqrIGC2gZmzauBkwqYHAipTzqlON31xurnhyPWnrXZnv5h8i8ZL45WxZjGjtExPhHxonhNdqNw0a38bm50nzb7DQLdn6u8FkxSk/z43ePMYEw</latexit><latexit sha1_base64="ZGSfvHkKuXAmqhUvswfMoyPCFyw=">AJkXicxZVbT9swFMcDdBvtbjAe9xJWaQIpQg1DwB6QEBPa0F5Yx03CXeUkbmrVcTLHKRfL34wvsm8zO82giSN2lZKlX8O+d/jo9z4iUEp7zT+TYzO9d48PDRfLP1+MnTZ8XFl+cpHGfHTsxyRmZx5MEcEUHXPMCTpLGIKR9CpN3qn90/HiKU4pkf8KkG9CIYUD7APuTL1F+duAIlDGxA04Ct2sgL8kAkwFpdS9kUgbRDhwAY+C7QReIhDKR1l8Ehu4MPc0g9WbcBwOSrdc7dgukWdSn9n2h6Z/EBkCFr4sOEhYHfTGWNTo7419X+iIOzk3/nqzLQFBH6dXyNnB+UvzvJPWX4hN+VH80WjDJbde3bVneAcv26P/kV84lT+U2kxph596jrQlmj5xqTf8k/5GZnwgcVRQw9FStOnZFtNA5k/IufiViNVCr1V9od9Y6+WObC7dYtK3iOewvNm5AEPtZhCj3CUzTc7eT8J6AjGOfINkCWYoS6I9giM7VksIpT2Rzxlpl3aP3J4YxJQj6pfcBIzSCPKhYdRwWrb6QyWMWFm2MPaEjhKgFIe07OVFqnQoIGaeXlmIvBIhqTovt+TouNsvnHc9S1ZQRgKCsLd7jqVwVChAtkO0Nx93cNpkYwlBd1BHYzobhi68OMoglS3CPnyXJ0PQDTNGNKFqKZFou1K1bYqPEGVT7fAtObl1IUEpwchkr2NUpgtV0JUBXdfFujawr3XYjztoZM/raZjVsJnBZibEDIhVM0S1mihJMYmpUc9gis7vycAUJVNM0WMdkqgPaQCNiMmwHk+G2GC7lc509XtbIiALI6j6DOIEMchjpl+6C8yHBEeYp6LYl6YXpvd7qf2q2L4sX0r973liXxqkGj75xSyfnXlD1Uwqc7rKGixkZWzSuBowqYDFAWtSzTu3Ot3Mxcn6mqvWnzbau3vF5Ju3XlqvrBXLtbasXeuDdWgdW36j3ThodBufm0vNt83dZsHOzhQ+S1bpaX78Dj1igPA=</latexit>
slide-37
SLIDE 37

How to estimate parameters in PLSA?

slide-38
SLIDE 38

Let’s implement… (in class exercise)

slide-39
SLIDE 39

Evaluation: Are these topics any good?

  • As for clustering: a bit tricky. Thoughts on how we

might evaluate topics?

slide-40
SLIDE 40

Likelihood of held-out data

<latexit sha1_base64="Ot+R1CnHY23oNHKIvRpVfR1ZFt4=">AF03icfZRPT9swFMANoxvr/gDbcRe0XnaoqoQh6BExoe0QUBiVTIcV4TC8fJbKdQLF+m3abdtm+zL7Jvs7iNtCaO5kjR03u/989+dpgzKpXn/Vlbf7TRefxk82n32fMXL7e2d15dyKwQBMYkY5m4CrERjmMFVUMrnIBOA0ZXIa3H6z9cgZC0oyfq3kOkxTHnE4pwapUnX2+2e5A2+xdl3Br4Qeqtbpzc7G7yDKSJECV4RhKa9L1cTjYWihIHpBoWEHJNbHMN1KXKcgpzoRaVmt2Y9yd6mnEFnNTcNE5lilXiKC0s61qSlIlB1NWyom2USKQNOZ1rzA13W4QwbTctUVlOgpZAUaPh4b7fUP3vf9vUPTQAREFeEPvX75NYFYAPAKGe73/YOhy+SFyBn8gzyL2WoEcLgjWZpiHulgBsRcl/sTAJeFANuIDsJU93xjAMv0dJnYe8Gq8Z7o3VQKzCY6XvTxOYrmG20hOYO9NAW68HBvrRhgUpA4ZbqVTuNixa2cNjChYQDiWaF0JoTcklZxp1+piv0Yk6mblK2wlRnbEOy8ipG2ImYJ+14nlCHTVOZmTsuKwSWMQpLs85yHIQWGXCXro7qhJGU6qkruzG9aL8/16lvZnsxNSH0v7DUJ8YhyQhWwxmfe/cCSUiqnO2yxYsFnVseXAtYN4Aqw2ZPne+c3XzRUu9ga+N/DP9ntHx9XLt4neoLfoHfLRITpCn9ApGiOCAH1HP9GvzrijO18735bo+lrl8xrVufHX+c0IZU=</latexit>

zn

<latexit sha1_base64="CL/3CPWRC9wnFZu4UuXjwKQH1Y=">AF1XicfZTNb9MwFMC9scIoXxscuUz0wqGakjFtPVZDExH2Ze0VJPjvKbWHCfYTrfO8g1xQ9zgn+Ef4b/BbiPRxBGOFD293tf9rPjglGpguDP2vqDjc7DR5uPu0+ePnv+Ymv75bnMS0HgjOQsF5cxlsAohzNFYPLQgDOYgYX8c17Z7+YgZA056dqXsA4wymnE0qwsqrP9f8eqsX7AaLteMLYSX0hmi5Tq63N35HSU7KDLgiDEt5FQaFGmsFCUMTDcqJRSY3OAUrqzIcQZyrBe1mp2a9TQc60nOFXBSc9M4kxlWU0/pYFnXkqlNDKetlKOtYuSgKQpr3vFmel2owQmdt8WlekZiUYPfpwZHTQP3jXD/cOTQMRkFREOAj69msCqQDgFTLY74cHA58pSlEw+AcFDnPVCOBwS/IswzR0QyIubL7EwGXpQDXiI7iTPdCY4wHL1Hrs7B3o1XjndE6qhUYzfSdaWLzFcw1aqG5B923xbr3sC9tWKSmoHBL9aqdxmULW3ps6UPCg0SzQmjNCYWkLOdeP5MVejEnEz8pW2GqM3Yhmb2MCfYiFtN2vJhSjx01TmZk3LisElikGbnHOUFCKxy4S7dLVTRjOqpK7sxvei/P9e1t5MdmzqQ+n+cayPjUeSmC0Gs753/oQSkdQ512ULlo6tjy4FrBogNUGO9K+d2HzdfOF873d0Mqf9nvDo+rl20Sv0Rv0FoXoEA3R3SCzhBKfqOfqJfnYuO6XztfFui62uVzytUW50fwHVSLi</latexit><latexit sha1_base64="CL/3CPWRC9wnFZu4UuXjwKQH1Y=">AF1XicfZTNb9MwFMC9scIoXxscuUz0wqGakjFtPVZDExH2Ze0VJPjvKbWHCfYTrfO8g1xQ9zgn+Ef4b/BbiPRxBGOFD293tf9rPjglGpguDP2vqDjc7DR5uPu0+ePnv+Ymv75bnMS0HgjOQsF5cxlsAohzNFYPLQgDOYgYX8c17Z7+YgZA056dqXsA4wymnE0qwsqrP9f8eqsX7AaLteMLYSX0hmi5Tq63N35HSU7KDLgiDEt5FQaFGmsFCUMTDcqJRSY3OAUrqzIcQZyrBe1mp2a9TQc60nOFXBSc9M4kxlWU0/pYFnXkqlNDKetlKOtYuSgKQpr3vFmel2owQmdt8WlekZiUYPfpwZHTQP3jXD/cOTQMRkFREOAj69msCqQDgFTLY74cHA58pSlEw+AcFDnPVCOBwS/IswzR0QyIubL7EwGXpQDXiI7iTPdCY4wHL1Hrs7B3o1XjndE6qhUYzfSdaWLzFcw1aqG5B923xbr3sC9tWKSmoHBL9aqdxmULW3ps6UPCg0SzQmjNCYWkLOdeP5MVejEnEz8pW2GqM3Yhmb2MCfYiFtN2vJhSjx01TmZk3LisElikGbnHOUFCKxy4S7dLVTRjOqpK7sxvei/P9e1t5MdmzqQ+n+cayPjUeSmC0Gs753/oQSkdQ512ULlo6tjy4FrBogNUGO9K+d2HzdfOF873d0Mqf9nvDo+rl20Sv0Rv0FoXoEA3R3SCzhBKfqOfqJfnYuO6XztfFui62uVzytUW50fwHVSLi</latexit><latexit sha1_base64="CL/3CPWRC9wnFZu4UuXjwKQH1Y=">AF1XicfZTNb9MwFMC9scIoXxscuUz0wqGakjFtPVZDExH2Ze0VJPjvKbWHCfYTrfO8g1xQ9zgn+Ef4b/BbiPRxBGOFD293tf9rPjglGpguDP2vqDjc7DR5uPu0+ePnv+Ymv75bnMS0HgjOQsF5cxlsAohzNFYPLQgDOYgYX8c17Z7+YgZA056dqXsA4wymnE0qwsqrP9f8eqsX7AaLteMLYSX0hmi5Tq63N35HSU7KDLgiDEt5FQaFGmsFCUMTDcqJRSY3OAUrqzIcQZyrBe1mp2a9TQc60nOFXBSc9M4kxlWU0/pYFnXkqlNDKetlKOtYuSgKQpr3vFmel2owQmdt8WlekZiUYPfpwZHTQP3jXD/cOTQMRkFREOAj69msCqQDgFTLY74cHA58pSlEw+AcFDnPVCOBwS/IswzR0QyIubL7EwGXpQDXiI7iTPdCY4wHL1Hrs7B3o1XjndE6qhUYzfSdaWLzFcw1aqG5B923xbr3sC9tWKSmoHBL9aqdxmULW3ps6UPCg0SzQmjNCYWkLOdeP5MVejEnEz8pW2GqM3Yhmb2MCfYiFtN2vJhSjx01TmZk3LisElikGbnHOUFCKxy4S7dLVTRjOqpK7sxvei/P9e1t5MdmzqQ+n+cayPjUeSmC0Gs753/oQSkdQ512ULlo6tjy4FrBogNUGO9K+d2HzdfOF873d0Mqf9nvDo+rl20Sv0Rv0FoXoEA3R3SCzhBKfqOfqJfnYuO6XztfFui62uVzytUW50fwHVSLi</latexit><latexit sha1_base64="EVtucYrJet2AkCUkTOzXDAuSwZo=">AF1XicfZTNb9MwFMC9scIoXxscuUz0wqGakjFtPU5DExzH2JfUVJXjvKbWHCfYzrbW8g1xQ9zgn+Ef4b/BbiPRxBGOFD293tf9rPjglGpguDP2vqDjc7DR5uPu0+ePnv+Ymv75aXMS0HguQsF9cxlsAohwtFYPrQgDOYgZX8c17Z7+6BSFpzs/VrIBRhlNOJ5RgZVWf52M+3uoFu8Fi7fhCWAk9VK3T8fbG7yjJSZkBV4RhKYdhUKiRxkJRwsB0o1JCgckNTmFoRY4zkCO9qNXs1Kzn4UhPcq6Ak5qbxpnMsJp6SgfLupZMbWIQ9bSVcqRdlAQkTXndK85MtxslMLH7tqhMJzErweizD8dGB/2Dd/1w79A0EAFJRYSDoG+/JpAKAF4hg/1+eDwmaIUBYN/UOAwV40ADnckzLMEx3dAjFDuz8RcFkKcI3oKM50LzTGePAStT4LezdaNd4braNagdGtvjdNbLaCuUYtNPOgeVusuYd9acMiNQWFW6pX7TQuW9jSY0sfEh4kmhVCa04oJGU59/qZrNCLOZn4SdkKU52xC8nsZUywF7GYtuPFlHrsWeNkzowbl1UCizTD9pyjvACBVS7cpbujaspoRpXUld34XpT/38vam8lOTH0o3T+O9YnxSBKzxWDW986fUCKSOue6bMFSUceWB9cCFg2w2mBH2vcubL5uvnC5txta+dN+7+i4evk20Wv0Br1FITpER+gjOkUXiKAUfUc/0a/OVcd0vna+LdH1tcrnFaqtzo+/g3oiog=</latexit>

xn

<latexit sha1_base64="R5fqLlqQnakjPB/hUCUoBDPFKv0=">AF1XicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaeqyGJjiOsm6TlmpynNfUmuME29nWb4hbogbfBm+CN8Gu41E0c4UvT03u/9s58dF4xKFQR/1tYfbHQePtp83H3y9NnzF1vbL89kXgoCY5KzXFzEWAKjHMaKgYXhQCcxQzO4+v3zn5+A0LSnJ+qeQGTDKecTinByqo+313xq61esBs1o4vhJXQG6LlOrna3vgdJTkpM+CKMCzlZRgUaqKxUJQwMN2olFBgco1TuLQixnIiV7UanZq1tNwoqc5V8BJzU3jTGZYzTylg2VdS2Y2MYh62ko50S5KApKmvO4VZ6bjRKY2n1bVKaTmJVg9OjDkdFB/+BdP9w7NA1EQFIR4SDo268JpAKAV8hgvx8eDHymKEXB4B8UOMxVI4DLcmzDPNERzdAzKXdnwi4LAW4RnQUZ7oXGmM8eIlan4W9G60a74zWUa3A6EbfmSY2X8Fcoxae9B9W6x7D/vShkVqBgq3VK/aVy2sKXHlj4kPEg0K4TWnFBIynLu9TNdoRdzMvWTshWmOmMXktnLmGAvYjFrx4sZ9dhR42RGxo3LKoFmF7zlFegMAqF+7S3VI1YzSjSurKbnwvyv/vZe3NZMemPpTuH8f62HgkidliMOt7508oEUmdc12YKmoY8uDawGLBlhtsCPtexc2XzdfONvbDa38ab83PKpevk30Gr1Bb1GIDtEQfUQnaIwIStF39BP96px3TOdr59sSXV+rfF6h2ur8+AvKZyLg</latexit><latexit sha1_base64="R5fqLlqQnakjPB/hUCUoBDPFKv0=">AF1XicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaeqyGJjiOsm6TlmpynNfUmuME29nWb4hbogbfBm+CN8Gu41E0c4UvT03u/9s58dF4xKFQR/1tYfbHQePtp83H3y9NnzF1vbL89kXgoCY5KzXFzEWAKjHMaKgYXhQCcxQzO4+v3zn5+A0LSnJ+qeQGTDKecTinByqo+313xq61esBs1o4vhJXQG6LlOrna3vgdJTkpM+CKMCzlZRgUaqKxUJQwMN2olFBgco1TuLQixnIiV7UanZq1tNwoqc5V8BJzU3jTGZYzTylg2VdS2Y2MYh62ko50S5KApKmvO4VZ6bjRKY2n1bVKaTmJVg9OjDkdFB/+BdP9w7NA1EQFIR4SDo268JpAKAV8hgvx8eDHymKEXB4B8UOMxVI4DLcmzDPNERzdAzKXdnwi4LAW4RnQUZ7oXGmM8eIlan4W9G60a74zWUa3A6EbfmSY2X8Fcoxae9B9W6x7D/vShkVqBgq3VK/aVy2sKXHlj4kPEg0K4TWnFBIynLu9TNdoRdzMvWTshWmOmMXktnLmGAvYjFrx4sZ9dhR42RGxo3LKoFmF7zlFegMAqF+7S3VI1YzSjSurKbnwvyv/vZe3NZMemPpTuH8f62HgkidliMOt7508oEUmdc12YKmoY8uDawGLBlhtsCPtexc2XzdfONvbDa38ab83PKpevk30Gr1Bb1GIDtEQfUQnaIwIStF39BP96px3TOdr59sSXV+rfF6h2ur8+AvKZyLg</latexit><latexit sha1_base64="R5fqLlqQnakjPB/hUCUoBDPFKv0=">AF1XicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaeqyGJjiOsm6TlmpynNfUmuME29nWb4hbogbfBm+CN8Gu41E0c4UvT03u/9s58dF4xKFQR/1tYfbHQePtp83H3y9NnzF1vbL89kXgoCY5KzXFzEWAKjHMaKgYXhQCcxQzO4+v3zn5+A0LSnJ+qeQGTDKecTinByqo+313xq61esBs1o4vhJXQG6LlOrna3vgdJTkpM+CKMCzlZRgUaqKxUJQwMN2olFBgco1TuLQixnIiV7UanZq1tNwoqc5V8BJzU3jTGZYzTylg2VdS2Y2MYh62ko50S5KApKmvO4VZ6bjRKY2n1bVKaTmJVg9OjDkdFB/+BdP9w7NA1EQFIR4SDo268JpAKAV8hgvx8eDHymKEXB4B8UOMxVI4DLcmzDPNERzdAzKXdnwi4LAW4RnQUZ7oXGmM8eIlan4W9G60a74zWUa3A6EbfmSY2X8Fcoxae9B9W6x7D/vShkVqBgq3VK/aVy2sKXHlj4kPEg0K4TWnFBIynLu9TNdoRdzMvWTshWmOmMXktnLmGAvYjFrx4sZ9dhR42RGxo3LKoFmF7zlFegMAqF+7S3VI1YzSjSurKbnwvyv/vZe3NZMemPpTuH8f62HgkidliMOt7508oEUmdc12YKmoY8uDawGLBlhtsCPtexc2XzdfONvbDa38ab83PKpevk30Gr1Bb1GIDtEQfUQnaIwIStF39BP96px3TOdr59sSXV+rfF6h2ur8+AvKZyLg</latexit><latexit sha1_base64="TFLGUTr6qFYReHLRiCE2yIPgUY=">AF1XicfZTNb9MwFMC9scIoXxscuUz0wqGakjFtPU5DExzH2JfUVJXjvKbWHCfYztbO8g1xQ9zgn+Ef4b/BbiPRxBGOFD293tf9rPjglGpguDP2vqDjc7DR5uPu0+ePnv+Ymv75aXMS0HguQsF9cxlsAohwtFYPrQgDOYgZX8c17Z7+6BSFpzs/VvIBRhlNOJ5RgZVWfZ2M+3uoFu8Fi7fhCWAk9VK3T8fbG7yjJSZkBV4RhKYdhUKiRxkJRwsB0o1JCgckNTmFoRY4zkCO9qNXs1Kzn4UhPcq6Ak5qbxpnMsJp6SgfLupZMbWIQ9bSVcqRdlAQkTXndK85MtxslMLH7tqhMJzErweizD8dGB/2Dd/1w79A0EAFJRYSDoG+/JpAKAF4hg/1+eDwmaIUBYN/UOAwV40ADnckzLMEx3dAjFDuz8RcFkKcI3oKM50LzTGePAStT4LezdaNc6M1lGtwOhWz0wTm69grlELzT3ovi3WvYd9acMiNQWFW6pX7TQuW9jSY0sfEh4kmhVCa04oJGU59/qZrNCLOZn4SdkKU52xC8nsZUywF7GYtuPFlHrsWeNkzowbl1UCizTD9pyjvACBVS7cpbujaspoRpXUld34XpT/38vam8lOTH0o3T+O9YnxSBKzxWDW986fUCKSOue6bMFSUceWB9cCFg2w2mBH2vcubL5uvnC5txta+dN+7+i4evk20Wv0Br1FITpER+gjOkUXiKAUfUc/0a/OVcd0vna+LdH1tcrnFaqtzo+/eJgioA=</latexit>

β

<latexit sha1_base64="aDS6LXjdUwMKJ2qhSEXt/yLFw8g=">AF13icfZRPb9MwFMC9scIof7bBkctELxyqKRnT1mM1NMFxVOs2tFST47y21hwn2E63zrK4IW6IG3wXvgjfBruNRBNHOFL09N7v/bOfHeMShUEf9bWH2y0Hj7afNx+8vTZ863tnRfnMisEgSHJWCYuYyBUQ5DRWDy1wATmMGF/HNO2e/mIGQNONnap7DKMUTseUYGVwygGha+3O8FesFi7vhCWQqePluv0emfjd5RkpEiBK8KwlFdhkKuRxkJRwsC0o0JCjskNnsCVFTlOQY70olqzW7GehSM9zrgCTipuGqcyxWrqKR0sq1oytYlBVNOWypF2URKQdMKrXnFq2u0ogbHduUVlOolZAUYP3h8bHXQP3bD/SNTQwQkJRH2gq796sBEAPAS6R10w8Oez+SFyBn8gwKHuWoEcLglWZpinuhoBsRc2f2JgMtCgGtER3GqO6ExoOXqPVZ2NvRqvHOaB1VCoxm+s7UsfkK5hq10NyD7pti3XvY5yYsUlM7bw3Vq2YaFw1s4bGFDwkPEvUKoTEn5JKyjHv9jFfoxZyM/aRshSnP2IVk9jom2IuYT5vxfEo9dlA7mYFx47JKYDFJsT3nKMtBYJUJd+luqZoymlIldWk3vhfl/ey9nqyE1MdSvePY31iPJLEbDGY1b3zJ5SIpMq5Lhuwiahiy4NrAPMaWG6wI+17F9ZfN184398LrfzxoNM/Ll+TfQKvUZvUIiOUB9QKdoiAi6Dv6iX61PrW+tL62vi3R9bXS5yWqrNaPv+tI6c=</latexit><latexit sha1_base64="aDS6LXjdUwMKJ2qhSEXt/yLFw8g=">AF13icfZRPb9MwFMC9scIof7bBkctELxyqKRnT1mM1NMFxVOs2tFST47y21hwn2E63zrK4IW6IG3wXvgjfBruNRBNHOFL09N7v/bOfHeMShUEf9bWH2y0Hj7afNx+8vTZ863tnRfnMisEgSHJWCYuYyBUQ5DRWDy1wATmMGF/HNO2e/mIGQNONnap7DKMUTseUYGVwygGha+3O8FesFi7vhCWQqePluv0emfjd5RkpEiBK8KwlFdhkKuRxkJRwsC0o0JCjskNnsCVFTlOQY70olqzW7GehSM9zrgCTipuGqcyxWrqKR0sq1oytYlBVNOWypF2URKQdMKrXnFq2u0ogbHduUVlOolZAUYP3h8bHXQP3bD/SNTQwQkJRH2gq796sBEAPAS6R10w8Oez+SFyBn8gwKHuWoEcLglWZpinuhoBsRc2f2JgMtCgGtER3GqO6ExoOXqPVZ2NvRqvHOaB1VCoxm+s7UsfkK5hq10NyD7pti3XvY5yYsUlM7bw3Vq2YaFw1s4bGFDwkPEvUKoTEn5JKyjHv9jFfoxZyM/aRshSnP2IVk9jom2IuYT5vxfEo9dlA7mYFx47JKYDFJsT3nKMtBYJUJd+luqZoymlIldWk3vhfl/ey9nqyE1MdSvePY31iPJLEbDGY1b3zJ5SIpMq5Lhuwiahiy4NrAPMaWG6wI+17F9ZfN184398LrfzxoNM/Ll+TfQKvUZvUIiOUB9QKdoiAi6Dv6iX61PrW+tL62vi3R9bXS5yWqrNaPv+tI6c=</latexit><latexit sha1_base64="aDS6LXjdUwMKJ2qhSEXt/yLFw8g=">AF13icfZRPb9MwFMC9scIof7bBkctELxyqKRnT1mM1NMFxVOs2tFST47y21hwn2E63zrK4IW6IG3wXvgjfBruNRBNHOFL09N7v/bOfHeMShUEf9bWH2y0Hj7afNx+8vTZ863tnRfnMisEgSHJWCYuYyBUQ5DRWDy1wATmMGF/HNO2e/mIGQNONnap7DKMUTseUYGVwygGha+3O8FesFi7vhCWQqePluv0emfjd5RkpEiBK8KwlFdhkKuRxkJRwsC0o0JCjskNnsCVFTlOQY70olqzW7GehSM9zrgCTipuGqcyxWrqKR0sq1oytYlBVNOWypF2URKQdMKrXnFq2u0ogbHduUVlOolZAUYP3h8bHXQP3bD/SNTQwQkJRH2gq796sBEAPAS6R10w8Oez+SFyBn8gwKHuWoEcLglWZpinuhoBsRc2f2JgMtCgGtER3GqO6ExoOXqPVZ2NvRqvHOaB1VCoxm+s7UsfkK5hq10NyD7pti3XvY5yYsUlM7bw3Vq2YaFw1s4bGFDwkPEvUKoTEn5JKyjHv9jFfoxZyM/aRshSnP2IVk9jom2IuYT5vxfEo9dlA7mYFx47JKYDFJsT3nKMtBYJUJd+luqZoymlIldWk3vhfl/ey9nqyE1MdSvePY31iPJLEbDGY1b3zJ5SIpMq5Lhuwiahiy4NrAPMaWG6wI+17F9ZfN184398LrfzxoNM/Ll+TfQKvUZvUIiOUB9QKdoiAi6Dv6iX61PrW+tL62vi3R9bXS5yWqrNaPv+tI6c=</latexit><latexit sha1_base64="kSBDf/XaBhLxR2gd7E4qD97QhD4=">AF13icfZRPb9MwFMC9scIof7bBkctELxyqKRnT1uM0NMFxVOs21FST47y21hwn2E63zrK4IW6IG3wXvgjfBruNRBNHOFL09N7v/bOfHeMShUEf9bWH2y0Hj7afNx+8vTZ863tnRcXMisEgQHJWCauYiyBUQ4DRWDq1wATmMGl/HNO2e/nIGQNOPnap7DKMUTseUYGVgygGha+3O8FesFi7vhCWQgeV6+x6Z+N3lGSkSIErwrCUwzDI1UhjoShYNpRISH5AZPYGhFjlOQI72o1uxWrOfhSI8zroCTipvGqUyxmnpKB8uqlkxtYhDVtKVypF2UBCSd8KpXnJp2O0pgbHduUZlOYlaA0f3J0YH3cO3XD/yNQAUlJhL2ga786MBEAvER6B93wsOczeSFyBv+gwGuGgEcbkmWpgnOpoBMUO7PxFwWQhwjegoTnUnNMZ48BK1Pgt7O1o13hmto0qB0UzfmTo2X8Fcoxae9B9U6x7D/vchEVqauetoXrVTOigS08tvAh4UGiXiE05oRcUpZxr5/xCr2Yk7GflK0w5Rm7kMxexwR7EfNpM5Pqcf2ayfTN25cVgksJim25xlOQisMuEu3S1VU0ZTqQu7cb3ovz/XtZeT3ZqkPp/nGsT41HkpgtBrO6d/6EpFUOdlAzYRVWx5cA1gXgPLDXakfe/C+uvmCxf7e6GVPx50jk/Kl28TvUKv0RsUoiN0jD6gMzRABFH0Hf1Ev1qfWl9aX1vfluj6WunzElVW68dfmZ4jZw=</latexit>

θ

<latexit sha1_base64="dtgpQ6KClwybVS3dqS0RC75e2U=">AF2HicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaepyGJjiOat0mlmpynNfUzHGC7WzrLEvcEDfEDT4LX4Rvg91GokjHCl6eu/3/tnPjgtGpQqCPyur9Y69x+sP+w+evzk6bONzenMi8FgRHJWS7OYyBUQ4jRWD80IAzmIGZ/HVW2c/uwYhac5P1KyAcYZTieUYGVp5GagsKXG71gO5ivLV8IK6F3gBbr+HJz7XeU5KTMgCvCsJQXYVCoscZCUcLAdKNSQoHJFU7hwocZyDHel6u2apZT8KxnuRcASc1N40zmWE19ZQOlnUtmdrEIOpK+VYuygJSJryulecmW43SmBit25emU5iVoLRw3eHRgf9vTf9cGfNBABSUWEg6BvyaQCgBeIYPdfrg38JmiFAWDf1DgMFeNA43JM8yzBMdXQMxF3Z/IuCyFOAa0VGc6V5ojPHgBWp95vZutGy8NVpHtQKja31rmthsCXONWmjmQXdtse487HMbtpi3lupVO43LFrb02NKHhAeJZoXQmhMKSVnOvX4mS/R8TiZ+UrbEVGfsQjJ7HxPsRSym7XgxpR47bJzM0LhxWSawSDNsznKCxBY5cJduhuqpoxmVEld2Y3vRfn/vay9mezI1IfS/eNYHxmPJDGbD2Z97/wJSKpc67LFiwVdWxcC1g0QCrDXakfe/C5uvmC6c726GVP+z2Dg6rl28dvUSv0GsUon10gN6jYzRCBH1C39FP9KvzsfOl87XzbYGurlQ+L1BtdX78Ba2VJCs=</latexit><latexit sha1_base64="dtgpQ6KClwybVS3dqS0RC75e2U=">AF2HicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaepyGJjiOat0mlmpynNfUzHGC7WzrLEvcEDfEDT4LX4Rvg91GokjHCl6eu/3/tnPjgtGpQqCPyur9Y69x+sP+w+evzk6bONzenMi8FgRHJWS7OYyBUQ4jRWD80IAzmIGZ/HVW2c/uwYhac5P1KyAcYZTieUYGVp5GagsKXG71gO5ivLV8IK6F3gBbr+HJz7XeU5KTMgCvCsJQXYVCoscZCUcLAdKNSQoHJFU7hwocZyDHel6u2apZT8KxnuRcASc1N40zmWE19ZQOlnUtmdrEIOpK+VYuygJSJryulecmW43SmBit25emU5iVoLRw3eHRgf9vTf9cGfNBABSUWEg6BvyaQCgBeIYPdfrg38JmiFAWDf1DgMFeNA43JM8yzBMdXQMxF3Z/IuCyFOAa0VGc6V5ojPHgBWp95vZutGy8NVpHtQKja31rmthsCXONWmjmQXdtse487HMbtpi3lupVO43LFrb02NKHhAeJZoXQmhMKSVnOvX4mS/R8TiZ+UrbEVGfsQjJ7HxPsRSym7XgxpR47bJzM0LhxWSawSDNsznKCxBY5cJduhuqpoxmVEld2Y3vRfn/vay9mezI1IfS/eNYHxmPJDGbD2Z97/wJSKpc67LFiwVdWxcC1g0QCrDXakfe/C5uvmC6c726GVP+z2Dg6rl28dvUSv0GsUon10gN6jYzRCBH1C39FP9KvzsfOl87XzbYGurlQ+L1BtdX78Ba2VJCs=</latexit><latexit sha1_base64="dtgpQ6KClwybVS3dqS0RC75e2U=">AF2HicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaepyGJjiOat0mlmpynNfUzHGC7WzrLEvcEDfEDT4LX4Rvg91GokjHCl6eu/3/tnPjgtGpQqCPyur9Y69x+sP+w+evzk6bONzenMi8FgRHJWS7OYyBUQ4jRWD80IAzmIGZ/HVW2c/uwYhac5P1KyAcYZTieUYGVp5GagsKXG71gO5ivLV8IK6F3gBbr+HJz7XeU5KTMgCvCsJQXYVCoscZCUcLAdKNSQoHJFU7hwocZyDHel6u2apZT8KxnuRcASc1N40zmWE19ZQOlnUtmdrEIOpK+VYuygJSJryulecmW43SmBit25emU5iVoLRw3eHRgf9vTf9cGfNBABSUWEg6BvyaQCgBeIYPdfrg38JmiFAWDf1DgMFeNA43JM8yzBMdXQMxF3Z/IuCyFOAa0VGc6V5ojPHgBWp95vZutGy8NVpHtQKja31rmthsCXONWmjmQXdtse487HMbtpi3lupVO43LFrb02NKHhAeJZoXQmhMKSVnOvX4mS/R8TiZ+UrbEVGfsQjJ7HxPsRSym7XgxpR47bJzM0LhxWSawSDNsznKCxBY5cJduhuqpoxmVEld2Y3vRfn/vay9mezI1IfS/eNYHxmPJDGbD2Z97/wJSKpc67LFiwVdWxcC1g0QCrDXakfe/C5uvmC6c726GVP+z2Dg6rl28dvUSv0GsUon10gN6jYzRCBH1C39FP9KvzsfOl87XzbYGurlQ+L1BtdX78Ba2VJCs=</latexit><latexit sha1_base64="oMiyQFUFxbkN7ZoLVRKcGnWf1C8=">AF2HicfZRPb9MwFMC9scIo/zY4cpnohUM1JWPaepyGJjiOau0mlmpynNfWzHGC7XTrLEvcEDfEDT4LX4Rvg91GokjHCl6eu/3/tnPjnNGpQqCP2vr9zZa9x9sPmw/evzk6bOt7edDmRWCwIBkLBMXMZbAKIeBorBRS4ApzGD8/j6rbOfz0BImvEzNc9hlOIJp2NKsLKqYaSmoPDVifYDRZrxfCUuigcp1ebW/8jpKMFClwRiW8jIMcjXSWChKGJh2VEjIMbnGE7i0IscpyJFelGt2KtazcKTHGVfAScVN41SmWE09pYNlVUumNjGIatpSOdIuSgKSTnjVK05Nux0lMLZbt6hMJzErwOj+u2Ojg+7Bm264d2hqiICkJMJe0LVfHZgIAF4ivf1ueNDzmbwQOYN/UOAwV40ADjckS1PMEx3NgJhLuz8RcFkIcI3oKE51JzTGePAStT4LeztaNd4araNKgdFM35o6Nl/BXKMWmnvQXVOsOw/73IQt562hetVM46KBLTy28CHhQaJeITmhFxSlnGvn/EKvZiTsZ+UrTDlGbuQzN7HBHsR82kznk+px/ZrJ9M3blxWCSwmKbnHGU5CKwy4S7dDVTRlOqpC7txvei/P9e1l5PdmKqQ+n+caxPjEeSmC0Gs7p3/oQSkVQ512UDNhFVbHlwDWBeA8sNdqR978L6+YLw73d0Mof9jtHx+XLt4leolfoNQrRITpC79EpGiCPqHv6Cf61frY+tL62vq2RNfXSp8XqLJaP/4CW8Yj6w=</latexit>

Fit model Observed data Held-out data

slide-41
SLIDE 41

“Intrusion detection”

Word Intrusion Topic Intrusion

From Chang et al., 2009

slide-42
SLIDE 42

“Intrusion detection”

Word Intrusion Topic Intrusion

Which word doesn’t belong?

From Chang et al., 2009

slide-43
SLIDE 43

“Intrusion detection”

Word Intrusion Topic Intrusion

Which topic doesn’t belong?

From Chang et al., 2009

slide-44
SLIDE 44

Summing up

  • PLSA is a simple ad-mixture model that uncovers

topics (distributions over words) and soft-assigns instances to these.

  • We saw parameter estimation via Expectation-

Maximization.

  • Next time: Introducing priors into topic models —

Latent Dirichlet Allocation (LDA).

★ This will motivate sampling-based estimation

slide-45
SLIDE 45

Summing up

  • PLSA is a simple ad-mixture model that uncovers

topics (distributions over words) and soft-assigns instances to these.

  • We saw parameter estimation via Expectation-

Maximization.

  • Next time: Introducing priors into topic models —

Latent Dirichlet Allocation (LDA).

★ This will motivate sampling-based estimation

slide-46
SLIDE 46

Summing up

  • PLSA is a simple ad-mixture model that uncovers

topics (distributions over words) and soft-assigns instances to these.

  • We saw parameter estimation via Expectation-

Maximization.

  • Next time: Introducing priors into topic models —

Latent Dirichlet Allocation (LDA).

★ This will motivate sampling-based estimation