Supervised Topic Models
Atallah Hezbor and Anant Kharkar
Outline
- Intro [AK]
- LDA [AK]
○ Objective
○ Diagram
○ Motivation for sLDA
- sLDA
○ Expectation Maximization [AH]
○ Variational Inference [AH]
○ E-step [AH]
○ M-step [AK]
○ Prediction [AK]
- Experimental Setup [AH]
- Results/Conclusions [AH]
Introduction
- Topic modeling
○ Generally unsupervised
○ Learn topics - major clusters of content
- Latent Dirichlet Allocation
○ One method for topic modeling
○ Learn topic assignment for each document
- Learned topics often used for prediction
○ Analogous to PCA for regression/lasso
- sLDA - end-to-end learned LDA + regression
- Dirichlet Distribution
○ Takes a parameter vector α; a draw is a point on the probability simplex (a distribution over K topics)
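As a concrete illustration (not from the slides), a minimal NumPy sketch of drawing topic proportions from a Dirichlet; the parameter values and document count are invented:

```python
import numpy as np

# Hypothetical Dirichlet parameter vector over K = 4 topics.
alpha = np.array([0.5, 0.5, 0.5, 0.5])

# Sample topic proportions theta for 3 documents; each row is a
# point on the probability simplex (non-negative, sums to 1).
theta = np.random.dirichlet(alpha, size=3)
print(theta)               # shape (3, 4)
print(theta.sum(axis=1))   # each row sums to 1.0
```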
Latent Dirichlet Allocation
- Objective - identify major topics in document
○ Topic = distribution over words (word probabilities)
○ Use variational inference to compute parameters
○ Notation: θ (topic distribution), z (topic assignment), w (word), α (Dirichlet prior), β (topic-word distributions)
- Intractable posterior distr.
- Unsupervised topics may not be ideal for response prediction
○ E.g., genres may not be the optimal topics for predicting movie ratings
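For reference, a minimal unsupervised-LDA sketch using scikit-learn's LatentDirichletAllocation; the toy corpus and settings are invented, not the authors' setup:

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

# Invented toy corpus; real experiments would use movie reviews or Digg articles.
docs = [
    "great movie wonderful acting",
    "terrible plot boring acting",
    "stock market prices fall",
    "market rally lifts stock prices",
]

# Bag-of-words counts, then fit a 2-topic LDA via variational inference.
counts = CountVectorizer().fit_transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0)
theta = lda.fit_transform(counts)   # per-document topic proportions
print(theta.round(2))
```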
Supervised Latent Dirichlet Allocation
- Extend document generation model
○ Response variable y per document (see the generative process sketched below)
■ E.g., numerical rating, number of likes
- Formulate posterior
○ Intractable to compute
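For reference, the sLDA generative process for a document of N words, in the notation above (as given in Blei & McAuliffe's sLDA paper):

```latex
\theta \mid \alpha \sim \mathrm{Dir}(\alpha) \\
z_n \mid \theta \sim \mathrm{Mult}(\theta), \qquad
w_n \mid z_n, \beta_{1:K} \sim \mathrm{Mult}(\beta_{z_n}) \\
y \mid z_{1:N}, \eta, \sigma^2 \sim
  \mathcal{N}\!\left(\eta^\top \bar{z},\ \sigma^2\right), \qquad
\bar{z} = \tfrac{1}{N} \sum_{n=1}^{N} z_n
```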
Variational Inference
- Want to approximate posterior distribution
- Use Jensen’s inequality
○ log of an expectation ≥ expectation of the log (log is concave)
- Pick a family of variational distributions, Q
- Each q in Q has variational parameters: γ (Dirichlet), φ (multinomial)
- Variational Expectation Maximization
○ E-step: optimize w.r.t. γ, φ
○ M-step: optimize w.r.t. model parameters
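Schematically, Jensen's inequality yields the evidence lower bound (ELBO) that variational EM alternately optimizes:

```latex
\log p(w, y \mid \alpha, \beta_{1:K}, \eta, \sigma^2)
  \;\ge\; \mathbb{E}_q\!\left[\log p(\theta, z, w, y \mid \alpha, \beta_{1:K}, \eta, \sigma^2)\right]
        + H(q)
  \;=\; \mathcal{L}(\gamma, \phi;\ \alpha, \beta_{1:K}, \eta, \sigma^2)
```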
Expectation Step
- Model parameters are held fixed
- γ parametrizes the variational Dirichlet distribution over θ
- φ_j - the jth word's distribution over topics
- Maximize the lower bound with respect to γ
- Maximize the lower bound with respect to φ (updates sketched below)
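A sketch of the per-document coordinate-ascent updates, following the sLDA paper; here φ_{-j} = Σ_{n≠j} φ_n and ∘ is the element-wise product:

```latex
% Update for the variational Dirichlet parameter:
\gamma = \alpha + \sum_{n=1}^{N} \phi_n \\
% Update for each word's topic multinomial; the last two terms are the
% response-driven corrections sLDA adds to the standard LDA update:
\phi_j \;\propto\; \exp\Big\{ \mathbb{E}[\log \theta \mid \gamma]
  + \log \beta_{\cdot,\, w_j}
  + \frac{y}{N\sigma^2}\,\eta
  - \frac{2\,(\eta^\top \phi_{-j})\,\eta + (\eta \circ \eta)}{2N^2\sigma^2} \Big\}
```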
Maximization Step
- Estimate model parameters by maximizing corpus-level ELBO
- β_{1:K} - topic definitions (word distribution under topic k)
- Regression parameters - η, σ²
○ Maximize the corpus-level analogue of log p(response)
○ Expected-value normal equations yield closed-form updates (sketched below)
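A sketch of the M-step updates, following the paper; A denotes the D×K matrix whose d-th row is z̄_d for document d:

```latex
% Topics get the usual LDA-style update from the variational multinomials:
\hat{\beta}_{k,w} \;\propto\; \sum_{d=1}^{D} \sum_{n=1}^{N_d}
  \mathbf{1}[w_{d,n} = w]\ \phi^{k}_{d,n} \\
% Regression parameters solve the expected-value normal equations:
\hat{\eta} = \big(\mathbb{E}[A^\top A]\big)^{-1} \mathbb{E}[A]^\top y, \qquad
\hat{\sigma}^2 = \tfrac{1}{D}\big( y^\top y - y^\top \mathbb{E}[A]\,\hat{\eta} \big)
```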
Prediction
- Learned model params - α, β_{1:K}, η, σ²
○ η - regression coefficients learned on z̄ (empirical topic frequencies) for response y
- Predict response y for a new document given the learned model
- Variational approximation to the posterior expectation (sketched below)
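Sketch of the prediction rule: run variational inference on the new document's words alone, then take the expected response under the approximate posterior:

```latex
\mathbb{E}[y \mid w_{1:N}, \alpha, \beta_{1:K}, \eta, \sigma^2]
  \;\approx\; \eta^\top \mathbb{E}_q[\bar{z}]
  \;=\; \eta^\top \bar{\phi}, \qquad
\bar{\phi} = \tfrac{1}{N} \sum_{n=1}^{N} \phi_n
```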
Experimental Setup
- Movie review corpus [Ratings]
- Digg article corpus [Number of Diggs]
- Compared against
○ LDA + regression
○ Lasso regression
- Metrics:
○ Predictive R-squared
○ Per-word log-likelihood
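A small Python sketch of the predictive R² metric (1 - SSE/SST on held-out responses); the values below are invented:

```python
import numpy as np

def predictive_r2(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Fraction of variance in held-out responses explained by predictions."""
    sse = np.sum((y_true - y_pred) ** 2)
    sst = np.sum((y_true - np.mean(y_true)) ** 2)
    return 1.0 - sse / sst

# Hypothetical held-out ratings and model predictions:
y_true = np.array([3.0, 4.5, 2.0, 5.0])
y_pred = np.array([3.2, 4.0, 2.5, 4.6])
print(predictive_r2(y_true, y_pred))
```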
Results
- 8% and 9.4% improvements in prediction over the baselines
- Better topic model (higher per-word log-likelihood) for movie reviews
Conclusions
- LDA adapted to a specific purpose
○ Learn optimal topics for a specific response
- Best of both worlds
○ Predict response ○ Preserve high topic likelihood
- Lingering questions
○ More real-world examples - when does it work well?
○ How does it compare to deep feature learning?
Backup Slide
- Variational Distribution q
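Sketch of the fully factorized (mean-field) variational family used for LDA/sLDA, with q(θ | γ) a Dirichlet and each q(z_n | φ_n) a multinomial:

```latex
q(\theta, z_{1:N} \mid \gamma, \phi_{1:N})
  \;=\; q(\theta \mid \gamma) \prod_{n=1}^{N} q(z_n \mid \phi_n)
```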