[PPT] - Dynamic Embeddings for User Profiling in Twitter Shangsong Liang 1 , PowerPoint Presentation

SLIDE 1

Dynamic Embeddings for User Profiling in Twitter

1 KAUST, Saudi Arabia 2 JD.com, China 3 University of Amsterdam, The Netherlands

Shangsong Liang1, Xiangliang Zhang1, Zhaochun Ren2, Evangelos Kanoulas3

SLIDE 2

2

Overview

Ò The Task Background and Related Work Ò Our Method

Ò Dynamic User and Word Embedding Model (DUWE) Ò Streaming Keyword Diversification Model (SKDM)

Ò Experiments Ò Conclusion

SLIDE 3

3

The Task

Input: A stream of tweets generated across the time Output: A set of keywords to profile the user at different point in time

Tweets over time Twitter Users Given a user at time t Sport Food

SLIDE 4

4

The Task

Tweets over time Twitter Users Given a user at time t Sport Food

Relevant Diversified Dynamic

SLIDE 5

5

Background of User Profiling Problem

Ò Expert finding task at TREC 2005 enterprise track

Ò Given documents which describes expert candidates, answer

a query with a sorted name list in a specific domain, ☛ uncovering associations between people and topics

Ò A generative language modeling approach in Balong et al

(2007)

Ò Works on a Static document collection Ò Assumes users’ profiling results are unchanged

Need Dynamic User Profiling

SLIDE 6

6

Dynamic User Profiling Approaches

Ò ExperTime (Rybak et al 2014) Ò A probabilistic model for learning how personal research

interests evolve (Fang and Godavarthy 2014)

SLIDE 7

7

Limitations of Current User Profiling Methods

Ò Treat words as atomic units leading to a vocabulary mismatch that

harms performance

Ò Represent words and users in disjoint vocabulary spaces making

it difficult to measure the similarity between users and words when constructing the profile Can words and users be embedded in the same semantic space? Can their embedding be modeled in the dynamic environment?

SLIDE 8

8

Overview

Ò The Task Background and Related Work Ò Our Method

Ò Dynamic User and Word Embedding Model (DUWE) Ò Streaming Keyword Diversification Model (SKDM)

Ò Experiments Ò Conclusion

SLIDE 10

10

Our Approach

Ò Dynamic User and Word Embedding Model (DUWE)

Ò Infer both users’ and words’ embeddings over time in the

same semantic space

Ò Enable to measure the similarities between users’ and words’

embeddings

Ò Streaming Keyword Diversification Model

Ò Retrieve relevant keywords to profile users’ current interests

ver time

Ò Diversify the returned relevant keywords such that the

keywords can cover all aspects of the users’ interests

SLIDE 11

Dynamic User and Word Embedding

11

vt−1 zt−1 βt−1 ut−1 yt−1 αt−1 vt zt βt ut yt αt

n+

t−1

m+

t−1

n+

t

m+

t

V V |Ut| |Ut−1|

Word representation at t-1 User representation at t Observed co-

ccurrence of

words at t-1 Observed user-word pairs at t-1

p(Ut | Ut−1) ∝ N(Ut−1,α α α2

t−1I) · N(0,α

α α2

0 I)

p(Vt | Vt−1) ∝ N(Vt−1,β β β2

t−1I) · N(0,β

β β2

0 I)

User Diffusion Word Diffusion

SLIDE 12

12

Diffusion of user representation

According to Kalman filtering, we define the variance of transition kernel for a user embedding from t-1 to t

.

p(Ut | Ut−1) ∝ N(Ut−1,α α α2

t−1I) · N(0,α

α α2

0 I) Gaussian Prior

A
F
F

measuring the word distribution changes from previous time step t-1 to the current time step t for user u

SLIDE 13

13

Diffusion of word representation

According to Kalman filtering, we define the variance of transition kernel for a word embedding from t-1 to t

.

Gaussian Prior

A
F
F

measuring the word distribution changes from t-1 to the current time step t

p(Vt | Vt−1) ∝ N(Vt−1,β β β2

t−1I) · N(0,β

β β2

0 I)

SLIDE 14

14

DUWE model inference

Ò Apply the skip-gram filtering for the inference (Bamler et al. 2017)

and the variational inference algorithm to obtain the embeddings

Ò Posterior distribution over and conditional on the statistics

information and as follows: where we have:

skip-gram model for words skip-gram model for user and words model transition for users model transition for words

positive and negative indicator matrices for all word-to-word pairs positive and negative indicator matrices for all user-to-word pairs

SLIDE 15

15

Streaming Keyword Diversification Model

Ò generating top-K relevant and diversified keywords for

profiling users’ interests at time t.

SLIDE 16

16

Overview

Ò The Task Background and Related Work Ò Our Method

Ò Dynamic User and Word Embedding Model (DUWE) Ò Streaming Keyword Diversification Model (SKDM)

Ò Experiments Ò Conclusion

SLIDE 17

17

Experimental Setup

Ò Datasets

Ò 1,375 users randomly sampled from Twitter Ò 3.78 million tweets posted by the users from the beginning of their

registrations up to May 31, 2015

Ò Two types of Ground Truth: One for evaluating Relevance-oriented

(RGT) performance and another for evaluating Diversity-oriented (DGT) performance.

Ò Evaluation Metrics

Ò Relevance: Pre (Precision), NDCG, MRR, MAP Ò Their semantic version of the metrics, denoted as Pre-S, NDCG-S,

MRR-S, MAP-S

Ò Diversity: Pre-IA (Intent-Aware Precision), α-NDCG, MRR-IA, MAP-IA

SLIDE 18

18

Experimental Setup

Ò Baselines

Ò Non-dynamic Embedding Models

Ò

Skip-Gram Model, i.e., word2vec Model (SGM)

Ò

Distributed Representations of Documents (DRD)

Ò Dynamic Traditional Profiling Model

Ò

Predictive Language Model (PLM)

Ò Dynamic Topic Model

Ò

User Clustering Topic model (UCT)

Ò Dynamic Embedding Models

Ò

Dynamic Independent Skip-Gram model (DISG)

Ò

Dynamic Pre-initialized Skip-Gram model (DPSG)

Ò

Dynamic Independent Distributed Representations of documents (DIDR)

Ò

Dynamic Pre-initialized Distributed Representations of documents (DPDR)

SLIDE 19

19

Overall Performance

Ò Average relevance performance on time periods of each

month

SLIDE 20

20

Overall Performance

Ò Diversity performance on time periods of each month

SLIDE 21

An Example User’s Dynamic Profiling Results over Time

21

Top-6 keywords of an example user’s dynamic profile, whose interests cover a number of aspects and dramatically change over time, from Sport, fitness, kitchen, exercise, to education.

SLIDE 22

Relevance and diversity performance over time

22

Relevance performance over time Diversity performance over time

SLIDE 23

Performance w.r.t. embedding dimensionality

23

SLIDE 24

24

Overview

Ò The Task Background and Related Work Ò Our Method

Ò Dynamic User and Word Embedding Model (DUWE) Ò Streaming Keyword Diversification Model (SKDM)

Ò Experiments Ò Conclusion

SLIDE 25

25

Conclusions

Ò Study the problem of dynamic user profiling in Twitter Ò Propose a Dynamic User and Word Embedding model

(DUWE)

Ò Propose a Streaming Keyword Diversification Model

(SKDM)

Ò Evaluate the performance of the proposed models in real

dataset, Twitter

SLIDE 26

Dynamic Embeddings for User Profiling in Twitter

Shangsong Liang1, Xiangliang Zhang1, Zhaochun Ren2, Evangelos Kanoulas3

Overview

The Task

Input: A stream of tweets generated across the time Output: A set of keywords to profile the user at different point in time

The Task

Background of User Profiling Problem

a query with a sorted name list in a specific domain, ☛ uncovering associations between people and topics

(2007)

Need Dynamic User Profiling

Dynamic User Profiling Approaches

interests evolve (Fang and Godavarthy 2014)

Limitations of Current User Profiling Methods

harms performance

it difficult to measure the similarity between users and words when constructing the profile Can words and users be embedded in the same semantic space? Can their embedding be modeled in the dynamic environment?

Related Work in Dynamic Topic Models and Dynamic Embedding

Overview

Our Approach

same semantic space

embeddings

keywords can cover all aspects of the users’ interests

Dynamic User and Word Embedding

Diffusion of user representation

.

p(Ut | Ut−1) ∝ N(Ut−1,α α α2

α α2

Diffusion of word representation

.

p(Vt | Vt−1) ∝ N(Vt−1,β β β2

β β2

DUWE model inference

and the variational inference algorithm to obtain the embeddings

information and as follows: where we have:

Streaming Keyword Diversification Model

profiling users’ interests at time t.

Overview

Experimental Setup

registrations up to May 31, 2015

(RGT) performance and another for evaluating Diversity-oriented (DGT) performance.

MRR-S, MAP-S

Experimental Setup

Overall Performance

month

Overall Performance

An Example User’s Dynamic Profiling Results over Time

Relevance and diversity performance over time

Performance w.r.t. embedding dimensionality

Overview

Conclusions

(DUWE)

(SKDM)

dataset, Twitter

Thank you for your attention!

Our paper at

http://www.kdd.org/kdd2018/accepted-papers/view/dynamic- embeddings-for-user-profiling-in-twitter

Lab of Machine Intelligence and kNowledge Engineering (MINE): http://mine.kaust.edu.sa/