Detection of latent roles in online forums Luchon- July 1, 2014 - - PowerPoint PPT Presentation

detection of latent roles in online forums
SMART_READER_LITE
LIVE PREVIEW

Detection of latent roles in online forums Luchon- July 1, 2014 - - PowerPoint PPT Presentation

Detection of latent roles in online forums Luchon- July 1, 2014 Alberto Lumbreras, Sup: Jouve B., Velcin J. Roles in discusion threads Task: detect roles Definition: role as archetypical behavior or social function. 2 Different roles,


slide-1
SLIDE 1

Detection of latent roles in online forums

Luchon- July 1, 2014 Alberto Lumbreras, Sup: Jouve B., Velcin J.

slide-2
SLIDE 2

Roles in discusion threads

Task: detect roles Definition: role as archetypical behavior or social function.

2

slide-3
SLIDE 3

Different roles, different definitions.

3

slide-4
SLIDE 4

Sociology/Antropology

attributes: strategies of speech. technique: ethnology, observational study. Identified roles: Celebrity, Newbie, Lurker, Flamer, Troll, Ranter. [1] S. Golder and J. Donath, “Social roles in electronic communities,” Internet Res.,

  • vol. 5, 2004.

4

slide-5
SLIDE 5

Similar attributes

attributes: in-deg, out-deg, %init, %posts replied, % bi-dir neighs,... technique: clustering. Identified roles: Joining conversationalists, Popular initiators, Taciturns, Supporters, Elitists, Popular participants, Grunts, Ignored. [2] J. Chan, C. Hayes, and E. Daly, “Decomposing discussion forums using common user roles,” in Proceedings of the WebSci10: Extending the Frontiers of Society On-Line, 2010.

5

slide-6
SLIDE 6

Similar relationships

attributes: sociomatrix (matrix of relations) technique: blockmodeling. Identified roles: Centre-periphery, hierarchies, horizontal structures, ghettos...

Figure: Kemp, C., Griffiths, T. & Tenenbaum, J., 2004. Discovering latent classes in relational data.

[1] H. White, S. Boorman, and R. Breiger, “Social structure from multiple networks.

  • I. Blockmodels of roles and positions,” Am. J. Sociol., 1976.

[2] K. Nowicki and T. A. B. Snijders, “Estimation and prediction for stochastic blockstructures,” J. Am. Stat. Assoc, 2001.

6

slide-7
SLIDE 7

Similar relationships

Example

7

slide-8
SLIDE 8

Role as similar behavior

Idea: if you hold role r, you behave like the archetype r plus some noise. bu = ru + ǫu (1) (toy example) bu ∼ N(ru, ǫu) (2)

8

slide-9
SLIDE 9

Intuition

9

slide-10
SLIDE 10

Bayesian framework

Bayesian probability: P(θ|Y )

posterior

=

joint probability

  • P(Y , θ)
  • θ P(Y , θ) =

likelihood

P(Y |θ)

prior

P(θ)

  • θ P(Y |θ)P(θ) ∝

likelihood

P(Y |θ)

prior

P(θ) (3) BAYESIAN BONUS: we can make predictions (and therefore validate our model). P(y|yt−1, θ) (4)

10

slide-11
SLIDE 11

Mixture models

A generative story: behavioru|roleu, θrole ∼ F(behavior|roleu, θrole) (5) θrole|β ∼ G(β) (6) roleu ∼ Discrete(P(role1), ..., P(roleK )) (7) P(role1), ..., P(roleK )|α ∼ Dirichlet(α) (8) (intuition: imagine F is a Normal distribution, role is the mean µ, and behavior is the

  • bservation y)

Probability of everything: P(b, r, π, θ) = P(π|α)

  • U

P(ru|π)

  • K

P(θr|β)

  • U

P(bu|ru, θru ) (9) Intractable: Marginal probability of r: P(r) =

  • b
  • π
  • θ

P(b, r, π, θ) (10) Solution: Gibbs sampling:

  • 1. Loop:

(11) ru ∼ P(ru|r−u, θ, b) (12) θk ∼ P(θk|θ−k, r, b) (13) π ∼ P(π|θ, r, b) (14)

  • 2. Histogram ru

(15)

11

slide-12
SLIDE 12

Behaviors

Triads in which user is seen. Cascades after user participation. Leskovec et al, “Cascading Behavior in Large Blog Graphs Patterns and a model.” Preference function (patterns of choices). etc.

12

slide-13
SLIDE 13

Remarks

Mixture models as natural framework to group fuzzy behaviors. Flexibility in what behaviors to study. (structural, text, dynamics...) The main issue: inference (sampling) Machine Learning: Non-parametric model (let the data speak) Efficient sampling methods (parallel, hamiltonian monte carlo...)

13

slide-14
SLIDE 14

Thanks!

14