A Bounded Confidence Approach to Understand User Participation in - - PowerPoint PPT Presentation

a bounded confidence approach to understand user
SMART_READER_LITE
LIVE PREVIEW

A Bounded Confidence Approach to Understand User Participation in - - PowerPoint PPT Presentation

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia University of Lugano Third Intl Conference on Social Informatics (SocInfo 2011) Singapore, October 8, 2011. Motivation


slide-1
SLIDE 1

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems

Giovanni Luca Ciampaglia

University of Lugano

Third Intl Conference on Social Informatics (SocInfo 2011) Singapore, October 8, 2011.

slide-2
SLIDE 2

Motivation Background Peer production model Results

Social Informatics

One key idea of social informatics research is that the “social context” of information technology development plays a significant role in influencing the ways that people use information and technologies, and thus influences their consequences for work, organizations, and other social relationships. (Kling 1999)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-3
SLIDE 3

Motivation Background Peer production model Results

Social computing

◮ Commons-based peer production (Benkler 2002) ◮ 0-th law of Wikipedia: “The problem with Wikipedia is that

it only works in practice. In theory, it can never work.”1

◮ Example: user participation to peer production.

1http://en.wikipedia.org/wiki/User:Raul654/Raul’s_laws A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-4
SLIDE 4

Motivation Background Peer production model Results

Social computing

◮ Commons-based peer production (Benkler 2002) ◮ 0-th law of Wikipedia: “The problem with Wikipedia is that

it only works in practice. In theory, it can never work.”1

◮ Example: user participation to peer production.

1http://en.wikipedia.org/wiki/User:Raul654/Raul’s_laws A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-5
SLIDE 5

Motivation Background Peer production model Results

Social computing

◮ Commons-based peer production (Benkler 2002) ◮ 0-th law of Wikipedia: “The problem with Wikipedia is that

it only works in practice. In theory, it can never work.”1

◮ Example: user participation to peer production.

1http://en.wikipedia.org/wiki/User:Raul654/Raul’s_laws A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-6
SLIDE 6

Motivation Background Peer production model Results

An example: Wikipedia’s NPOV policy

A short course of wiki writing:2

◮ “Abortion is wrong” – Wrong! ◮ “The pro-life movement holds that abortion

is wrong, or occasionally that it is only justified in certain special cases” – Right!

2http://en.wikipedia.org/wiki/Wikipedia:NPOV/Examples A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-7
SLIDE 7

Motivation Background Peer production model Results

An example: Wikipedia’s NPOV policy

A short course of wiki writing:2

◮ “Abortion is wrong” – Wrong! ◮ “The pro-life movement holds that abortion

is wrong, or occasionally that it is only justified in certain special cases” – Right!

2http://en.wikipedia.org/wiki/Wikipedia:NPOV/Examples A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-8
SLIDE 8

Motivation Background Peer production model Results

An example: Wikipedia’s NPOV policy

A short course of wiki writing:2

◮ “Abortion is wrong” – Wrong! ◮ “The pro-life movement holds that abortion

is wrong, or occasionally that it is only justified in certain special cases” – Right!

2http://en.wikipedia.org/wiki/Wikipedia:NPOV/Examples A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-9
SLIDE 9

Motivation Background Peer production model Results

An example: Wikipedia’s NPOV policy

A short course of wiki writing:2

◮ “Abortion is wrong” – Wrong! ◮ “The pro-life movement holds that abortion

is wrong, or occasionally that it is only justified in certain special cases” – Right!

2http://en.wikipedia.org/wiki/Wikipedia:NPOV/Examples A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-10
SLIDE 10

Motivation Background Peer production model Results

An example: Wikipedia’s NPOV policy

A short course of wiki writing:2

◮ “Abortion is wrong” – Wrong! ◮ “The pro-life movement holds that abortion

is wrong, or occasionally that it is only justified in certain special cases” – Right!

2http://en.wikipedia.org/wiki/Wikipedia:NPOV/Examples A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-11
SLIDE 11

Motivation Background Peer production model Results

User participation and norm adoption

◮ Groups are characterized by norms ◮ Norms: approved behaviors to

follows, implicit knowledge, shared beliefs

◮ Participation to peer production

community requires certain norms to be adopted

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-12
SLIDE 12

Motivation Background Peer production model Results

User participation and norm adoption

◮ Groups are characterized by norms ◮ Norms: approved behaviors to

follows, implicit knowledge, shared beliefs

◮ Participation to peer production

community requires certain norms to be adopted

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-13
SLIDE 13

Motivation Background Peer production model Results

User participation and norm adoption

◮ Groups are characterized by norms ◮ Norms: approved behaviors to

follows, implicit knowledge, shared beliefs

◮ Participation to peer production

community requires certain norms to be adopted

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-14
SLIDE 14

Motivation Background Peer production model Results

A theory of norm adoption

◮ Social judgement Theory (Sherif and Hovland 1961) ◮ Self-categorization Theory (Turner 1989) ◮ Formalization: bounded confidence (BC) principle ◮ If ||x (t) − y (t) || < ε:

x (t + 1) = x (t) + µ

  • y (t) − x (t)
  • y (t + 1) = y (t) + µ
  • x (t) − y (t)
  • (1)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-15
SLIDE 15

Motivation Background Peer production model Results

Norm adoption and coordination

◮ Models of group

formation

(continuous opinion dynamics under bounded confidence)

◮ Group Consensus,

polarization

◮ NOT tested

empirically

(Deffuant et al. 2001)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-16
SLIDE 16

Motivation Background Peer production model Results

Norm adoption and coordination

◮ Models of group

formation

(continuous opinion dynamics under bounded confidence)

◮ Group Consensus,

polarization

◮ NOT tested

empirically

(Deffuant et al. 2001)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-17
SLIDE 17

Motivation Background Peer production model Results

Norm adoption and coordination

◮ Models of group

formation

(continuous opinion dynamics under bounded confidence)

◮ Group Consensus,

polarization

◮ NOT tested

empirically

(Deffuant et al. 2001)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-18
SLIDE 18

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-19
SLIDE 19

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-20
SLIDE 20

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-21
SLIDE 21

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-22
SLIDE 22

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-23
SLIDE 23

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-24
SLIDE 24

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-25
SLIDE 25

Motivation Background Peer production model Results

Main ingredients of the model

◮ Dynamic population of users (arrival rate λu) ◮ New pages are created (creation rate λp) ◮ Users interact (interaction rate λe) with pages using BC

rule

◮ Users have initial motivation c ◮ r (t) = s(t)+c

n(t)+c fraction of “successful” edits

◮ If attitude change: r (t) ← r (t) + 1

◮ Probability to abandon at time t:

λd (t) = r (t) τ0 + 1 − r (t) τ1 (2)

◮ Time scales: τ0 > τ1 (usually by orders of magnitude)

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-26
SLIDE 26

Motivation Background Peer production model Results

Norm adoption dynamics

5000 10000 15000 20000

time

0.0 0.2 0.4 0.6 0.8 1.0

  • pinion

ε = 0.1

5000 10000 15000 20000

time

0.0 0.2 0.4 0.6 0.8 1.0

  • pinion

ε = 0.125

5000 10000 15000 20000

time

0.0 0.2 0.4 0.6 0.8 1.0

  • pinion

ε = 0.15

5000 10000 15000 20000

time

0.0 0.2 0.4 0.6 0.8 1.0

  • pinion

ε = 0.175

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-27
SLIDE 27

Motivation Background Peer production model Results

Analysis of the model

◮ Question: what are the most important parameters?

(factor screening)

◮ Solution: sensitivity analysis

parameters → model → average lifetime

◮ Decomposition of response variance:

◮ Main interaction: fraction of variance accounted by one

parameter only.

◮ Total interaction: fraction of variance accounted by a

parameters in conjunction with others.

◮ Computational issue: using surrogate model instead of

simulator.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-28
SLIDE 28

Motivation Background Peer production model Results

Analysis of the model

◮ Question: what are the most important parameters?

(factor screening)

◮ Solution: sensitivity analysis

parameters → model → average lifetime

◮ Decomposition of response variance:

◮ Main interaction: fraction of variance accounted by one

parameter only.

◮ Total interaction: fraction of variance accounted by a

parameters in conjunction with others.

◮ Computational issue: using surrogate model instead of

simulator.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-29
SLIDE 29

Motivation Background Peer production model Results

Analysis of the model

◮ Question: what are the most important parameters?

(factor screening)

◮ Solution: sensitivity analysis

parameters → model → average lifetime

◮ Decomposition of response variance:

◮ Main interaction: fraction of variance accounted by one

parameter only.

◮ Total interaction: fraction of variance accounted by a

parameters in conjunction with others.

◮ Computational issue: using surrogate model instead of

simulator.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-30
SLIDE 30

Motivation Background Peer production model Results

Analysis of the model

◮ Question: what are the most important parameters?

(factor screening)

◮ Solution: sensitivity analysis

parameters → model → average lifetime

◮ Decomposition of response variance:

◮ Main interaction: fraction of variance accounted by one

parameter only.

◮ Total interaction: fraction of variance accounted by a

parameters in conjunction with others.

◮ Computational issue: using surrogate model instead of

simulator.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-31
SLIDE 31

Motivation Background Peer production model Results

Analysis of the model

◮ Question: what are the most important parameters?

(factor screening)

◮ Solution: sensitivity analysis

parameters → model → average lifetime

◮ Decomposition of response variance:

◮ Main interaction: fraction of variance accounted by one

parameter only.

◮ Total interaction: fraction of variance accounted by a

parameters in conjunction with others.

◮ Computational issue: using surrogate model instead of

simulator.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-32
SLIDE 32

Motivation Background Peer production model Results

Analysis of the model

◮ Question: what are the most important parameters?

(factor screening)

◮ Solution: sensitivity analysis

parameters → model → average lifetime

◮ Decomposition of response variance:

◮ Main interaction: fraction of variance accounted by one

parameter only.

◮ Total interaction: fraction of variance accounted by a

parameters in conjunction with others.

◮ Computational issue: using surrogate model instead of

simulator.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-33
SLIDE 33

Motivation Background Peer production model Results

Main interaction plot

◮ Gaussian Process based on Latin hypercube design with 64 points ◮ Decomposition of variance computed with winding stairs method, 10000

samples

0.0 0.2 0.4 0.6 0.8 1.0

parameter scaled value

20 40 60 80 100

main effect

daily edits daily users daily pages confidence speed const succ const pop rollback prob short life long life

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-34
SLIDE 34

Motivation Background Peer production model Results

2-way interaction effects

long life

20 30 40 50 60 70 80 90

confidence

0.1 0.2 0.3 0.4

lifetime

10 20 30 40 50 60 70 80

short life

0.2 0.4 0.6 0.8

confidence

0.1 0.2 0.3 0.4

lifetime

10 20 30 40 50 60 70

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-35
SLIDE 35

Motivation Background Peer production model Results

User activity lifespan

◮ User lifespan in blogs is exponential (Leskovec et al., 2007) ◮ Bimodal distr. in blogs (Guo et al., 2009) ◮ Wikipedia: mixture of lognormals (Ciampaglia and Vancheri 2010)

−10 −5 5 10 log (τ) (log-days) 20000 40000 60000 80000 100000

  • freq. (users)

enwiki.csv 15 10 5 5 10

log-lifetime u =ln(τ) (log-days)

0.0 0.2 0.4 0.6 0.8 1.0

Pr

  • U <u
  • English Wikipedia, data from August 2010

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-36
SLIDE 36

Motivation Background Peer production model Results

Follow-up work

◮ Computational model fitting using indirect inference. ◮ Need to introduce Poissonian cascade model.

−10 −5 5 τ (log-days) 0.00 0.02 0.04 0.06 0.08 0.10 density

ptwiki

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-37
SLIDE 37

Motivation Background Peer production model Results

Conclusions

  • 1. Confidence (i.e. tolerance to attitude change ε) most

important parameter in explaining user participation

  • 2. Other notable factors (e.g. initial motivation c) not

important

  • 3. New methodology for agent-based modeling, based on

empirical data

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-38
SLIDE 38

Motivation Background Peer production model Results

References

◮ Kling 1999. What is social informatics and why does it matter? D-Lib

  • Mag. 5 (1).

http://www.dlib.org/dlib/january99/kling/01kling.html ◮ Benkler 2002. Coase’s penguin: Linux and the nature of the firm. Yale

Law J. 112 369–446.

◮ Deffuant et al. 2001. Mixing beliefs among interaction agents. Adv.

  • Comp. Sys. 3, 87–98.

◮ Sherif and Hovland 1961. Assimilation and constrast effects in

communication and attitude change. Yale University Press.

◮ Turner 1989. Rediscovering the social group: a self-categorization

  • theory. Blackwell Publishers.

◮ Leskovec et al. 2007. Microscopic evolution of social networks. Proc. of

KDD ’08.

◮ Guo et al. 2009. Analyzing patterns of user content generation in online

social networks. Proc. of KDD ’09.

◮ Ciampaglia and Vancheri 2010. Empirical analysis of user participation

in online communities: the case of Wikipedia. Proc. of ICWSM 2010.

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia

slide-39
SLIDE 39

Motivation Background Peer production model Results

Questions?

A Bounded Confidence Approach to Understand User Participation in Peer Production Systems Giovanni Luca Ciampaglia