Network Economics -- Lecture 3: Incentives in Online Systems II: Robust Reputation Systems and Information Elicitation


  1. Network Economics -- Lecture 3: Incentives in online systems II: robust reputation systems and information elicitation. Patrick Loiseau, EURECOM, Fall 2016

  2. References • Main: – N. Nisan, T. Roughgarden, E. Tardos and V. Vazirani (Eds.), “Algorithmic Game Theory”, CUP 2007. Chapter 27. • Available online: http://www.cambridge.org/journals/nisan/downloads/Nisan_Non-printable.pdf • Additional: – Yiling Chen and Arpita Ghosh, “Social Computing and User Generated Content,” EC’13 tutorial • Slides at http://www.arpitaghosh.com/papers/ec13_tutorialSCUGC.pdf and http://yiling.seas.harvard.edu/wp-content/uploads/SCUGC_tutorial_2013_Chen.pdf – M. Chiang, “Networked Life: 20 Questions and Answers”, CUP 2012. Chapters 3-5. • See the videos on www.coursera.org

  3. Outline 1. Introduction 2. Eliciting effort and honest feedback 3. Reputation based on transitive trust

  4. Outline 1. Introduction 2. Eliciting effort and honest feedback 3. Reputation based on transitive trust

  5. Importance of reputation systems • The Internet enables interactions between entities • The benefit depends on the entities’ ability and reliability • Revealing the history of previous interactions: – informs on abilities – deters moral hazard • Reputation: numerical summary of previous interaction records – across users – can be weighted by reputation (transitivity of trust) – across time

  6. Reputation systems operation (figure)

  7. Attacks on reputation systems • Whitewashing: misbehave, then re-enter as a fresh identity • Incorrect feedback: report dishonest ratings • Sybil attack: create multiple fake identities

  8. A simplistic model • Prisoner’s dilemma again! Payoffs (row player, column player):
            C        D
      C    1, 1    -1, 2
      D    2, -1    0, 0
  • One shot: (D, D) dominant • Infinitely repeated: discount factor δ

  9. Equilibrium with 2 players • Grim = cooperate unless the other player defected in the previous round • (Grim, Grim) is a subgame perfect Nash equilibrium if δ ≥ 1/2 – We only need to consider single deviations • → If users do not value the future enough, they do not cooperate
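
  The threshold can be checked in one line from the payoff matrix of slide 8 (standard repeated-game arithmetic; the annotations are mine):

```latex
% Grim vs. Grim: cooperate forever, or deviate once and be punished forever
\underbrace{1 + \delta + \delta^2 + \cdots}_{\text{cooperate}\;=\;1/(1-\delta)}
\;\ge\;
\underbrace{2 + 0\cdot\delta + 0\cdot\delta^2 + \cdots}_{\text{deviate}\;=\;2}
\quad\Longleftrightarrow\quad \delta \ge \tfrac{1}{2}
```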

  10. Game with N+1 players (N odd) • Each round: players are paired randomly • With reputation (reputation-grim): agents begin with a good reputation and keep it as long as they play C against players with a good reputation and D against those with a bad one – SPNE if δ ≥ 1/2 • Without reputation (personalized-grim): each agent keeps track of previous interactions with the same partner – SPNE if δ ≥ 1 - 1/(2N)
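
  To see how the two thresholds compare, plug in a value: for N = 10, personalized-grim requires δ ≥ 1 - 1/20 = 0.95, while reputation-grim still only requires δ ≥ 1/2. As N grows, the no-reputation threshold 1 - 1/(2N) tends to 1, so a shared reputation system substitutes for repeated personal interaction.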

  11. Whitewashing • Play D and come back as a new user! • Possible to avoid this with an entry fee f (intuitively, whitewashing stops paying once f offsets the one-shot gain from defecting)

  12. Outline 1. Introduction 2. Eliciting effort and honest feedback 3. Reputation based on transitive trust

  13. Different settings • How to enforce honest reporting of interaction experience? 1. Objective information is publicly revealed: can simply compare the report to the real outcome – e.g., weather prediction 2. No objective outcome is available – e.g., product quality: not objective – e.g., product breakdown frequency: objective but not revealed

  14. The Brier scoring rule • The expert has belief q: – sunny with probability q, rainy with probability 1-q • He announces a prediction p (probability of sunny) • How to incentivize an honest prediction? – Give him a “score”: • S(p, sunny) = 1 - (1-p)^2 • S(p, rainy) = 1 - p^2 • Expected score S(p, q) = q S(p, sunny) + (1-q) S(p, rainy) = 1 - q + q^2 - (p-q)^2 – Maximized at p = q
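
  A quick numerical check of the slide (a minimal sketch; the belief q = 0.7 and the grid search are illustrative choices of mine, not part of the lecture):

```python
import numpy as np

def brier_score(p, sunny):
    """Brier score S(p, outcome): p is the announced probability of sunny."""
    return 1 - (1 - p) ** 2 if sunny else 1 - p ** 2

def expected_score(p, q):
    """Expected score when the true belief is q and the announcement is p."""
    return q * brier_score(p, True) + (1 - q) * brier_score(p, False)

q = 0.7                       # expert's true belief (my example value)
grid = np.linspace(0, 1, 1001)
best_p = grid[np.argmax([expected_score(p, q) for p in grid])]
print(best_p)                 # 0.7: honest reporting maximizes the expected score
```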

  15. Proper scoring rules • Definition: a scoring rule is proper if S(q, q) ≥ S(p, q) for all p • It is strictly proper if the inequality is strict for all p ≠ q • The Brier rule is strictly proper • Another strictly proper scoring rule: – the logarithmic rule S(p, state) = log p_state
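
  Strict properness of the logarithmic rule is Gibbs’ inequality (equivalently, non-negativity of the KL divergence):

```latex
S(q,q) - S(p,q) \;=\; \sum_{s} q_s \log q_s - \sum_{s} q_s \log p_s
\;=\; D_{\mathrm{KL}}(q \,\|\, p) \;>\; 0 \quad \text{whenever } p \neq q .
```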

  16. Different settings • How to enforce honest reporting of interaction experience? 1. Objective information is publicly revealed: can simply compare the report to the real outcome – e.g., weather prediction 2. No objective outcome is available – e.g., product quality: not objective – e.g., product breakdown frequency: objective but not revealed

  17. Rewarding agreement with peers • Rewarding mere agreement is not good: if a good outcome is likely (e.g., because the seller is highly rated), a customer will not report a bad experience → peer-prediction method – Use the report to update a reference distribution of ratings (the prior distribution) – Reward based on comparing the implied probabilities of the reference rating with the reference rater’s actual report

  18. Model • A product of given quality (called its type) is observed with errors • Each rater sends feedback to a central processing center • The center computes rewards based exclusively on the raters’ reports (no independent information)

  19. Model (2) • Finite number of types t = 1, …, T • Commonly known prior Pr_0 • Set of raters I – each gets a ‘signal’ – S = {s_1, …, s_M}: set of signals – S_i: signal received by rater i, distributed as f(·|t)

  20. Example • Two types: H (high) and L (low) – Pr_0(H) = 0.5, Pr_0(L) = 0.5 • Two possible signals: h or l • f(h|H) = 0.85, f(l|H) = 0.15, f(h|L) = 0.45, f(l|L) = 0.55 – Pr(h) = 0.65, Pr(l) = 0.35
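
  The marginals on the slide follow from the prior and the signal model by Bayes’ rule; the sketch below also computes the distribution of a peer’s signal given one’s own, which the peer-prediction method uses later (the function names are mine):

```python
# Prior over types and signal model f(signal | type), from slide 20
prior = {"H": 0.5, "L": 0.5}
f = {("h", "H"): 0.85, ("l", "H"): 0.15, ("h", "L"): 0.45, ("l", "L"): 0.55}

def marginal(sig):
    """Pr(signal) = sum_t f(signal | t) * Pr_0(t)."""
    return sum(f[(sig, t)] * prior[t] for t in prior)

def posterior_type(t, sig):
    """Pr(type | observed signal), by Bayes' rule."""
    return f[(sig, t)] * prior[t] / marginal(sig)

def peer_signal(sig_j, sig_i):
    """Pr(a peer sees sig_j | my signal is sig_i), integrating over the type."""
    return sum(f[(sig_j, t)] * posterior_type(t, sig_i) for t in prior)

print(marginal("h"), marginal("l"))   # 0.65, 0.35 as on the slide
print(peer_signal("h", "h"))          # ~0.712: seeing h makes a peer's h more likely
print(peer_signal("h", "l"))          # ~0.536
```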

  21. Game • Rewards/others’ ratings are revealed only after all reports from all raters are received • → simultaneous game • x_i: i’s report; x = (x_1, …, x_I): vector of announcements • x_i^m: i’s report if his signal is s_m • i’s strategy: the vector of reports (x_i^1, …, x_i^M) • τ_i(x): payment to i if the vector of announcements is x

  22. Best response • Best response: given the others’ strategies, rater i with signal s_m announces the report a that maximizes his expected payment Σ_{x_-i} Pr(x_-i | S_i = s_m) τ_i(a, x_-i) • Truthful revelation is a Nash equilibrium if this holds for all i when x_i^m = s_m

  23. Example (figure)

  24. Scoring rules • How to assign points to rater i based on his report and that of j? • Def: a scoring rule is a function that, for each possible announcement, assigns a score to each possible value s in S • We cannot access s_j, but in a truthful equilibrium we can use j’s report instead • Def: a scoring rule is strictly proper if the rater maximizes his expected score by announcing his true belief

  25. Logarithmic scoring rule • Ask for a belief on the probability of an event • A strictly proper scoring rule is the logarithmic scoring rule: pay a user the log of the probability that he assigned to the event that actually occurred (log p ≤ 0, so it acts as a penalty that shrinks as that probability grows)

  26. Peer-prediction method • Choose a reference rater r(i) • The outcome to be predicted is x_r(i) • Player i does not report a distribution, only his signal – the distribution is inferred from the prior • Result: for any mapping r, truthful reporting is a Nash equilibrium under the logarithmic scoring rule
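
  A self-contained numerical sanity check of this result on the H/L example of slide 20 (a sketch under the stated model; the reference rater is assumed truthful, and the helper is re-derived here so the block runs on its own):

```python
import math

prior = {"H": 0.5, "L": 0.5}
f = {("h", "H"): 0.85, ("l", "H"): 0.15, ("h", "L"): 0.45, ("l", "L"): 0.55}

def peer_signal(s_j, s_i):
    """Pr(reference rater sees s_j | I saw s_i), via the posterior over the type."""
    post = {t: f[(s_i, t)] * prior[t] for t in prior}
    z = sum(post.values())
    return sum(f[(s_j, t)] * post[t] / z for t in prior)

def expected_log_score(report, true_signal):
    """E[log Pr(x_r | report)] under my true posterior, r reporting truthfully."""
    return sum(peer_signal(s, true_signal) * math.log(peer_signal(s, report))
               for s in ("h", "l"))

for sig in ("h", "l"):
    print(sig, {rep: round(expected_log_score(rep, sig), 4) for rep in ("h", "l")})
# For both signals the truthful report maximizes the expected log score,
# so truth-telling is a best response to truth-telling: a Nash equilibrium.
```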

  27. Proof (figure)

  28. Example (figure)

  29. Remarks • Two other equilibria: always report h, always report l – less likely • See other applications of Bayesian estimation to Amazon reviews in M. Chiang, “Networked Life: 20 Questions and Answers”, CUP 2012, Chapter 5

  30. Outline 1. Introduction 2. Eliciting effort and honest feedback 3. Reputation based on transitive trust

  31. Transitive trust approach • Assign trust values to agents by aggregating the local trust reported by others • t(i, j): trust that i reports on j • The reports form a weighted directed graph • A reputation function F maps the graph to reputation values • The values determine a ranking of the vertices

  32. Example: PageRank (figure)
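
  The slide itself is a figure; as a concrete illustration, here is a minimal power-iteration PageRank over a directed trust graph (the damping factor 0.85 and the toy graph are my assumptions, not from the lecture):

```python
def pagerank(edges, n, damping=0.85, iters=100):
    """Power iteration on a directed trust graph given as (i, j) pairs: i trusts j."""
    out = [[] for _ in range(n)]
    for i, j in edges:
        out[i].append(j)
    rank = [1.0 / n] * n
    for _ in range(iters):
        new = [(1 - damping) / n] * n
        for i in range(n):
            if out[i]:                      # spread rank along outgoing trust edges
                share = damping * rank[i] / len(out[i])
                for j in out[i]:
                    new[j] += share
            else:                           # dangling node: spread rank uniformly
                for j in range(n):
                    new[j] += damping * rank[i] / n
        rank = new
    return rank

print(pagerank([(0, 1), (1, 2), (2, 0), (2, 1)], 3))  # node 1 gets trust from 0 and 2
```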

  33. Example 2: max-flow algorithm (figure)
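
  Also a figure slide. The idea is that v’s reputation is the maximum flow from a trusted source s to v, with the trust values t(i, j) as edge capacities; below is a compact Edmonds-Karp sketch of that computation (my own illustration, not code from the lecture):

```python
from collections import deque

def max_flow(cap, s, t):
    """Edmonds-Karp on a capacity dict {(u, v): capacity}; returns max s->t flow."""
    nodes = {u for e in cap for u in e}
    flow, residual = 0, dict(cap)
    while True:
        parent = {s: None}                  # BFS for an augmenting path
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v in nodes:
                if v not in parent and residual.get((u, v), 0) > 0:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return flow                     # no augmenting path left
        path, v = [], t                     # reconstruct the path, find bottleneck
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        bottleneck = min(residual[(u, w)] for u, w in path)
        for u, w in path:                   # augment along the path
            residual[(u, w)] -= bottleneck
            residual[(w, u)] = residual.get((w, u), 0) + bottleneck
        flow += bottleneck

trust = {("s", "a"): 3, ("a", "v"): 2, ("s", "v"): 1}
print(max_flow(trust, "s", "v"))  # 3: v's max-flow reputation from source s
```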

  34. Slide in case you are ignorant about the max-flow min-cut theorem (the maximum s-t flow equals the minimum total capacity of an s-t cut)

  35. Example 3: the PathRank algorithm (figure)
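
  The figure defining PathRank is not recoverable here. Going only by the name and by the properties claimed on the following slides, a plausible sketch is to rank nodes by their shortest-path distance from a trusted source; treat this as a hypothetical reading, not the lecture’s definition (see the AGT chapter for the exact algorithm):

```python
import heapq

def pathrank(weights, source):
    """Dijkstra from the trusted source; rank nodes by increasing distance.

    Hypothetical reading of PathRank: reputation decreases with the
    (cost-weighted) shortest-path distance from the source.
    """
    nodes = {u for e in weights for u in e}
    dist = {u: float("inf") for u in nodes}
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue                        # stale heap entry
        for (a, b), w in weights.items():
            if a == u and d + w < dist[b]:
                dist[b] = d + w
                heapq.heappush(heap, (dist[b], b))
    return sorted(nodes, key=lambda u: dist[u])

# Edge weights here are costs (lower = more trusted), a modeling assumption
trust_cost = {("s", "a"): 1.0, ("a", "v"): 1.0, ("s", "v"): 3.0}
print(pathrank(trust_cost, "s"))  # ['s', 'a', 'v']
```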

  36. Definitions • Monotonic: adding an incoming edge to v never reduces the ranking of v – PageRank, max-flow, PathRank • Symmetric: the reputation function F commutes with permutations of the nodes (relabeling the vertices just relabels the reputations) – PageRank – not max-flow, not PathRank

  37. Incentives for honest reporting • Incentive issue: an agent may improve his ranking by incorrectly reporting his trust of other agents • Definition: a reputation function F is rank-strategyproof if, for every graph G, no agent v can improve his ranking by strategic rating of others • Result: no monotonic reputation system that is symmetric can be rank-strategyproof – PageRank is not – but PathRank is

  38. Robustness to sybil attacks • Suppose a node can create several nodes and divide its incoming trust in any way that preserves the total incoming trust • Definitions (informally): – sybil strategy: replace node v by a set of sybil nodes, splitting v’s incoming trust among them – value-sybilproof: no sybil strategy can increase the best reputation value among the sybils – rank-sybilproof: no sybil strategy can improve the best rank among the sybils

  39. Robustness to sybil attacks: results • Theorem: there is no symmetric rank-sybilproof function • Theorem (stronger): there is no symmetric rank-sybilproof function even if sybil strategies are limited to adding only one extra node • → PageRank is not rank-sybilproof

  40. Robustness to sybil attacks: results (2) • Theorem: the max-flow based ranking algorithm is value-sybilproof – but it is not rank-sybilproof • Theorem: the PathRank based ranking algorithm is both value-sybilproof and rank-sybilproof
