eliciting informative feedback the peer prediction method
play

Eliciting Informative Feedback: The Peer-Prediction Method Colin - PowerPoint PPT Presentation

Eliciting Informative Feedback: The Peer-Prediction Method Colin Zheng and Kenneth Wang 1 The Setting Camera Quality: dist over {, } 2 ~ 1 ~ : dist over , jointly Rater Rater w/ 1 2 Report


  1. Eliciting Informative Feedback: The Peer-Prediction Method Colin Zheng and Kenneth Wang 1

  2. The Setting Camera Quality: dist 𝐸 over {𝑀, 𝐼} 𝑑 2 ~𝑇 𝑑 1 ~𝑇 𝑇 : dist over π‘š, β„Ž jointly Rater Rater w/ 𝐸 1 2 Report Report 𝑠 1 𝑠 2 Center: - Ebay - NetFlix - Amazon 2 2

  3. The Setting Camera Quality: dist 𝐸 over {𝑀, 𝐼} Task: 𝑑 2 ~𝑇 𝑑 1 ~𝑇 𝑇 : dist over How to make rational π‘š, β„Ž jointly raters report honestly Rater Rater w/ 𝐸 ( 𝑠 𝑗 = 𝑑 𝑗 ) 1 2 Naive Attempt: reward 𝜐 = $1 if 𝑠 1 = 𝑠 2 , 𝜐 = $0 otherwise Report Report Problem: rater 1 will report 𝑠 1 𝑠 2 β€œmore likely signal” of rater 2 Center: - Ebay - NetFlix - Amazon 3 3

  4. The Setting Camera Quality: dist 𝐸 over {𝑀, 𝐼} Task (formulation): Reporting 𝑠 𝑗 = 𝑑 𝑗 is a 𝑑 2 ~𝑇 𝑑 1 ~𝑇 𝑇 : dist over Nash Equilibrium π‘š, β„Ž jointly Rater Rater w/ 𝐸 1 2 βˆ€ rater 𝑗 , signal 𝑛 , 𝑦 β‰  𝑛 𝔽 𝑑 2 𝜐 1 𝑑 1 , 𝑑 2 𝑑 1 = 𝑛 β‰₯ 𝔽 𝑑 2 𝜐 1 𝑦, 𝑑 2 𝑑 1 = 𝑛 Report Report 𝑠 1 𝑠 2 𝔽 𝑑 1 𝜐 2 𝑑 2 , 𝑑 1 𝑑 2 = 𝑛 Center: β‰₯ 𝔽 𝑑 1 𝜐 2 𝑦, 𝑑 1 𝑑 2 = 𝑛 - Ebay - NetFlix - Amazon 4 4

  5. Solution using Proper Scoring Rule Camera Quality: Task: dist 𝐸 over {𝑀, 𝐼} choose payoff 𝜐 1 such that 𝔽 𝑑 2 𝜐 1 𝑦, 𝑑 2 𝑑 1 = 𝑛 𝑑 2 ~𝑇 𝑑 1 ~𝑇 is maximized at 𝑦 = 𝑛 𝑇 : dist over π‘š, β„Ž jointly Rater Rater w/ 𝐸 1 2 Equivalently, 𝔽 𝑨~π‘Ž 𝜐 1 𝑦, 𝑨 where π‘Ž = 𝑑 2 𝑑 1 =𝑛 Report Report 𝑠 1 𝑠 2 Recall: 𝑔 is Proper Scoring Rule if 𝔽 𝑨~π‘Ž [𝑔(𝑄, 𝑨)] is maximized at Center: 𝑄 = π‘Ž - Ebay - NetFlix So let 𝜐 1 𝑦, 𝑨 = 𝑔(𝑑 2 𝑑 1 =𝑦 , 𝑨) - Amazon 5 5

  6. Issue: costs to the rater In reality,  Evaluating and reporting honestly incur a relative cost 𝑑 > 0 (e.g. testing is time consuming, opportunity cost, ...)  Can be offset by scaling up the payoffs  Problems: – Paying too much for truthful information? – What if 𝑑 is unknown? [PRGJ08] 6 6

  7. Issue: costs to the center To reduce center’s cost:  Budget balancing: – pair up raters, make each pair a zero-sum game  Use linear optimization to find cost-optimal payoff function [JF06] 7 7

  8. Issue: risk aversion  When the center knows the raters’ utility function(s)  When the center does not know the utility function(s)  Using multiple reference raters reduces risk 8 8

  9. Issue: collusion  Using an honest reference rater  Randomized reference rater  Outside experts... 9 9

  10. Issue: Unknown/diff. priors Camera Quality: dist 𝐸 over {𝑀, 𝐼}  In reality, raters have diff. beliefs about 𝑑 2 ~𝑇 𝑑 1 ~𝑇 𝑇 : dist over (𝐸, 𝑇) , unknown to π‘š, β„Ž jointly Rater Rater w/ 𝐸 the center 1 2  Addressed in [WP12] for binary signals 𝑇 Report Report 𝑠 1 𝑠 2 Center: - Ebay - NetFlix - Amazon 10 10

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend