1
Eliciting Informative Feedback: The Peer-Prediction Method Colin - - PowerPoint PPT Presentation
Eliciting Informative Feedback: The Peer-Prediction Method Colin - - PowerPoint PPT Presentation
Eliciting Informative Feedback: The Peer-Prediction Method Colin Zheng and Kenneth Wang 1 The Setting Camera Quality: dist over {, } 2 ~ 1 ~ : dist over , jointly Rater Rater w/ 1 2 Report
2 2
Rater 1
Camera Quality: dist πΈ over {π, πΌ}
Rater 2 π‘2~π π‘1~π
π: dist over π, β jointly w/ πΈ
Center:
- Ebay
- NetFlix
- Amazon
Report π 1 Report π 2
The Setting
3 3
Rater 1
Camera Quality: dist πΈ over {π, πΌ}
Rater 2 π‘2~π π‘1~π
π: dist over π, β jointly w/ πΈ
Center:
- Ebay
- NetFlix
- Amazon
Report π 1 Report π 2
The Setting
Task:
How to make rational raters report honestly (π π = π‘π)
Naive Attempt: reward π = $1 if π 1 = π 2, π = $0 otherwise Problem: rater 1 will report βmore likely signalβ of rater 2
4 4
Rater 1
Camera Quality: dist πΈ over {π, πΌ}
Rater 2 π‘2~π π‘1~π
π: dist over π, β jointly w/ πΈ
Center:
- Ebay
- NetFlix
- Amazon
Report π 1 Report π 2
The Setting
Task (formulation):
Reporting π π = π‘π is a Nash Equilibrium β rater π, signal π, π¦ β π π½π‘2 π1 π‘1, π‘2 π‘1 = π β₯ π½π‘2 π1 π¦, π‘2 π‘1 = π π½π‘1 π2 π‘2, π‘1 π‘2 = π β₯ π½π‘1 π2 π¦, π‘1 π‘2 = π
5 5
Rater 1
Camera Quality: dist πΈ over {π, πΌ}
Rater 2 π‘2~π π‘1~π
π: dist over π, β jointly w/ πΈ
Center:
- Ebay
- NetFlix
- Amazon
Report π 1 Report π 2
Solution using Proper Scoring Rule
Task:
choose payoff π1 such that π½π‘2 π1 π¦, π‘2 π‘1 = π is maximized at π¦ = π Equivalently, π½π¨~π π1 π¦, π¨ where π = π‘2 π‘1=π
Recall: π is Proper Scoring Rule if
π½π¨~π[π(π, π¨)] is maximized at π = π
So let π1 π¦, π¨ = π(π‘2 π‘1=π¦,π¨)
6 6
Issue: costs to the rater
In reality,
- Evaluating and reporting honestly incur a
relative cost π > 0
(e.g. testing is time consuming, opportunity cost, ...)
- Can be offset by scaling up the payoffs
- Problems:
β Paying too much for truthful information? β What if π is unknown? [PRGJ08]
7 7
Issue: costs to the center
To reduce centerβs cost:
- Budget balancing:
β pair up raters, make each pair a zero-sum game
- Use linear optimization to find cost-optimal
payoff function [JF06]
8 8
Issue: risk aversion
- When the center knows the ratersβ utility
function(s)
- When the center does not know the utility
function(s)
- Using multiple reference raters reduces risk
9 9
Issue: collusion
- Using an honest reference rater
- Randomized reference rater
- Outside experts...
10 10
Issue: Unknown/diff. priors
- In reality, raters have
- diff. beliefs about
(πΈ, π), unknown to the center
- Addressed in [WP12]
for binary signals π
Rater 1
Camera Quality: dist πΈ over {π, πΌ}
Rater 2 π‘2~π π‘1~π π: dist over
π, β jointly w/ πΈ
Center:
- Ebay
- NetFlix
- Amazon
Report π 1 Report π 2