CS6501: T
- pics in Learning and Game Theory
(Fall 2019) Prediction Markets and Scoring Rules
Instructor: Haifeng Xu
CS6501: T opics in Learning and Game Theory (Fall 2019) Prediction - - PowerPoint PPT Presentation
CS6501: T opics in Learning and Game Theory (Fall 2019) Prediction Markets and Scoring Rules Instructor: Haifeng Xu Outline Recap:Scoring Rule and Information Elicitation Connection to Prediction Markets Manipulations in Prediction
CS6501: T
(Fall 2019) Prediction Markets and Scoring Rules
Instructor: Haifeng Xu
2
Ø Recap:Scoring Rule and Information Elicitation Ø Connection to Prediction Markets Ø Manipulations in Prediction Markets
3
Information Elicitation from A Single Expert
ØWe (designer) want to learn the distribution of random var 𝐹 ∈ [𝑜]
ØAn expert/predictor has a predicted distribution 𝜇 ∈ Δ( ØWant to incentivize the expert to truthfully report 𝜇
Idea: reward expert by designing a scoring rule 𝑇(𝑗; 𝑞) where: (1) 𝑞 is the expert’s report (may not equal 𝜇); (2) 𝑗 ∈ [𝑜] is the event realization
report 𝑞 = 𝜇 [uniquely] maximizes expected utility 𝑇(𝜇; 𝑞).
4
Example 1 [Log Scoring Rule] Ø 𝑇 𝑗; 𝑞 = log 𝑞3 Example 2 [Quadratic Scoring Rule] Ø 𝑇 𝑗; 𝑞 = 2𝑞3 − ∑7∈[(] 𝑞7
8
if there exists a (strictly) convex function 𝐻: Δ( → ℝ such that 𝑇 𝑗; 𝑞 = 𝐻 𝑞 + ∇𝐻(𝑞)(𝑓3 − 𝑞)
basis vector
5
Information Elicitation from Many Experts
ØReward for expert 𝑙’s prediction 𝑞A is
𝑇 𝑗; 𝑞A − 𝑇(𝑗; 𝑞ABC)
𝜇C 𝜇8 𝜇A
Idea: sequential elicitation – experts make predictions in sequence
6
Information Elicitation from Many Experts
𝜇C 𝜇8 𝜇A
predict once, then each expert maximizes utility by reporting true belief given her own knowledge. Remark
ØEach expert is expected to improve the prediction by aggregating
previous predictions and then update it
7
Information Elicitation from Many Experts
𝜇C 𝜇8 𝜇A
predict once, then each expert maximizes utility by reporting true belief given her own knowledge. Q1: how does sequential elicitation relate to prediction market? Q2: what happens is an expert can predict for multiple times?
8
Ø Recap:Scoring Rule and Information Elicitation Ø Connection to Prediction Markets Ø Manipulations in Prediction Markets
9
Equivalence of PMs and Sequential Elicitation
What does it mean?
ØExperts will have exactly the same incentives and receive the
same return
ØMarket maker’s total loss is the same
Theorem (informal). Under mild technical assumptions, efficient prediction markets are in one-to-one correspondence to sequential information elicitation using proper scoring rules. Next: will informally argue using the LMSR and log-scoring rules
10
Equivalence of LMSR and Log-Scoring Rules
Recall LMSR
Ø Value function with current sales quantity 𝑟: 𝑊 𝑟 = 𝑐 log ∑7∈[(] 𝑓GH/J Ø To buy 𝑦 ∈ ℝ( amount, a buyer pays: 𝑊 𝑟 + 𝑦 − 𝑊(𝑟) Ø Price function (they sum up to 1) 𝑞3 𝑟 = 𝑓GL/J ∑7∈[(] 𝑓GH/J = 𝜖𝑊(𝑟) 𝜖𝑟3
that moves the market price to her belief 𝜇.
11
Equivalence of LMSR and Log-Scoring Rules
Crucial terms: Ø Value function 𝑊 𝑟 = 𝑐 log ∑7∈[(] 𝑓GH/J Ø Price function 𝑞3 𝑟 =
NOL/P ∑H∈[Q] NOH/P = RS(G) RGL
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC
𝑓GL
TUV/J
∑7∈[(] 𝑓GH
TUV/J = 𝑞3
ABC
12
Equivalence of LMSR and Log-Scoring Rules
Crucial terms: Ø Value function 𝑊 𝑟 = 𝑐 log ∑7∈[(] 𝑓GH/J Ø Price function 𝑞3 𝑟 =
NOL/P ∑H∈[Q] NOH/P = RS(G) RGL
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Optimal purchase for the expert is 𝑦∗ such that 𝑞3 𝑟ABC + 𝑦∗ = 𝑓(GL
TUVXYL ∗)/J
∑7∈[(] 𝑓(GH
TUVXYH ∗)/J = 𝑞3
A
and pays 𝑊 𝑟ABC + 𝑦∗ − 𝑊(𝑟ABC) = 𝑐 log ∑7∈[(] 𝑓(GH
TUVXYH ∗)/J − 𝑐 log ∑7∈[(] 𝑓GH TUV/J
13
Equivalence of LMSR and Log-Scoring Rules
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Optimal purchase for the expert is 𝑦∗ such that 𝑞3 𝑟ABC + 𝑦∗ = 𝑓(GL
TUVXYL ∗)/J
∑7∈[(] 𝑓(GH
TUVXYH ∗)/J = 𝑞3
A
and pays 𝑊 𝑟ABC + 𝑦∗ − 𝑊(𝑟ABC) = 𝑐 log ∑7∈[(] 𝑓(GH
TUVXYH ∗)/J − 𝑐 log ∑7∈[(] 𝑓GH TUV/J
∑7∈[(] 𝑓(GH
TUVXYH ∗)/J =
N(OL
TUVZ[L ∗)/P
\L
T
14
Equivalence of LMSR and Log-Scoring Rules
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Optimal purchase for the expert is 𝑦∗ such that 𝑞3 𝑟ABC + 𝑦∗ = 𝑓(GL
TUVXYL ∗)/J
∑7∈[(] 𝑓(GH
TUVXYH ∗)/J = 𝑞3
A
and pays 𝑊 𝑟ABC + 𝑦∗ − 𝑊(𝑟ABC) = 𝑐 log ∑7∈[(] 𝑓(GH
TUVXYH ∗)/J − 𝑐 log ∑7∈[(] 𝑓GH TUV/J
= 𝑐 log
N(OL
TUVZ[L ∗)/P
\L
T
− 𝑐 log
NOL
TUV/P
\L
TUV
15
Equivalence of LMSR and Log-Scoring Rules
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Optimal purchase for the expert is 𝑦∗ such that 𝑞3 𝑟ABC + 𝑦∗ = 𝑓(GL
TUVXYL ∗)/J
∑7∈[(] 𝑓(GH
TUVXYH ∗)/J = 𝑞3
A
and pays 𝑊 𝑟ABC + 𝑦∗ − 𝑊(𝑟ABC) = 𝑐 log ∑7∈[(] 𝑓(GH
TUVXYH ∗)/J − 𝑐 log ∑7∈[(] 𝑓GH TUV/J
= 𝑐 log
N(OL
TUVZ[L ∗)/P
\L
T
− 𝑐 log
NOL
TUV/P
\L
TUV
= 𝑦3
∗ − 𝑐(log 𝑞3 A − log 𝑞3 ABC)
Note: this holds for any 𝑗
16
Equivalence of LMSR and Log-Scoring Rules
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Repeat our finding: expert pays 𝑦3
∗ − 𝑐(log 𝑞3 A − log 𝑞3 ABC)
Ø What is the expert utility if outcome 𝑗 is ultimately realized? 𝑦3
∗ − [𝑦3 ∗ − 𝑐(log 𝑞3 A − log 𝑞3 ABC)]
from contracts’ return
17
Equivalence of LMSR and Log-Scoring Rules
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Repeat our finding: expert pays 𝑦3
∗ − 𝑐(log 𝑞3 A − log 𝑞3 ABC)
Ø What is the expert utility if outcome 𝑗 is ultimately realized? 𝑦3
∗ − [𝑦3 ∗ − 𝑐(log 𝑞3 A − log 𝑞3 ABC)]
= 𝑐 ⋅ [log 𝑞3
A − log 𝑞3 ABC]
= 𝑐 ⋅ [ 𝑇^_`(𝑗; 𝑞A) − 𝑇^_` 𝑗; 𝑞ABC ] = payment in the sequential elicitation (constant 𝑐 is a scalar)
18
Equivalence of LMSR and Log-Scoring Rules
Q1: If current market price is 𝑞ABC, what is the optimal payoff for an expert with belief 𝜇 = 𝑞A? Ø Let 𝑟ABC denote the market standing corresponding to price 𝑞ABC Ø Repeat our finding: expert pays 𝑦3
∗ − 𝑐(log 𝑞3 A − log 𝑞3 ABC)
Ø What is the expert utility if outcome 𝑗 is ultimately realized? Expert achieves the same utility in LMSR and log-scoring-rule elicitation for any event realization
19
Equivalence of LMSR and Log-Scoring Rules
Q2: What is the worst case loss (i.e., maximum possible payment) when using log-scoring rule in sequential info elicitation? Ø Total payment – if event 𝑗 realized – is ∑AaC
b
[log 𝑞3
A − log 𝑞3 ABC]
≤ 0 − log 𝑞3
e
Ø To avoid cases where some 𝑞3
e is too small (then we need to pay
a lot), should choose 𝑞e = (
C ( , ⋯ , C () as uniform distribution
Ø Worst-case loss is thus log 𝑜 (same as LMSR, up to constant 𝑐) = log 𝑞3
b − log 𝑞3 e
20
Back to Our Original Theorem…
ØPrevious argument generalizes to arbitrary proper scoring rules ØFormal proof employs duality theory
Theorem (informal). Under mild technical assumptions, efficient prediction markets are in
correspondence with sequential information elicitation using proper scoring rules.
21
Back to Our Original Theorem…
ØPrevious argument generalizes to arbitrary proper scoring rules ØFormal proof employs duality theory
Theorem (informal). Under mild technical assumptions, efficient prediction markets are in
correspondence with sequential information elicitation using proper scoring rules. The Correspondence PM with 𝑊 𝑟 corresponds to sequential elicitation with scoring rules determined by 𝑊∗ 𝑞 = the convex conjugate of 𝑊 𝑟
Ø Convex conjugate is in some sense the “dual” of function 𝑊(𝑟) Ø See paper Efficient Market Making via Convex Optimization for details
22
Ø Recap:Scoring Rule and Information Elicitation Ø Connection to Prediction Markets Ø Manipulations in Prediction Markets
23
ØGenerally, we cannot force experts to participate just once
ØManipulations arise when experts can predict multiple times
with multiple-round predictions) Initial market prediction 𝑞e
𝑞C 𝑞8 𝑞h
Game
24
ØPredict event 𝐹 ∈ {0,1}; Outcome drawn uniformly at random ØExpert Alice observes a signal 𝐵 = 𝐹
ØExpert Bob also observes the outcome, i.e., signal 𝐶 = 𝐹
Q: In A-B-A game, what should Alice predict at stage 1 and 3? Report her true prediction at stage 1 (which is perfectly correct)
25
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Q: what is the optimal experts behaviors in A-B-A game? Market starts with initial prediction pe YES = Pe NO = 1/2
26
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Q: what is the optimal experts behaviors in A-B-A game? Ø At stage 1, what is Alice’s probability belief of YES?
Ø Should Alice report this at stage 1?
27
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Q: what is the optimal experts behaviors in A-B-A game? Ø What should Alice do at stage 1 then?
28
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Q: what is the optimal experts behaviors in A-B-A game? Ø What should Bob predict at stage 2?
He only has one chance to predict, and his belief is the best given his current knowledge
29
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Q: what is the optimal experts behaviors in A-B-A game? Ø What should Bob predict at stage 2?
30
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Q: what is the optimal experts behaviors in A-B-A game? Ø What should Alice predict at stage 3?
Pr 𝑍𝐹𝑇 = 1 𝑝𝑠 0 à receiving a lot of credits
31
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Remarks Ø Example shows how experts aggregate previous information and update their predictions along the way Ø Manipulations arise even if a single expert can predict twice
32
ØAlice observes signal 𝐵 ∈ {0,1}, and Pr 𝐵 = 0 = 0.51 ØBob observes signal 𝐶 ∈ {0,1}, and Pr 𝐶 = 0 = 0.49
ØThey are asked to predict event 𝐹 = (whether 𝐵 + 𝐶 = 1)
Remarks Ø This is an issue in prediction markets, since experts can buy and sell whenever they want Ø Equilibrium of PMs are still poorly understood, even for the fundamental A-B-A games
via Persuasion for state-of-the-art results
Haifeng Xu
University of Virginia hx4ad@virginia.edu