market for the U.S. Elections Miroslav Dudk Thanks: S Lahaie, D - - PowerPoint PPT Presentation

market for the u s elections
SMART_READER_LITE
LIVE PREVIEW

market for the U.S. Elections Miroslav Dudk Thanks: S Lahaie, D - - PowerPoint PPT Presentation

A combinatorial prediction market for the U.S. Elections Miroslav Dudk Thanks: S Lahaie, D Pennock, D Rothschild, D Osherson, A Wang, C Herget Polling accurate, but costly limited range of questions limited timeliness Polling accurate,


slide-1
SLIDE 1

A combinatorial prediction market for the U.S. Elections

Miroslav Dudík

Thanks: S Lahaie, D Pennock, D Rothschild, D Osherson, A Wang, C Herget

slide-2
SLIDE 2
slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5
slide-6
SLIDE 6

Polling

accurate, but costly limited range of questions limited timeliness

slide-7
SLIDE 7

Polling

accurate, but costly limited range of questions limited timeliness

Prediction markets

accurate and cheap broad range of questions good timeliness

slide-8
SLIDE 8

Outline

Prediction markets: Setting and challenges Addressing the challenges: constraint generation Empirical evaluation: U.S. Elections 2008 Field experiment: U.S. Elections 2012

slide-9
SLIDE 9

Security

= proposition which becomes true or false at some point in future “Romney will win Florida in Elections 2012”

slide-10
SLIDE 10

Security

= proposition which becomes true or false at some point in future “Romney will win Florida in Elections 2012” Traders buy shares for some price: $0.45 per share For each share of a security receive: $1 if true $0 if false

slide-11
SLIDE 11

Market implementation: (automated) market maker

market maker sets prices if more shares bought, price increases the price equals the consensus probability

  • f the event

buy/sell buy/sell buy/sell market maker

slide-12
SLIDE 12

Combinatorial securities: more information

payoff is a function of common variables e.g., 50 states elect Obama or Romney

slide-13
SLIDE 13

Combinatorial securities: more information

Obama to lose FL, but win election Obama to win >8 of 10 Northeastern states

slide-14
SLIDE 14

Industry standard: ignore relationships

Treat them as independent markets: Las Vegas sports betting Kentucky horse racing Wall Street stock options Betfair political betting

slide-15
SLIDE 15

Industry standard: ignore relationships

Treat them as independent markets: Las Vegas sports betting Kentucky horse racing Wall Street stock options Betfair political betting Problem: arbitrage opportunities

slide-16
SLIDE 16

Arbitrage

trading with guaranteed profits

slide-17
SLIDE 17

Arbitrage

trading with guaranteed profits receive $1 if true

slide-18
SLIDE 18

Arbitrage

trading with guaranteed profits price $0.40 price $0.50

slide-19
SLIDE 19

Arbitrage

trading with guaranteed profits possible if prices incoherent prices cannot be realized as probabilities price $0.40 price $0.50

slide-20
SLIDE 20

Arbitrage

trading with guaranteed profits possible if prices incoherent prices cannot be realized as probabilities Pricing without arbitrage: #P-hard Industry standard = Ignore arbitrage price $0.40 price $0.50

slide-21
SLIDE 21

Arbitrage

trading with guaranteed profits possible if prices incoherent prices cannot be realized as probabilities Pricing without arbitrage: #P-hard Industry standard = Ignore arbitrage traders rewarded for computation instead of information poor information sharing

  • price $0.40

price $0.50

slide-22
SLIDE 22

Our approach: partial arbitrage removal

Separate pricing (must be fast) and information propagation

  • fast pricing in independent markets

for tractably small groups of securities

  • in parallel: constraint generation

to find and remove arbitrage Embedded in convex optimization (with many nice properties).

  • (Dudík et al. 2011)
slide-23
SLIDE 23

Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold

Cost-based pricing

(Chen and Pennock 2007)

slide-24
SLIDE 24

Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold

Cost-based pricing

(Chen and Pennock 2007)

𝒓 = ( 100, 400)

slide-25
SLIDE 25

Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold Trading: 𝒔 ∈ ℝ𝑜 shares bought by a trader cost: 𝐷 𝒓 + 𝒔 − 𝐷 𝒓

Cost-based pricing

(Chen and Pennock 2007)

𝒓 = ( 100, 400)

slide-26
SLIDE 26

Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold Trading: 𝒔 ∈ ℝ𝑜 shares bought by a trader cost: 𝐷 𝒓 + 𝒔 − 𝐷 𝒓

Cost-based pricing

(Chen and Pennock 2007)

𝒓 = ( 100, 400) 𝒔 = ( 0, 2)

slide-27
SLIDE 27

Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold Trading: 𝒔 ∈ ℝ𝑜 shares bought by a trader cost: 𝐷 𝒓 + 𝒔 − 𝐷 𝒓 state updated: 𝒓′ ← 𝒓 + 𝒔 𝒓′ = ( 100, 402)

Cost-based pricing

(Chen and Pennock 2007)

𝒓 = ( 100, 400) 𝒔 = ( 0, 2)

slide-28
SLIDE 28

𝛼𝐷(𝒓) = ($0.70, $0.75) Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold Trading: 𝒔 ∈ ℝ𝑜 shares bought by a trader cost: 𝐷 𝒓 + 𝒔 − 𝐷 𝒓 state updated: 𝒓′ ← 𝒓 + 𝒔 instantaneous prices: 𝛼𝐷(𝒓) 𝒓′ = ( 100, 402)

Cost-based pricing

(Chen and Pennock 2007)

𝒓 = ( 100, 400) 𝒔 = ( 0, 2)

slide-29
SLIDE 29

𝛼𝐷(𝒓) = ($0.70, $0.75) Setup: 𝑜 securities 𝐷: ℝ𝑜 → ℝ convex cost function 𝒓 ∈ ℝ𝑜 market state = #shares sold Trading: 𝒔 ∈ ℝ𝑜 shares bought by a trader cost: 𝐷 𝒓 + 𝒔 − 𝐷 𝒓 state updated: 𝒓′ ← 𝒓 + 𝒔 instantaneous prices: 𝛼𝐷(𝒓) 𝒓′ = ( 100, 402)

Cost-based pricing

(Chen and Pennock 2007)

𝒓 = ( 100, 400) 𝒔 = ( 0, 2)

slide-30
SLIDE 30

Can we just use existing approaches from graphical models?

MCMC—randomized, slow convergence mean field—non-convex belief propagation—lack of convergence

slide-31
SLIDE 31

Can we just use existing approaches from graphical models?

MCMC—randomized, slow convergence mean field—non-convex belief propagation—lack of convergence Problematic for pricing: poor convergence  volatility non-determinism  distorted incentives and user experience

slide-32
SLIDE 32

Our approach

implement a coherent pricing scheme

  • n small groups of securities; e.g.,

detect incoherence between groups act as an arbitrageur to restore coherence caveat: – difficult to detect incoherence in general – we detect only a subset of violations

priced 𝑓𝑟1 𝑓𝑟1 + 𝑓𝑟2 priced 𝑓𝑟2 𝑓𝑟1 + 𝑓𝑟2

number of shares bought so far

slide-33
SLIDE 33

Our approach

implement a coherent pricing scheme

  • n small groups of securities; e.g.,

detect incoherence between groups act as an arbitrageur to restore coherence caveat: – difficult to detect incoherence in general – we detect only a subset of violations

priced 𝑓𝑟1 𝑓𝑟1 + 𝑓𝑟2 priced 𝑓𝑟2 𝑓𝑟1 + 𝑓𝑟2

slide-34
SLIDE 34

Our approach

implement a coherent pricing scheme

  • n small groups of securities; e.g.,

detect incoherence between groups act as an arbitrageur to restore coherence caveat: – difficult to detect incoherence in general – we detect only a subset of violations

priced 𝑓𝑟1 𝑓𝑟1 + 𝑓𝑟2 priced 𝑓𝑟2 𝑓𝑟1 + 𝑓𝑟2

slide-35
SLIDE 35

Our approach

implement a coherent pricing scheme

  • n small groups of securities; e.g.,

detect incoherence between groups act as an arbitrageur to restore coherence caveat: – difficult to detect incoherence in general – we detect only a subset of violations

priced 𝑓𝑟1 𝑓𝑟1 + 𝑓𝑟2 priced 𝑓𝑟2 𝑓𝑟1 + 𝑓𝑟2

slide-36
SLIDE 36

For U.S. Elections: conjunction market

create 50 states (groups of size 2) create all pairs of states (groups of size 4) for conjunctions of 3 or more, group with opposite disjunction: 𝐵 ∧ 𝐶 ∧ 𝐷 with 𝐵 ∨ 𝐶 ∨ 𝐷 (groups of size 2) each group is independent market: fast pricing in parallel: generate, find, and fix constraints (via coordinate descent)

slide-37
SLIDE 37

For U.S. Elections: conjunction market

create 50 states (groups of size 2) create all pairs of states (groups of size 4) for conjunctions of 3 or more, group with opposite disjunction: 𝐵 ∧ 𝐶 ∧ 𝐷 with 𝐵 ∨ 𝐶 ∨ 𝐷 (groups of size 2) each group is independent market: fast pricing in parallel: generate, find, and fix constraints (via coordinate descent)

slide-38
SLIDE 38

Local coherence

Pairs: 𝑄 𝐵 ∧ 𝐶 + 𝑄 𝐵 ∧ 𝐶 = 𝑄 𝐵 Larger conjunctions: 𝑄 𝐵1 ∧ 𝐵2 ∧ ⋯ ∧ 𝐵𝑛 ≤ 𝑄 𝐵𝑗

slide-39
SLIDE 39

Clique constraints

For a disjunction 𝐵1 ∨ ⋯ ∨ 𝐵𝑛, pick a subset 𝐵𝑗1 ∨ ⋯ ∨ 𝐵𝑗𝑙 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≥ 𝑄 𝐵𝑗1 ∨ ⋯ ∨ 𝐵𝑗𝑙

slide-40
SLIDE 40

Clique constraints

For a disjunction 𝐵1 ∨ ⋯ ∨ 𝐵𝑛, pick a subset 𝐵𝑗1 ∨ ⋯ ∨ 𝐵𝑗𝑙 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≥ 𝑄 𝐵𝑗1 ∨ ⋯ ∨ 𝐵𝑗𝑙 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≥ 𝑘=1

𝑙

𝑄 𝐵𝑗𝑘 − 1≤𝑘<𝑚≤𝑙 𝑄 𝐵𝑗𝑘 ∧ 𝐵𝑗𝑚

slide-41
SLIDE 41

Clique constraints

For a disjunction 𝐵1 ∨ ⋯ ∨ 𝐵𝑛, pick a subset 𝐵𝑗1 ∨ ⋯ ∨ 𝐵𝑗𝑙 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≥ 𝑄 𝐵𝑗1 ∨ ⋯ ∨ 𝐵𝑗𝑙 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≥ 𝑘=1

𝑙

𝑄 𝐵𝑗𝑘 − 1≤𝑘<𝑚≤𝑙 𝑄 𝐵𝑗𝑘 ∧ 𝐵𝑗𝑚 #clique constraints exponential find only the tightest one!

(approximate submodular maximization via Feige et al. 2007)

slide-42
SLIDE 42

Tree constraints

For a disjunction 𝐵1 ∨ ⋯ ∨ 𝐵𝑛, 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≤ 𝑗=1

𝑛 𝑄 𝐵𝑗 − 𝑗,𝑘 ∈𝑈 𝑄 𝐵𝑗 ∧ 𝐵𝑘 (Galambos and Simoneli 1996)

slide-43
SLIDE 43

Tree constraints

For a disjunction 𝐵1 ∨ ⋯ ∨ 𝐵𝑛, 𝑄 𝐵1 ∨ ⋯ ∨ 𝐵𝑛 ≤ 𝑗=1

𝑛 𝑄 𝐵𝑗 − 𝑗,𝑘 ∈𝑈 𝑄 𝐵𝑗 ∧ 𝐵𝑘

where 𝑈 is a spanning tree on nodes 1, … , 𝑛

(Galambos and Simoneli 1996)

slide-44
SLIDE 44

Does it work?

Tested using a survey of Election 2008: singletons, pairs, triples Small data set—compare with exact: 10 states, 30k trades Large data set—compare with independent: 50 states, 300k trades

slide-45
SLIDE 45

Small data set: 10 states

log likelihood sensitivity parameter

more accurate sensitivity parameter

slide-46
SLIDE 46

Small data set: 10 states

log likelihood sensitivity parameter

sensitivity parameter more accurate

slide-47
SLIDE 47

Large data set: 50 states, 300k trades

log likelihood sensitivity parameter

sensitivity parameter more accurate

slide-48
SLIDE 48

No really, does it work?

slide-49
SLIDE 49

WiseQ Game

(launched September 16, 2012)

slide-50
SLIDE 50
slide-51
SLIDE 51
slide-52
SLIDE 52
slide-53
SLIDE 53

WiseQ by numbers

437 active users 3,137 trades 514 distinct bundles traded 1033 possible outcomes 44.5 million possible bundles allowed by our menu 17,222 securities in 2,840 small markets 20,983 coherence constraints

slide-54
SLIDE 54

Did market absorb information from users?

slide-55
SLIDE 55

Did market absorb information from users?

mean profit

slide-56
SLIDE 56

Did users place combinatorial bets?

slide-57
SLIDE 57

Did users place combinatorial bets?

50 100 150 200 250 300 350 400 450 unique users betting in a given category

slide-58
SLIDE 58

Did users place combinatorial bets?

50 100 150 200 250 300 350 400 450 unique users betting in a given category Presidential—singleton Senate, House—singleton

slide-59
SLIDE 59

Did users place combinatorial bets?

50 100 150 200 250 300 350 400 450 unique users betting in a given category Presidential—singleton Senate, House—singleton Presidential—combinatorial

slide-60
SLIDE 60

Did users place combinatorial bets?

50 100 150 200 250 300 350 400 450 unique users betting in a given category Presidential—singleton Senate, House—singleton Presidential—combinatorial Electoral votes Governor Additional combinatorial Economic indicators

slide-61
SLIDE 61

Numerical predictions: electoral votes

slide-62
SLIDE 62

probability electoral votes for Obama (4-Oct-2012) initialization prediction (20-Sep-2012)

Numerical predictions: electoral votes

actual outcome (6-Nov-2012)

slide-63
SLIDE 63

probability Job Numbers for September 2012 (4-Oct-2012) initialization prediction (20-Sep-2012)

Numerical predictions: job numbers

actual outcome

(5-Oct-2012)

slide-64
SLIDE 64

Summary

independent markets + constraints: tractable and accurate combinatorial markets can succeed with moderate numbers of users even on huge outcome spaces meaningful forecasts for challenging, but relevant outcomes: combinatorial and numerical