A PERSONALIZED BDM MECHANISM FOR EFFICIENT MARKET INTERVENTION - - PowerPoint PPT Presentation

a personalized bdm mechanism for efficient market
SMART_READER_LITE
LIVE PREVIEW

A PERSONALIZED BDM MECHANISM FOR EFFICIENT MARKET INTERVENTION - - PowerPoint PPT Presentation

A PERSONALIZED BDM MECHANISM FOR EFFICIENT MARKET INTERVENTION EXPERIMENTS Imanol Arrieta Ibarra joint work with Johan Ugander OBJECTIVES Introduce a subsidized product or service to a new market. Estimate its causal benefits on a


slide-1
SLIDE 1

A PERSONALIZED BDM MECHANISM FOR EFFICIENT MARKET INTERVENTION EXPERIMENTS

Imanol Arrieta Ibarra joint work with Johan Ugander

slide-2
SLIDE 2

OBJECTIVES

➤ Introduce a subsidized

product or service to a new market.

➤ Estimate its causal benefits

  • n a desired population.

➤ Estimate the demand for

the product.

➤ Be cost-efficient. ➤ Personalization allows more

efficient experiments to be run without sacrificing causal interpretation.

slide-3
SLIDE 3

MOTIVATION

➤ Providing Water Filters to a

low-income population in

  • Ghana. (Berry, Fischer and

Gutieras (2015))

➤ Introducing new agricultural

  • product. (Kremer, Duflo and

Robinson (2011))

➤ Introducing new health

products in developing countries.

slide-4
SLIDE 4

DEMAND ESTIMATION

➤ Dynamic Pricing. ➤ Take-it-or-leave-it (TIOLI). ➤ Willingness to pay: maximum

price at which someone is willing to pay in order to acquire a product.

➤ Vickrey auctions among the

population.

➤ Becker-DeGroot-Marschak

mechanism (BDM).

ESTIMATING CAUSAL EFFECTS

➤ Randomized Control Trials

(A/B) testing.

➤ Observational Studies

(Offline Policy Evaluation).

➤ Becker-DeGroot-Marschak

mechanism (BDM).

slide-5
SLIDE 5

BDM MECHANISM (SECOND PRICE AUCTION AGAINST RANDOM BIDDER)

  • 1. Offer a product (a water filter) to user U.
  • 2. Ask U for her willingness to pay: The maximum amount she

would pay to get the product.

  • 3. Draw a price at random P from an interval (0,T), where T is

some fixed number *.

  • 4. If the random price P is below user U’s reported willingness

to pay, she gets the product and pays P . Otherwise she does not get the product. * This is what we’ll personalize.

slide-6
SLIDE 6

CAUSALITY (AS NEYMAN-RUBIN CAUSAL MODEL)

➤ Measure an outcome variable Y that takes values Y(1) under

treatment and Y(0) under control.

➤ We are interested in the difference Y(1)-Y(0) ➤ In reality we get to only observe one of the two potential

  • utcomes.
slide-7
SLIDE 7

AVERAGE TREATMENT EFFECT

➤ Estimate the Average Treatment Effect for a given population:

E[Y(1)-Y(0)].

➤ Can use difference in means estimator:

slide-8
SLIDE 8

DIFFERENT PROBABILITIES OF ASSIGNMENT

➤ If the probability of treatment assignment is different for each

unit we can de-bias the estimates by using the Hajek estimator:

➤ Where Wi is a binary variable representing treatment

assignment.

̂ τHajek = ∑ 1

pi YobsWi

∑ 1

pi Wi

− ∑

1 1 − pi Yobs(1 − Wi)

1 1 − pi (1 − Wi)

slide-9
SLIDE 9

STRATIFICATION

➤ At each level of willingness to pay, the assignment to

treatment and control is random:

slide-10
SLIDE 10

DEMAND ESTIMATION

➤ Dynamic Pricing. ➤ Take-it-or-leave-it (TIOLI). ➤ Willingness to pay: maximum

price at which someone is willing to pay in order to acquire a product.

➤ Vickrey auctions among the

population.

➤ Becker-DeGroot-Marschak

mechanism (BDM).

ESTIMATING CAUSAL EFFECTS

➤ Randomized Control Trials

(A/B) testing.

➤ Observational Studies

(Offline Policy Evaluation).

➤ Becker-DeGroot-Marschak

mechanism (BDM).

slide-11
SLIDE 11

DEMAND ESTIMATION

➤ Having elicited the users’

willingness to pay, we can count the number of users which were willing to buy the product at each price point.

ESTIMATING CAUSAL EFFECTS

➤ As in Berry, Fischer and

Gutieras (2015).

➤ T

wo sources of randomness:

➤ Conditional on

willingness to pay, treatment is random.

➤ Conditional on

willingness to pay and being treated, price is random.

slide-12
SLIDE 12

PERSONALIZATION

➤ Reduce unnecessary costs for

researchers by minimizing potential subsidies.

➤ Reduce variance in

estimations by allowing better balance at each level of willingness to pay.

➤ Maintain incentive

compatibility to elicit correct valuations.

slide-13
SLIDE 13

PERSONALIZED BDM MECHANISM

  • 1. Offer a product with cost C to a subject with Xi observable

characteristics.

  • 2. Draw a price from without showing it to the subject
  • 3. Ask the subject to report her willingness to pay which

comes from a population

  • 4. If the subject gets the intervention and pays (the

lower price). Otherwise there is no exchange.

ϕ FΦ|Xi

wi

Wi|Xi ϕ < w ϕ

slide-14
SLIDE 14

THREE ALTERNATIVES FOR

  • 1. Make = E[W|Xi].
  • 2. Run an RCT (Randomized Control Trial):

.

  • 3. Draw prices from a personalized uniform distribution:

where a and b are chosen based on W|Xi .

Φ|Xi

Φ|Xi

FΦ|Xi(w) = 1 2 + 1 2𝕁{w > C}

FΦ|Xi(w) = ϵ 2 + (1 − ϵ) w𝕁{a ≤ w ≤ b} b − a + ϵ 2 𝕁{w ≥ C}

FΦ|Xi(w) = 𝕁{w > E[W|Xi]}

slide-15
SLIDE 15

DRAW PRICES FROM A PERSONALIZED UNIFORM DISTRIBUTION

➤ BDM is a special case where : ➤ We’ll refer to as PBDM the case where

FΦ|Xi(w) = ϵ 2 + (1 − ϵ) w𝕁{a ≤ w ≤ b} b − a + ϵ 2 𝕁{w ≥ C}

∀i ϵ = 0, a = 0, b = C a(Xi) = F−1

W|Xi (

δ 2), b(Xi) = F−1

W|Xi (1 − δ

2 )

slide-16
SLIDE 16

PBDM

FΦ|Xi(w) = ϵ 2 + (1 − ϵ) w𝕁{a ≤ w ≤ b} b − a + ϵ 2 𝕁{w ≥ C}

a(Xi) = F−1

W|Xi (

δ 2), b(Xi) = F−1

W|Xi (1 − δ

2 ) a b ϵ 2 C W

fΦ|Xi fW|Xi

slide-17
SLIDE 17

PBDM

FΦ|Xi(w) = ϵ 2 + (1 − ϵ) w𝕁{a ≤ w ≤ b} b − a + ϵ 2 𝕁{w ≥ C}

a(Xi) = F−1

W|Xi (

δ 2), b(Xi) = F−1

W|Xi (1 − δ

2 ) a b

ϵ 2

C W

fΦ|Xi fW|Xi

a b

ϵ 2

C W

fΦ|Xi fW|Xi

X1 X2

slide-18
SLIDE 18

ESTIMATOR VARIANCE

➤ Under Fisher’s null ( Y(1)=Y(0)= ) we get that the H-T

variance is:

➤ Minimized when:

Var( ̂ τHT) = 1 N2 (

N

i=1

1 − FΦ|Xi(Wi) FΦ|Xi(Wi) Yi(1)2 + FΦ|Xi(Wi) 1 − FΦ|Xi(Wi) Yi(0)2 ) = 1 N2

N

i=1

αi ( 1 − FΦ|Xi(Wi) FΦ|Xi(Wi) + FΦ|Xi(Wi) 1 − FΦ|Xi(Wi)) ∀i FΦ|Xi(Wi) = 1 2 α

slide-19
SLIDE 19

ESTIMATOR VARIANCE

  • 1. Make = E[W|Xi].
  • 2. Run an RCT (Randomized Control Trial) minimizes the

variance: .

  • 3. Draw prices from a personalized uniform distribution:

Φ|Xi

Var( ̂ τHT) = ∞ Var( ̂ τHT) = 2 ¯ α 2 ¯ α < Var( ̂ τHT) < ∞

slide-20
SLIDE 20

EXPECTED VARIANCE

➤ Using a Taylor expansion around the mean and taking a first

degree approximation.

➤ which gets minimized when:

E[Var( ̂ τHT)|X1, . . . , XN] ≈ 1 N2

N

i=1 (

1 − FΦ|Xi(E[Wi|Xi]) FΦ|Xi(E[Wi|Xi]) + FΦ|Xi(E[Wi|Xi]) 1 − FΦ|Xi(E[Wi|Xi]) )

FΦ|Xi(E[Wi|Xi]) = 1 2

slide-21
SLIDE 21

BUDGET REGRET

➤ We define budget regret as: ➤ and expected budget regret as: ➤ Every time we assign someone to treatment we incur some

regret derived from having been able to treat that subject with a lower subsidy had we known their true willingness to pay.

BR(Φ, W) = (W − Φ)I{Φ < W} br(FΦ|X) = EX,W[EFΦ|X[BR(Φ, W)]]

slide-22
SLIDE 22

BUDGET REGRET

  • 1. Make = E[W|Xi].
  • 2. Run an RCT (Randomized Control Trial) maximizes budget

regret: .

  • 3. Draw prices from a personalized uniform distribution:

Φ|Xi

br(FΦ|X) = E [ W 2 ] − θ, 0 < θ < E [ W 2 ] br(FΦ|X) = E[W] br(FΦ|X) = E [ W − ̂ a 2 ]

slide-23
SLIDE 23

INCENTIVE COMPATIBILITY

  • 1. Make = E[W|Xi].

➤ Subject is indifferent after convergence.

  • 2. Run an RCT (Randomized Control Trial) minimizes the variance:

➤ Subject indifferent amongst valuations .

  • 3. Draw prices from a personalized uniform distribution:

➤ Incentive compatible with probability higher than

Φ|Xi

1 − δ

slide-24
SLIDE 24

TIME CONSTRAINTS ON MECHANICAL TURK

➤ Understand MT workers

performance under time constraints.

➤ Measure how performance

changes conditional on how much workers value not being constrained.

➤ Evaluate performance on

turkers who paid not to be timed conditional on what they were paid.

➤ Used STAN to estimate the

distribution of W|X

slide-25
SLIDE 25

CONTEXT (DEMOGRAPHICS)

slide-26
SLIDE 26

CONTEXT (RISK AVERSION)

slide-27
SLIDE 27

PBDM

slide-28
SLIDE 28

PBDM

slide-29
SLIDE 29

PBDM

slide-30
SLIDE 30

DEMAND ESTIMATION

ESTIMATING CAUSAL EFFECTS

BDM PBDM Percentage treated 0.31 0.52 Hajek ATE 2.23 4.26 Standard Error 3.72 0.96 Average Budget Regret 65 45

25 50 75 100 125 150 175

Price

0% 20% 40% 60% 80% 100%

Percentage of users Inverse demand

BDM PBDM ?40 ?30 ?20 ?10 10 20 30 40

ATE(Difference in correct answers)

Block BDM Block PBDM Hajek BDM Hajek PBDM HT BDM HT PBDM

ATE estimates

slide-31
SLIDE 31

VARIANCE OF ESTIMATORS

2000 2500 3000 3500 4000 4500 5000 5500 6000

Budget

1 2 3 4 5 6 7 8 9

Standard error Standard error estimator as a function of budget

Hajek BDM Hajek PBDM

slide-32
SLIDE 32

PBDM SMOOTHNESS

slide-33
SLIDE 33

SUMMARY

➤ Presented a way to introduce personalization using machine

learning to experiments without losing the causal interpretation.

➤ Showed that personalization can reduce the cost of

unnecessary subsidies in this kind of experiments.

➤ Evaluated our methods on a Mechanical Turk experiment and

found that even though for the small sample size we were not able to find big differences in estimation preciseness, the amount of subsidy given to users was cut to half for our algorithm.

slide-34
SLIDE 34

FUTURE WORK

➤ Currently working on the estimation of heterogeneous

treatment effects.

➤ Optimal strategy when balancing the treatment conditional on

willingness to pay and the treatment conditional on price paid.

➤ Expanding this work to other types of mechanisms and

notions of incentive compatibility.

➤ Looking for applications where we can predict willingness to

pay from observed characteristics.

slide-35
SLIDE 35

THANK YOU

slide-36
SLIDE 36

WERE USERS UNDERSTANDING THE MECHANISM?

1 2 3 4 5 6

Tutorial stage

0.0 0.2 0.4 0.6 0.8 1.0

% revenue maximizers

slide-37
SLIDE 37

WAS THE ALGORITHM LEARNING?

25 50 75 100 125 150 175 200 PBDM mechanism 10 20 30 40 50 60 70 User Personalized mechanism

slide-38
SLIDE 38

ESTIMATING THE PROBABILITY OF TREATMENT

➤ We can estimate the probability of assignment by sampling

  • ver all possible permutations and simply averaging how

many times a given user would have been treated.

slide-39
SLIDE 39

PROPENSITY SCORES AND ARRIVAL ORDER RANDOMNESS

➤ Estimate probability of assignment at a given arrival position. ➤ Then, assume random arrival order and take average over

  • rder

0.69 0.72 0.45 Treatment probability: Willingness to pay: 50 120 70