The Price of Data Simone Galperti Aleksandr Levkun Jacopo Perego - PowerPoint PPT Presentation

The Price of Data Simone Galperti Aleksandr Levkun Jacopo Perego UC San Diego UC San Diego Columbia University August 2020

Overview introduction Data has become an essential commodity in modern economies A few markets for data have emerged, where data sources are compensated for the data they generate This paper: A theory of how to individually price the entries of a dataset so as to reflect their proper value Our questions: Normative: How much does each entry contribute to the total value of the dataset for its owner? Operational: What is owner’s WTP for an additional data entry? What drive these prices and how can we compute them? How are these prices affected by privacy concerns?

Overview introduction Data has become an essential commodity in modern economies A few markets for data have emerged, where data sources are compensated for the data they generate This paper: A theory of how to individually price the entries of a dataset so as to reflect their proper value Our questions: of the dataset for its owner? ▶ Normative: How much does each entry contribute to the total value ▶ Operational: What is owner’s WTP for an additional data entry? ▶ What drive these prices and how can we compute them? ▶ How are these prices affected by privacy concerns?

Overview introduction Our approach leverages a simple insight : used by its owner to achieve a given goal mathematical relationship Goal for Today’s Talk 1. Formalize relationship + data-pricing problem 2. Preliminary characterization of price determinants and properties 3. Showcase properties through examples ▶ The data-pricing problem is intimately related to how the dataset is ▶ When carefully formulated, the two problems are in a special

Overview introduction Our approach leverages a simple insight : design problem mathematical relationship Goal for Today’s Talk 1. Formalize relationship + data-pricing problem 2. Preliminary characterization of price determinants and properties 3. Showcase properties through examples ▶ The data-pricing problem is intimately related to the information ▶ When carefully formulated, the two problems are in a special

Overview introduction Our approach leverages a simple insight : design problem relationship Goal for Today’s Talk 1. Formalize relationship + data-pricing problem 2. Preliminary characterization of price determinants and properties 3. Showcase properties through examples ▶ The data-pricing problem is intimately related to the information ▶ When carefully formulated, the two problems are in a dual

Modeling Ingredients introduction A standard and flexible framework: Payoff state + players’ private signals about it Designer may use entries : (no privacy) (privacy) ▶ Finite static games with incomplete information Data entries and the dataset : ▶ A “data entry” is a state of the world: ▶ The “dataset” consists of all entries + their frequencies ▶ Without players’ consent ▶ Only with players’ consent

Preliminary Results introduction Pricing formula being non-separable across states What drives the prices? gambling against players ( novel interpretation for dual variables) Properties The effects of privacy protection ▶ Individual price for each data entry despite info-design problem ▶ (1) Designer’s payoff + (2) Designing information equivalent to ▶ Price captures externalities that each data entry may exert on others ▶ Price captures dependencies between dimensions of each data entry ▶ It lowers value of dataset, but can increase price of some entries

Related Literature introduction Information Design. Kamenica & Gentzkow (’11), Bergemann & Morris (’16, ’19), ... Duality & Correlated Equilibrium. Nau & McCardle (’90), Nau (’92), Hart & Schmeidler (’89), Myerson (’97) Duality & Bayesian Persuasion . Kolotilin (’18), Dworczak & Martini (’19), Dizdar & Kovac (’19), Dworczak & Kolotilin (’19) Markets for Information. Bergemann & Bonatti (’19) Bergemann & Bonatti (’15), Bergmann, Bonatti, Smolin (’18) Information Privacy. Ali, Lewis, and Vasserman (’20), Bergemann, Bonatti, and Gan (’20), Acemoglu, Makhdoumi, Malekian, and Ozdaglar, (’20), Acquisti, Taylor, Wagman (’16)

illustrative example

A Monopolist’s Problem (Bergemann et al. ’15) example Monopolist sells to potential buyers (assume MC=0) Monopolist does not directly observe buyers’ valuation A dataset contains data about the potential buyers: information she receives ▶ A share µ > 1 2 of the entries has valuation ω = 2 ▶ A share 1 − µ of the entries has valuation ω = 1 A data intermediary owns the dataset; can use it without buyers’ consent Monopolist sets price a and can discriminate depending on the

A Monopolist’s Problem (Bergemann et al. ’15) Suppose monopolist receives this information about the potential buyer example Monopolist would set s ′ s ′′ ω = 1 1 0 1 − µ 1 − 1 − µ ω = 2 µ µ { for “segment” s ′ 1 a ( s ) = 2 for “segment” s ′′ The total consumer surplus is V ∗ = 1 − µ and for each buyer ω { 0 if ω = 1 v ∗ ( ω ) = 1 − µ if ω = 2 µ

A Monopolist’s Problem example Our Questions: We do not interpret as monetary incentive to give up data Important, yet distinct issue ▶ What price p ( ω ) would/should the data intermediary be willing to pay to add one more buyer with valuation ω to her dataset? ▶ What price p ( ω ) would “properly” compensate buyer ω for role that her data plays to achieve V ∗ ? Broadly refer to these questions as the data-pricing problem

A Monopolist’s Problem example Our Questions: ▶ What price p ( ω ) would/should the data intermediary be willing to pay to add one more buyer with valuation ω to her dataset? ▶ What price p ( ω ) would “properly” compensate buyer ω for role that her data plays to achieve V ∗ ? Broadly refer to these questions as the data-pricing problem We do not interpret p ( ω ) as monetary incentive to give up data ▶ Important, yet distinct issue

Data Entries and Dataset model Finite set of players I = { 1 , . . . , n } Finite set of payoff states Ω 0 Finite set of private types Ω I = Ω 1 × . . . × Ω n , players’ own data Common prior belief µ ∈ ∆(Ω) , where Ω = Ω 0 × Ω I We refer to (Ω , µ ) as a dataset and to each ω as a data entry

Base Game and Information model Each player i has finite set of actions A i . Let A = A 1 × . . . × A n Utility function u i : A × Ω 0 → R ( ) Base game G = I, (Ω , µ ) , ( A i , u i ) i ∈ I An information structure is π : Ω → ∆( S 1 × . . . × S n ) , with S i finite ∀ i BNE ( G, π ) set of Bayes-Nash equilibria for ( G, π )

Designer as a Data Intermediary model We consider two cases: without players’ consent (akin to no privacy protection) their consent (akin to privacy protection) We begin by analyzing the data-pricing problem under omniscient design Designer provides information via π to players Objective is v : A × Ω 0 → R 1. Omniscient design. Designer already owns dataset and can use it 2. Design w/ Elicitation . Designer has to obtain players’ data and needs

data-pricing problem

The Notion of A Price data-pricing problem The data-pricing problem consists in finding a function output is information : p : Ω → R s.t. p ( ω ) reflects the “proper” value that ω generates for the designer p should depend on how data entries are used to produce information We think of data entries ω ’s as inputs into a production problem whose π : Ω → ∆( S ) Data-pricing problem ⇐ ⇒ Data-use problem

How Is Data Used? Build on the information-design literature: a given objective Question data-pricing problem ▶ How to optimally use data to produce information so as to maximize For each π , define ( ∏ ) ∑ V ( π ) = max v ( a, ω 0 ) σ ( a i | ω i , s i ) π ( s | ω ) µ ( ω ) σ ∈ BNE ( G,π ) ω,s,a i ∈ I The information-design problem consists of V ⋆ = max V ( π ) π ▶ What is the proper share of V ∗ to attribute to ω ? → p ( ω )

Direct Value of Data data-pricing problem may play a role in the payoff that is Not quite! it fails to capture that ? that is attributable to capture the share of Does generated by another Clearly, One possible approach to answer this question: 1. Find solution of ID problem π ∗ and σ ∗ 2. Compute direct value of ω . This is the expected payoff from ω ∑ v ∗ ( ω ) = v ( a, ω 0 ) σ ∗ ( a | s, ω I ) π ∗ ( s | ω ) s ∑ µ ( ω ) v ∗ ( ω ) = V ∗ ω

The Price of Data Simone Galperti Aleksandr Levkun Jacopo Perego - PowerPoint PPT Presentation

The Price of Data Simone Galperti Aleksandr Levkun Jacopo Perego UC San Diego UC San Diego Columbia University August 2020 Overview introduction Data has become an essential commodity in modern economies A few markets for data have

Greg Kirk Director Planfarm Pty Ltd Poor Price Good Price Poor Price Good Price Poor Season

Haskell Literacy in Six Slides Greg Price ( price ) 2008 Jan 29 Greg Price ( price ) () Haskell

Reasoning in Haskell Greg Price ( price ) 2008 Jan 31 Greg Price ( price ) () Reasoning in

COST OR PRICE COST OR PRICE REASONABLENESS REASONABLENESS (CPR) (CPR) UH APM A8.285 RCUH

Annual House Price Changes (New & Resale) 2014 Price Growth (Actual), 2015 Forecasts [New]

by Utah State University Lease Price: $6,399 for 400 hours Lease Price: $5,385 for 400 hours

Asymmetries in Commodity Price Asymmetries in Commodity Price Behaviour Asymmetries in Commodity

TSX: CSU CN Long Investment Thesis Current Price: $469.68 CAD Jan 2017 Price Target: $700 CAD

Putting a socially responsible price on carbon Putting a socially responsible price on carbon

Vickrey-Clark-Groves auctions First price vs second price Recap Last week: Introduction

* Price: Price * Enumeration getZones() during requirements analysis to model Price

Haskell: Compiler as Theorem-Prover Greg Price ( price ) 2007 Nov 19 code samples:

An agent-based model of price flexing by chain-store retailers Ondej Krl, FEA MU, MUES

Fiat Chrysler Automobiles NV NYSE: FCAU Price as of 10/06/15: $14.25 Target Price: $19.77

Price Transparency Program Agenda 1. Overview of Oregons Prescription Drug Price program 2.

Oil Price Shocks and Inflation (Bharat Trehan, FRBSF Economic Letter, 2005) 1 When oil price

Systematic uncertainties in long baseline experiments Tokai to Kamioka (T2K) experiment 295 km

Type Inference, Principal Typings, and Let-Polymorphism for First-Class Mixin Modules Henning

Experimental status of neutrino scattering S.Bolognesi (T2K, CEA Saclay) 2/17 A hot topic...

SSVI a la Bergomi Stefano De Marco 1 , Claude Martini 2 1 Ecole Polytechnique 2 Zeliade Systems

T URING MACHINES ( PROS ) easy to imagine easy to explain de-facto the standard model

I m p a c t o f c r o s s s e c t i o n u n c e r t a i n t i e s

Jon Pollock Director of Corporate Sales, BT. Todays agenda. Paneldebate

MPLS based Virtual Private Networks Sources: V. Alwayn, Advanced MPLS Design and Implementation ,

Sambuz

Useful Links

Newsletter

Mail Us

The Price of Data Simone Galperti Aleksandr Levkun Jacopo Perego - PowerPoint PPT Presentation

The Price of Data Simone Galperti Aleksandr Levkun Jacopo Perego UC San Diego UC San Diego Columbia University August 2020 Overview introduction Data has become an essential commodity in modern economies A few markets for data have

Greg Kirk Director Planfarm Pty Ltd Poor Price Good Price Poor Price Good Price Poor Season

Haskell Literacy in Six Slides Greg Price ( price ) 2008 Jan 29 Greg Price ( price ) () Haskell

Reasoning in Haskell Greg Price ( price ) 2008 Jan 31 Greg Price ( price ) () Reasoning in

COST OR PRICE COST OR PRICE REASONABLENESS REASONABLENESS (CPR) (CPR) UH APM A8.285 RCUH

Annual House Price Changes (New &amp; Resale) 2014 Price Growth (Actual), 2015 Forecasts [New]

by Utah State University Lease Price: $6,399 for 400 hours Lease Price: $5,385 for 400 hours

Asymmetries in Commodity Price Asymmetries in Commodity Price Behaviour Asymmetries in Commodity

TSX: CSU CN Long Investment Thesis Current Price: $469.68 CAD Jan 2017 Price Target: $700 CAD

Putting a socially responsible price on carbon Putting a socially responsible price on carbon

Vickrey-Clark-Groves auctions First price vs second price Recap Last week: Introduction

* Price: Price * Enumeration getZones() during requirements analysis to model Price

Haskell: Compiler as Theorem-Prover Greg Price ( price ) 2007 Nov 19 code samples:

An agent-based model of price flexing by chain-store retailers Ondej Krl, FEA MU, MUES

Fiat Chrysler Automobiles NV NYSE: FCAU Price as of 10/06/15: $14.25 Target Price: $19.77

Price Transparency Program Agenda 1. Overview of Oregons Prescription Drug Price program 2.

Oil Price Shocks and Inflation (Bharat Trehan, FRBSF Economic Letter, 2005) 1 When oil price

Systematic uncertainties in long baseline experiments Tokai to Kamioka (T2K) experiment 295 km

Type Inference, Principal Typings, and Let-Polymorphism for First-Class Mixin Modules Henning

Experimental status of neutrino scattering S.Bolognesi (T2K, CEA Saclay) 2/17 A hot topic...

SSVI a la Bergomi Stefano De Marco 1 , Claude Martini 2 1 Ecole Polytechnique 2 Zeliade Systems

T URING MACHINES ( PROS ) easy to imagine easy to explain de-facto the standard model

I m p a c t o f c r o s s s e c t i o n u n c e r t a i n t i e s

Jon Pollock Director of Corporate Sales, BT. Todays agenda. Paneldebate

MPLS based Virtual Private Networks Sources: V. Alwayn, Advanced MPLS Design and Implementation ,

Sambuz

Useful Links

Newsletter

Mail Us

Annual House Price Changes (New & Resale) 2014 Price Growth (Actual), 2015 Forecasts [New]