

SLIDE 1

Query Incentive Networks

Jon Kleinberg Prabhakar Raghavan Cornell University Yahoo! Research

FOCS ’05, pages 132-141

Presented by Jian XIA

for COMP670O: Game Theoretic Applications in CS Spring 2006, HKUST

May 8, 2006

1 / 20

SLIDE 2

Incentive Query Propagation

The root (seeking information or services) issues queries, together with a reward.
— Each node is offered a reward by its parent, and offers a (smaller) reward to its children for continued propagation.
— Propagation down a branch stops when the reward reaches 0, or when a node with an answer is reached.
— The reward is paid only to nodes on the chosen path to the answer.

Example (figure): v∗ starts with initial utility 11 and offers reward 9; the next nodes offer rewards 5 and 2, and the last node holds the answer. "Skim off" along the path: 11 − 9 = 2, 9 − 5 = 4, 5 − 2 = 3, 2 − 0 = 2.

Motivations: peer-to-peer networks, social-networking systems
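The arithmetic of the example can be checked with a tiny sketch (the function name and list encoding are mine, purely for illustration):

```python
def skim_offs(rewards):
    """Given the chain of offers along the chosen path -- [initial utility,
    offer to node 1, ..., offer to the answer holder] -- return what each
    node on the paid path keeps.  The answer holder passes nothing on,
    so its outgoing offer is 0."""
    offers = rewards + [0]
    return [offers[i] - offers[i + 1] for i in range(len(rewards))]

print(skim_offs([11, 9, 5, 2]))   # [2, 4, 3, 2], matching the example
```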

2 / 20

SLIDE 3

Basic Questions

— How much reward is needed in order for the root to achieve a reasonable probability of obtaining an answer?
— What is the effect of the network topology?
— What is the strategic behavior of each node — how much can it skim off and yet still appear on the chosen path to the answer?

3 / 20

SLIDE 4

Game Formulation

For each node (player) v:
— (integer-valued) utility: reward r > 0 from its parent
— (integer-valued) strategy function: fv(r) < r, the reward offered to its children
— "skim off": r − fv(r)

Each node holds the answer with probability 1 − p > 0.
— answer rarity: n = 1/(1 − p) — one of every n nodes holds the answer on average

4 / 20

SLIDE 5

First Attack: Line Model

An infinite line L starting from the root v∗

(Figure: v∗ starts with initial utility r∗ and offers reward fv∗(r∗) to its neighbor.)

Auxiliary functions:
— αv(f, x): the probability that the nodes after v yield the answer, given that v offers reward x and that v does not possess the answer, where f = {fv : v ∈ L}
— βv(f, x) = 1 − αv(f, x)
— βv(f, x) = p · βw(f, fw(x)), for any v and its child w
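Unwinding the recursion, β collapses to p raised to the number of hops the reward survives. A minimal sketch, with a toy (non-equilibrium) strategy f(r) = r − 1 chosen only to make the recursion concrete:

```python
def beta(x, f, p):
    """beta_v(f, x): probability that the nodes after v do NOT yield the
    answer, when v offers x and every node plays strategy f.  Implements
    the recursion beta_v(f, x) = p * beta_w(f, f(x)); an offer of 0 stops
    propagation, so nothing beyond can answer (beta = 1)."""
    if x <= 0:
        return 1.0
    return p * beta(f(x), f, p)

# toy strategy: each node passes on one unit less than it received
print(beta(3, lambda r: r - 1, 0.5))   # 0.5 ** 3 = 0.125
```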

5 / 20

SLIDE 6

Existence of the Nash Equilibrium

Definition of the Nash equilibrium g (all functions gv are the same):
— base: gv(1) = 0 for all nodes v
— induction: assuming gv(x) has been defined for all v and all x < r, set
    gv(r) = argmax_x (r − x) · αv(g, x)

Theorem

The set of functions g is a Nash equilibrium.
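For the line, iterating βv = p · βw gives αv(g, x) = 1 − p^h(x), where h(x) counts how many hops the reward x survives, so the inductive definition can be computed directly. A minimal sketch (the reward cap R and dictionary layout are implementation choices, not from the paper):

```python
def line_equilibrium(p, R):
    """Inductively build the symmetric equilibrium strategy on the line:
    g(1) = 0, and g(r) maximizes (r - x) * (1 - p**h(x)), where h(x) is
    the number of iterations of g taking x down to 0."""
    g, h = {0: 0, 1: 0}, {0: 0, 1: 1}
    for r in range(2, R + 1):
        x = max(range(r), key=lambda x: (r - x) * (1 - p ** h[x]))
        g[r], h[r] = x, h[x] + 1
    return g, h

g, h = line_equilibrium(0.5, 10)   # e.g. g(2) = 1: skim 1 unit, pass 1 on
```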

6 / 20

SLIDE 7

Existence of the Nash Equilibrium

Idea: (r − x) · αv(g, x) is proportional to the portion of the payoff over which v "has control".

Proof: Given the initial utility r∗ at the root, and a choice of functions f at each node, consider the following four events:
— C: the query reaches v
— B: an answer is found in the sub-line starting from v (including v)
— A: the reward is propagated down to v
— D: v holds the answer
Define a random variable Yf,r∗ denoting the payoff to v, given f and r∗. It can then be shown that

E[Yf,r∗] = E[Yf,r∗ | A ∩ B ∩ C ∩ D] · Pr[A ∩ B ∩ C ∩ D] + E[Yf,r∗ | A ∩ B ∩ C ∩ D̄] · Pr[A ∩ B ∩ C ∩ D̄]
         = r · Pr[A ∩ B ∩ C ∩ D] + (r − fv(r)) · Pr[A | B ∩ C ∩ D̄] · αv(f, fv(r)) · Pr[C ∩ D̄]

7 / 20

SLIDE 8

Breakpoint Structure of Rewards

hop function h(r): the number of hops the initial reward r can be pushed
— the number of times the function g must be iterated to reduce an initial reward of r down to 0
— if the first node holding the answer is within h(r) hops of the root, everyone from the root to this node gets paid (otherwise, no one gets paid)

φ̂j: the probability that no node in the first j hops has the answer, given that the root does not
— βv∗(g, r) = φ̂_h(r)
— φ̂j = p · φ̂j−1 = p^j
— gv(r) = argmax_x (r − x)(1 − p^h(x))

breakpoint uj: the minimum initial reward r that can be pushed j hops — h(uj) = j
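Since g and h determine each other, the breakpoints can be read off while building g inductively. A sketch (the reward cap R and names are mine); already for n = 2 the uj grow quickly:

```python
def line_breakpoints(p, R):
    """Build h inductively via g(r) = argmax_x (r - x)(1 - p**h(x)) and
    h(r) = h(g(r)) + 1, and record u_j = min{r : h(r) = j}."""
    h, u = {0: 0, 1: 1}, {1: 1}
    for r in range(2, R + 1):
        x = max(range(r), key=lambda x: (r - x) * (1 - p ** h[x]))
        h[r] = h[x] + 1
        u.setdefault(h[r], r)      # first reward reaching j hops is u_j
    return u

print(line_breakpoints(0.5, 40))   # n = 2: u_1..u_4 = 1, 2, 5, 24
```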

8 / 20

SLIDE 9

Growth rate of the breakpoints

Let ∆j = uj − uj−1. Since gv(r) = argmax_x (r − x)(1 − p^h(x)), we have
    ∆j (1 − p^(j−1)) ≥ (∆j + 1)(1 − p^(j−2)),
and therefore
    ∆j ≥ (1/(1 − p)) · (1/p^(j−2) − 1) = n · ((n/(n − 1))^(j−2) − 1).
For rarity n, the distance from the root to the nearest answer is O(n) with high probability in the line model. It follows:

Theorem

In the line model, the utility required for v∗ to find the answer with constant probability is exponential in n.

9 / 20

SLIDE 10

Second Attack: Branching Process Model

— infinite complete d-ary tree T rooted at the node v∗
— each node in T is active independently with probability q
— random subtree T′ of T: the set of all nodes reachable from the root using paths consisting entirely of active nodes
  — T′ is a Galton-Watson tree generated from an offspring distribution that produces j children with probability C(d, j) · q^j · (1 − q)^(d−j) (binomial distribution)
  — expected number of offspring per node (branching factor): b = qd
— b < 1: T′ is almost surely finite
— b > 1: there is a positive probability of obtaining an infinite tree T′

Structural lower bound: For an infinite random tree T ′, the distance in T ′ to the nearest answer is O(log n) with high probability.
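A quick Monte Carlo sketch of the supercritical case (the parameters d = 4, q = 0.75 are mine, giving b = qd = 3 > 1; the population cap is only to keep the simulation cheap):

```python
import random

def survives(d, q, depth, cap=200):
    """Grow the tree of active nodes generation by generation: each of a
    live node's d potential children is active with probability q.
    Returns True if some node is still alive after `depth` generations."""
    alive = 1
    for _ in range(depth):
        alive = sum(1 for _ in range(alive * d) if random.random() < q)
        if alive == 0:
            return False
        alive = min(alive, cap)
    return True

random.seed(0)
trials = 200
frac = sum(survives(4, 0.75, 12) for _ in range(trials)) / trials
# with b = 3 > 1 the vast majority of trees survive;
# with b < 1 almost all would die out within a few generations
```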

10 / 20

SLIDE 11

Main Results

(Sufficient) competition makes incentive networks efficient.
— When b < 2, the utility required for v∗ to find the answer with constant probability σ is Ω(n^c): exponential in the path length from the root to the answer — knowing fewer than 2 people is expensive.
— When b > 2, the utility required for v∗ to find the answer with constant probability σ is O(log n): linear in the path length from the root to the answer.

11 / 20

SLIDE 12

Existence of the Nash Equilibrium

Auxiliary functions:
— αv(f, x): the probability that the subtree of T′ below v yields the answer, given that v offers reward x and that v does not possess the answer, where f = {fv : v ∈ T}
— βv(f, x) = 1 − αv(f, x)
— βv(f, x) = ∏_{w child of v} [1 − q(1 − p · βw(f, fw(x)))]

Definition of the Nash equilibrium g (all functions gv are the same):
— base: gv(1) = 0 for all nodes v
— induction: assuming gv(x) has been defined for all v and all x < r, set gv(r) = argmax_x (r − x − 1) · αv(g, x)
  — for a technical reason, a unit cost is placed on the effort of establishing the "connection" along the path from the root to the chosen node

Theorem

The set of functions g is a Nash equilibrium.

12 / 20

SLIDE 13

Breakpoint Structure of Rewards

hop function h(r): the number of times the function g must be iterated to reduce an initial reward of r down to 0

φ̂j: the probability that no node in the first j hops has the answer, given that the root does not
— βv∗(g, r) = φ̂_h(r)
— φ̂_{j+1} = (1 − q(1 − p · φ̂j))^d
— gv(r) = argmax_x (r − x − 1)(1 − φ̂_h(x))

breakpoint uj: the minimum initial reward r that can be pushed j hops — h(uj) = j
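The recurrence is straightforward to iterate numerically. A sketch, with a sanity check that d = 1, q = 1 collapses it to the line model's φ̂j = p^j:

```python
def phi_hat(p, q, d, j_max):
    """Iterate phi[j+1] = (1 - q*(1 - p*phi[j]))**d from phi[0] = 1
    (no node within 0 hops has the answer, given that the root has none)."""
    phi = [1.0]
    for _ in range(j_max):
        phi.append((1 - q * (1 - p * phi[-1])) ** d)
    return phi

print(phi_hat(0.5, 1.0, 1, 3))   # line model: [1.0, 0.5, 0.25, 0.125]
```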

13 / 20

SLIDE 14

Breakpoint Structure of Rewards

Inductive definition of uj
— base: u1 = 1, u2 = 2 (since g(1) = 0 and g(2) = 1)
— induction: assume that u1, . . . , uj have been defined; for a given utility r, define the linear functions
    ℓi(r) = (r − ui − 1)(1 − φ̂i),
  the payoff to the root when it offers reward ui with utility r
  — for all r ≥ uj−1, we have ℓj−1(r) > ℓj−2(r) > · · · > ℓ1(r)
  — at r = 1 + uj, we have ℓj(r) = 0, and hence ℓj(r) < ℓj−1(r)
  — uj+1 = ⌈yj+1⌉, where yj+1 is the reward value at the crossing point of ℓj and ℓj−1, i.e. the value y for which (y − uj − 1)(1 − φ̂j) = (y − uj−1 − 1)(1 − φ̂j−1)

Defining ∆′j = yj − uj−1 and ∆j = uj − uj−1, we have
    1 + ∆j/(∆′_{j+1} − 1) = (1 − φ̂j)/(1 − φ̂j−1)
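Solving the crossing equation for y in closed form gives a direct sketch of this inductive computation (function and variable names are mine):

```python
import math

def tree_breakpoints(p, q, d, j_max):
    """u_1 = 1, u_2 = 2, then u_{j+1} = ceil(y_{j+1}), where y_{j+1}
    solves (y - u_j - 1)(1 - phi_j) = (y - u_{j-1} - 1)(1 - phi_{j-1}),
    with phi_{j+1} = (1 - q*(1 - p*phi_j))**d and phi_0 = 1."""
    phi = [1.0]
    for _ in range(j_max + 1):
        phi.append((1 - q * (1 - p * phi[-1])) ** d)
    u = {1: 1, 2: 2}
    for j in range(2, j_max):
        a, b = 1 - phi[j], 1 - phi[j - 1]
        y = (a * (u[j] + 1) - b * (u[j - 1] + 1)) / (a - b)
        u[j + 1] = math.ceil(y)
    return u

# d = 1, q = 1 collapses phi to the line recurrence; the u_j blow up fast
print(tree_breakpoints(0.5, 1.0, 1, 6))
```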

14 / 20

SLIDE 15

Growth rate of the breakpoints

Rσ(n, b): the minimum utility needed by v∗ in order for the query process to yield an answer with probability at least σ
— given an answer rarity n = 1/(1 − p), a success probability σ, and a branching factor b = qd
— What is the asymptotic dependence of Rσ(n, b) on n and b?
— assumption: σ ≫ 1/n

Useful lemmas for the function t(x) = (1 − q(1 − px))^d:
— Claim: Fix ε such that 1/(dn) < ε < 1. If x ∈ [1 − ε, 1], then t′(x) ∈ [pb(1 − 2bdε), pb].
— Claim: 1 − b/n ≤ t(1) ≤ 1 − 1/(dn).
— Claim: Suppose p, b and ε are such that pb(1 − 2bdε) > 1, and let 0 < γ0 < γ1 ≤ ε. Let N(γ0, γ1) denote the number of iterations of the function t needed to reduce 1 − γ0 to a quantity that is ≤ 1 − γ1. Then N(γ0, γ1) = Θ(log(γ1/γ0)).
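The third claim can be checked numerically. A sketch with purely illustrative parameters (p = 1 − 10⁻⁹ makes n huge, so 1 − t(1) is negligible; q = 0.75, d = 4 give pb ≈ 3 > 1, and each iteration of t then multiplies the gap 1 − x by roughly pb):

```python
def iterations_to_drop(p, q, d, gamma0, gamma1):
    """N(gamma0, gamma1): iterations of t(x) = (1 - q*(1 - p*x))**d
    needed to push x = 1 - gamma0 down to x <= 1 - gamma1."""
    x, steps = 1 - gamma0, 0
    while x > 1 - gamma1:
        x = (1 - q * (1 - p * x)) ** d
        steps += 1
    return steps

p, q, d = 1 - 1e-9, 0.75, 4
n1 = iterations_to_drop(p, q, d, 1e-6, 1e-2)
n2 = iterations_to_drop(p, q, d, 1e-8, 1e-2)
# n2 - n1 is about log_3(100), i.e. logarithmic in the ratio gamma1/gamma0
```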

15 / 20

SLIDE 16

The case when b < 2

For a fixed b < 2, consider the sequence of φ̂j values up to the point at which it drops below 1 − σ0, for a small constant σ0 < σ
— pb(1 − 2bdε) > 1: for σ0 < σ small enough, and n large enough
— first segment: the set I1 of indices j for which φ̂j ≥ 1 − κ0/n, for a constant κ0
— second segment: the set I2 of indices j for which 1 − κ0/n > φ̂j ≥ 1 − σ0

Lemma: The first segment has length O(1) and the second segment has length Θ(log n).
Lemma: There is a constant b1 < 2 such that for all j in the second segment we have (1 − φ̂_{j+1})/(1 − φ̂j) ≤ b1.
16 / 20

SLIDE 17

The case when b < 2

Theorem

There is a constant c > 1, depending on b, such that if φ̂j < 1 − σ, then uj ≥ n^c. Hence Rσ(n, b) ≥ n^c.

Proof.

Since ∆j = uj − uj−1 (so that ∆2 = u2 − u1 = 1 = u1), we have
    ∆j = u1 · ∏_{i=3}^{j} (∆i/∆i−1).

Since 1 + ∆j/(∆′_{j+1} − 1) = (1 − φ̂j)/(1 − φ̂j−1), if j belongs to the second segment of indices, then
    ∆j+1/∆j ≥ ∆′_{j+1}/∆j ≥ (∆′_{j+1} − 1)/∆j = 1/((1 − φ̂j)/(1 − φ̂j−1) − 1) ≥ 1/(b1 − 1) > 1.

Let c0 = 1/(b1 − 1). Since the second segment has length ≥ τ log n for a constant τ > 0, we have
    uj ≥ ∆j = u1 · ∏_{i=3}^{j} (∆i/∆i−1) ≥ ∏_{i∈I2} (∆i/∆i−1) ≥ c0^|I2| ≥ c0^(τ log n) = n^(τ log c0).

17 / 20

SLIDE 18

Third Attack? : Directed Acyclic Graph Model

A model with real competition — in the tree model, only a limited form of competition exists.
— directed acyclic graph with a single root node v∗
— a node may receive different reward offers from different parents
— a node must take into account that its children can get competing offers

18 / 20

SLIDE 19

More Attacks...

— What if we have to piece together multiple answers? What if the overall utility is a formula over the individual answers? (a single-player case studied in Charikar et al. 2000)
— What if the answer is enhanced along the path back to the root?

19 / 20

SLIDE 20

Thank You!

Based on Dr. Raghavan's PowerPoint presentation (www.la-web.org/spire2005/raghavan.pdf). Thanks to Yan ZHANG for discussions on the line model.

20 / 20