Optimal ranking of online search requests for long-term revenue - PowerPoint PPT Presentation

1 Optimal ranking of online search requests for long-term revenue maximization Pierre L’Ecuyer Patrick Maill´ e, Nicol´ as Stier-Moses, Bruno Tuffin Technion, Israel, June 2016

2 Search engines ◮ Major role in the Internet economy ◮ most popular way to reach web pages ◮ 20 billion requests per month from US home and work computers only

2 Search engines ◮ Major role in the Internet economy ◮ most popular way to reach web pages ◮ 20 billion requests per month from US home and work computers only For a given (set of) keyword(s), a search engine returns a ranked list of links: the organic results. Organic results are supposed to be based on relevance only Is this true? Each engine has its own formula to measure (or estimate) relevance. May depend on user (IP address), location, etc.

4 How are items ranked? Relevance vs expected revenue?

9 barack obama basketball video - Google Search Google+ Search Images Maps Play YouTube News Gmail More Sign in Web Images Videos News Maps Books About 11,000,000 results Any country Country: Canada Any time Past hour Past 24 hours Past week Past month Past year All results ► 1:31 Verbatim Barack Obama playing Basketball Game. AMAZING FOOTAGE ... https://www.youtube.com/watch?v=0OIDdGQQ0L8 Images for barack obama basketball video Barack Obama's basketball fail - YouTube https://www.youtube.com/watch?v=gmTfKPx1Cug 1 Apr 2013 - 1 min - Uploaded by The Telegraph https://www.google.ca/search?q=barack+obama+basketball+video&ie=utf-8&oe=utf-8&gws_rd=cr&ei=jBpBVqaLL8jGesPTjYgL[2015-11-09 17:11:24]

11 Do search engines return biased results? Comparison between Google, Bing, and Blekko (Wright, 2012): ◮ Microsoft content is 26 times more likely to be displayed on the first page of Bing than on any of the two other search engines ◮ Google content appears 17 times more often on the first page of a Google search than on the other search engines Search engines do favor their own content

12 Do search engines return biased results? 100 99 . 2 98 . 4 97 . 9 98 97 . 5 Percentage 96 95 . 3 95 . 1 94 . 4 94 93 . 4 Top 1 Top 3 Top 5 First page Google Microsoft (Bing) Percentage of Google or Bing search results with own content not ranked similarly by any rival search engine (Wright, 2012).

13 Search Neutrality (relevance only) Some say search engines should be considered as a public utility. Idea of search neutrality: All content having equivalent relevance should have the same chance of being displayed. Content of higher relevance should never be displayed in worst position. More fair, better for users and for economy, encourages quality, etc. What is the precise definition of “relevance”? Not addressed here ... Debate: Should neutrality be imposed by law? Pros and cons. Regulatory intervention: The European Commission, is progressing toward an antitrust settlement deal with Google. “Google must be even-handed. It must hold all services, including its own, to exactly the same standards, using exactly the same crawling, indexing, ranking, display, and penalty algorithms.”

14 In general: trade-off in the rankings From the viewpoint of the SE: Tradeoff between ◮ relevance (long term profit) versus ◮ expected revenue (short term profit) Better relevance brings more customers in the long term because it builds reputation. What if the provider wants to optimize its long-term profit?

15 Simple model of search requests Request: random vector Y = ( M , R 1 , G 1 , . . . , R M , G M ) where M = number of pages (or items) that match the request, M ≤ m 0 ; R i ∈ [0 , 1]: measure of relevance of item i ; G i ∈ [0 , K ]: expected revenue (direct or indirect) from item i . has a prob. distribution over Ω ⊆ N × ([0 , 1] × [0 , K ]) m 0 . Can be discrete or continuous. y = ( m , r 1 , g 1 , . . . , r m , g m ) denotes a realization of Y . c i , j ( y ) = P [click page i if in position j ] = click-through rate (CTR). Assumed ր in r i and ց in j . Example: c i , j ( y ) = θ j ψ ( r i )

15 Simple model of search requests Request: random vector Y = ( M , R 1 , G 1 , . . . , R M , G M ) where M = number of pages (or items) that match the request, M ≤ m 0 ; R i ∈ [0 , 1]: measure of relevance of item i ; G i ∈ [0 , K ]: expected revenue (direct or indirect) from item i . has a prob. distribution over Ω ⊆ N × ([0 , 1] × [0 , K ]) m 0 . Can be discrete or continuous. y = ( m , r 1 , g 1 , . . . , r m , g m ) denotes a realization of Y . c i , j ( y ) = P [click page i if in position j ] = click-through rate (CTR). Assumed ր in r i and ց in j . Example: c i , j ( y ) = θ j ψ ( r i ) Decision (ranking) for any request y : Permutation π = ( π (1) , . . . , π ( m )) of the m matching pages. j = π ( i ) = position of i . Local relevance and local revenue for y and π : m m � � r ( π, y ) = c i ,π ( i ) ( y ) r i , g ( π, y ) = c i ,π ( i ) ( y ) g i . i =1 i =1

16 Deterministic stationary ranking policy µ It assigns a permutation π = µ ( y ) ∈ Π m to each y ∈ Ω. Long-term expected relevance per request (reputation of the provider) and expected revenue per request (from the organic links), for given µ : = r ( µ ) = E Y [ r ( µ ( Y ) , Y )] , r g = g ( µ ) = E Y [ g ( µ ( Y ) , Y )] .

16 Deterministic stationary ranking policy µ It assigns a permutation π = µ ( y ) ∈ Π m to each y ∈ Ω. Long-term expected relevance per request (reputation of the provider) and expected revenue per request (from the organic links), for given µ : = r ( µ ) = E Y [ r ( µ ( Y ) , Y )] , r g = g ( µ ) = E Y [ g ( µ ( Y ) , Y )] . Objective: Maximize long-term utility function ϕ ( r , g ). Assumption: ϕ is strictly increasing in both r and g . Example : expected revenue per unit of time ϕ ( r , g ) = λ ( r )( β + g ) , where λ ( r ) = arrival rate of requests, strictly increasing in r ; β = E [revenue per request] from non-organic links (ads on root page); g = E [revenue per request] from organic links.

16 Deterministic stationary ranking policy µ It assigns a permutation π = µ ( y ) ∈ Π m to each y ∈ Ω. Q: Is this the most general type of policy? Long-term expected relevance per request (reputation of the provider) and expected revenue per request (from the organic links), for given µ : = r ( µ ) = E Y [ r ( µ ( Y ) , Y )] , r g = g ( µ ) = E Y [ g ( µ ( Y ) , Y )] . Objective: Maximize long-term utility function ϕ ( r , g ). Assumption: ϕ is strictly increasing in both r and g . Example : expected revenue per unit of time ϕ ( r , g ) = λ ( r )( β + g ) , where λ ( r ) = arrival rate of requests, strictly increasing in r ; β = E [revenue per request] from non-organic links (ads on root page); g = E [revenue per request] from organic links.

17 Randomized stationary ranking policy ˜ µ µ ( y ) = { q ( π, y ) : π ∈ Π m } ˜ is a probability distribution, for each y = ( m , r 1 , g 1 , . . . , r m , g m ) ∈ Ω. Expected relevance �� M � � r = r (˜ µ ) = E Y q ( π, Y ) c i ,π ( i ) ( Y ) R i i =1 π Expected revenue �� M � � g = g (˜ µ ) = E Y q ( π, Y ) c i ,π ( i ) ( Y ) G i i =1 π

17 Randomized stationary ranking policy ˜ µ µ ( y ) = { q ( π, y ) : π ∈ Π m } ˜ is a probability distribution, for each y = ( m , r 1 , g 1 , . . . , r m , g m ) ∈ Ω. Let z i , j ( y ) = P [ π ( i ) = j ] under ˜ µ . Expected relevance   �� M � M M � � � r = r (˜ µ ) = E Y q ( π, Y ) c i ,π ( i ) ( Y ) R i = E Y z i , j ( Y ) c i , j ( Y ) R i   i =1 i =1 j =1 π Expected revenue   �� M � M M � � �  . g = g (˜ µ ) = E Y q ( π, Y ) c i ,π ( i ) ( Y ) G i = E Y z i , j ( Y ) c i , j ( Y ) G i  i =1 i =1 j =1 π In terms of ( r , g ), we can redefine (simpler) ˜ µ ( y ) = Z ( y ) = { z i , j ( y ) ≥ 0 : 1 ≤ i , j ≤ m } (doubly stochastic matrix).

18 Q: Here we have a stochastic dynamic programming problem, but the rewards are not additive! Usual DP techniques do not apply. How can we compute an optimal policy? Seems very hard in general!

19 Optimization problem max ϕ ( r , g ) = λ ( r )( β + g ) µ ∈ ˜ ˜ U subject to   M M � � r = E Y z i , j ( Y ) c i , j ( Y ) R i   i =1 j =1   M M � � = z i , j ( Y ) c i , j ( Y ) G i g E Y   i =1 j =1 µ ( y ) ˜ = Z ( y ) = { z i , j ( y ) : 1 ≤ i , j ≤ m } for all y ∈ Ω .

19 Optimization problem max ϕ ( r , g ) = λ ( r )( β + g ) µ ∈ ˜ ˜ U subject to   M M � � r = E Y z i , j ( Y ) c i , j ( Y ) R i   i =1 j =1   M M � � = z i , j ( Y ) c i , j ( Y ) G i g E Y   i =1 j =1 µ ( y ) ˜ = Z ( y ) = { z i , j ( y ) : 1 ≤ i , j ≤ m } for all y ∈ Ω . To each ˜ µ corresponds ( r , g ) = ( r (˜ µ ) , g (˜ µ )). µ ∈ ˜ Proposition: The set C = { ( r (˜ µ ) , g (˜ µ )) : ˜ U} is convex. Optimal value: ϕ ∗ = max ( r , g ) ∈C ϕ ( r , g ) = ϕ ( r ∗ , g ∗ ) (optimal pair). Idea: find ( r ∗ , g ∗ ) and recover an optimal policy from it.

20 level curves of ϕ ( r , g ) g ( r ∗ , g ∗ ) • C r

21 ∇ ϕ ( r ∗ , g ∗ ) ′ ( r − r ∗ , g − g ∗ ) = 0 level curves of ϕ ( r , g ) g ( r ∗ , g ∗ ) • C r

Optimal ranking of online search requests for long-term revenue - PowerPoint PPT Presentation

1 Optimal ranking of online search requests for long-term revenue maximization Pierre LEcuyer Patrick Maill e, Nicol as Stier-Moses, Bruno Tuffin Technion, Israel, June 2016 2 Search engines Major role in the Internet economy

Online Submodular Set Cover, Ranking, and Repeated Active Learning Online Ranking: At each round,

Easy and Hard Outline Constraint Ranking in OT The Constraint Ranking problem Making fast

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

The short- -term and long term and long- -term term The short stratospheric and tropospheric

Ranking candidate genes from Ranking candidate genes from perturbation experiments Niko

TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at

South Burlington School District Proposed Long-Term Bond Why issue a long term bond? Entities

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

Online Ranking Combination Erzs ebet Frig o Institute for Computer Science and Control (MTA

Spectral Method and Regularized MLE Are Both Optimal for Top- K Ranking Cong Ma ORFE, Princeton

CAPITAL OUTLAY PROJECT REQUESTS FY 2018 - 2019 M ID F EBRUARY 2018: C APITAL O UTLAY REQUESTS ARE

Tabu Search Search Tabu Page 1 Part I Part I Tabu Search Principles Search Principles Tabu

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Informed search algorithms Outline Best-first search Greedy best-first search A *

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

Statistical Ranking Problem Tong Zhang Statistics Department, Rutgers University Ranking

Machine Learning 11 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 11 1 11 Machine Learning

Policy tradeoffs under risk of abrupt climate change by Y. Tsur and A. Zemel Comments by

The algebra of integrated partial belief systems Manuele Leonelli 1 , Eva Riccomagno 2 , James Q.

Dark sectors and missing energy searches at the LHC Tongyan Lin UC Berkeley / LBL October 9,

DATA ANALYTICS USING DEEP LEARNING GT 8803 // FALL 2018 // JOY ARULRAJ L E C T U R E # 0 2 : A

Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory Landau Institute for

AST 1420 Galactic Structure and Dynamics Today: dynamics of stars in galactic disks Last

Adventures of a Long-Range Walker Thierry DAUXOIS CNRS & ENS Lyon 1 Stefano Ruffo

Sambuz

Useful Links

Newsletter

Mail Us

Optimal ranking of online search requests for long-term revenue - PowerPoint PPT Presentation

1 Optimal ranking of online search requests for long-term revenue maximization Pierre LEcuyer Patrick Maill e, Nicol as Stier-Moses, Bruno Tuffin Technion, Israel, June 2016 2 Search engines Major role in the Internet economy

Online Submodular Set Cover, Ranking, and Repeated Active Learning Online Ranking: At each round,

Easy and Hard Outline Constraint Ranking in OT The Constraint Ranking problem Making fast

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

The short- -term and long term and long- -term term The short stratospheric and tropospheric

Ranking candidate genes from Ranking candidate genes from perturbation experiments Niko

TVM for Ads Ranking @ Facebook Hao Lu, Ansha Yu, Yinghai Lu, Andrew Tulloch Ads Ranking at

South Burlington School District Proposed Long-Term Bond Why issue a long term bond? Entities

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

Online Ranking Combination Erzs ebet Frig o Institute for Computer Science and Control (MTA

Spectral Method and Regularized MLE Are Both Optimal for Top- K Ranking Cong Ma ORFE, Princeton

CAPITAL OUTLAY PROJECT REQUESTS FY 2018 - 2019 M ID F EBRUARY 2018: C APITAL O UTLAY REQUESTS ARE

Tabu Search Search Tabu Page 1 Part I Part I Tabu Search Principles Search Principles Tabu

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Informed search algorithms Outline Best-first search Greedy best-first search A *

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

Statistical Ranking Problem Tong Zhang Statistics Department, Rutgers University Ranking

Machine Learning 11 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 11 1 11 Machine Learning

Policy tradeoffs under risk of abrupt climate change by Y. Tsur and A. Zemel Comments by

The algebra of integrated partial belief systems Manuele Leonelli 1 , Eva Riccomagno 2 , James Q.

Dark sectors and missing energy searches at the LHC Tongyan Lin UC Berkeley / LBL October 9,

DATA ANALYTICS USING DEEP LEARNING GT 8803 // FALL 2018 // JOY ARULRAJ L E C T U R E # 0 2 : A

Konstantin Turitsyn T-4 &amp; CNLS, Los Alamos National Laboratory Landau Institute for

AST 1420 Galactic Structure and Dynamics Today: dynamics of stars in galactic disks Last

Adventures of a Long-Range Walker Thierry DAUXOIS CNRS &amp; ENS Lyon 1 Stefano Ruffo

Sambuz

Useful Links

Newsletter

Mail Us

Konstantin Turitsyn T-4 & CNLS, Los Alamos National Laboratory Landau Institute for

Adventures of a Long-Range Walker Thierry DAUXOIS CNRS & ENS Lyon 1 Stefano Ruffo