http://cs246.stanford.edu Web advertising We discussed how to - PowerPoint PPT Presentation

CS246: ¡Mining ¡Massive ¡Datasets ¡ Jure ¡Leskovec, ¡ Stanford ¡University ¡ http://cs246.stanford.edu ¡

� Web ¡advertising ¡ � We ¡discussed ¡how ¡to ¡ ¡ match ¡advertisers ¡to ¡ ¡ queries ¡in ¡real-‑time ¡ ¡ � But ¡we ¡did ¡not ¡discuss ¡ ¡ how ¡to ¡estimate ¡CTR ¡ � Recommendation ¡engines ¡ � We ¡discussed ¡how ¡to ¡build ¡ recommender ¡systems ¡ � But ¡we ¡did ¡not ¡discuss ¡ the ¡cold ¡start ¡problem ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 2 ¡

� What ¡do ¡CTR ¡and ¡ cold ¡start ¡have ¡in ¡ ¡ common? ¡ � With ¡every ¡ad ¡we ¡show/ ¡ product ¡we ¡recommend ¡ we ¡gather ¡more ¡data ¡ about ¡the ¡ad/product ¡ ¡ � Theme: ¡Learning ¡through ¡ experimentation ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 3 ¡

� �� ¡ � The ¡old ¡way: ¡Pay ¡by ¡impression ¡ � Best ¡strategy: ¡Go ¡with ¡the ¡highest ¡bidder ¡ � �� ¡ � The ¡new ¡way: ¡ Pay ¡per ¡click! ¡ � Best ¡strategy: ¡Go ¡with ¡expected ¡revenue ¡ � �� i ¡ for ¡query ¡q ? ¡ � E[revenue i,q ] ¡= ¡P(click i ¡| ¡q) ¡* ¡amount i,q ¡ ¡ Bid amount for Prob. user will click on ad i given ad i on query q that she issues query q (Known) (Unknown! Need to gather information) 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 4 ¡

� Clinical ¡trials: ¡ � Investigate ¡effects ¡of ¡different ¡treatments ¡while ¡ minimizing ¡patient ¡losses ¡ � Adaptive ¡routing: ¡ � Minimize ¡delay ¡in ¡the ¡network ¡by ¡investigating ¡ different ¡routes ¡ � Asset ¡pricing: ¡ � Figure ¡out ¡product ¡prices ¡while ¡trying ¡to ¡make ¡ most ¡money ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 5 ¡

3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 6 ¡

3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 7 ¡

� Each ¡arm ¡ i ¡ � Wins ¡(reward= 1 ) ¡with ¡fixed ¡(unknown) ¡prob. ¡ � i ¡ � Loses ¡(reward= 0 ) ¡with ¡fixed ¡(unknown) ¡prob. ¡ 1-‑ � i ¡ � All ¡draws ¡are ¡independent ¡given ¡ � 1 ¡ �� k ¡ � How ¡to ¡pull ¡arms ¡to ¡maximize ¡total ¡reward? ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 8 ¡

� How ¡does ¡this ¡map ¡to ¡our ¡setting? ¡ � Each ¡ query ¡ is ¡a ¡ bandit ¡ � Each ¡ ad ¡ is ¡an ¡ arm ¡ � We ¡want ¡to ¡estimate ¡the ¡ �� of ¡ winning ¡ � i ¡ (i.e., ¡ �� i ) ¡ � �� ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 9 ¡

The ¡setting: ¡ � Set ¡of ¡ k ¡ choices ¡(arms) ¡ � Each ¡choice ¡ i ¡is ¡associated ¡with ¡unknown ¡ probability ¡distribution ¡ P i ¡ supported ¡in ¡ [0,1] ¡ � We ¡play ¡the ¡game ¡for ¡ T ¡rounds ¡ � In ¡each ¡round ¡ t : ¡ ¡ � (1) ¡ We ¡pick ¡some ¡arm ¡ j ¡ ¡ � (2) ¡We ¡obtain ¡random ¡sample ¡ X t ¡from ¡ P j ¡ ¡ � Note ¡reward ¡is ¡independent ¡of ¡previous ¡draws ¡ � � Our ¡goal ¡is ¡to ¡maximize ¡ � � � ¡ �� i ! ¡But ¡every ¡time ¡we ¡ ¡ pull ¡some ¡arm ¡ i ¡we ¡get ¡to ¡learn ¡a ¡bit ¡about ¡ � i ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 10 ¡

� Online ¡optimization ¡with ¡limited ¡feedback ¡ Choices ¡ X 1 ¡ X 2 ¡ X 3 ¡ X 4 ¡ X 5 ¡ X 6 ¡ � ¡ ¡ a 1 ¡ 1 ¡ 1 ¡ ¡ a 2 ¡ 0 ¡ 1 ¡ 0 ¡ ¡ � ¡ ¡ a k ¡ 0 ¡ ¡ Time � Like ¡in ¡online ¡algorithms: ¡ � Have ¡to ¡make ¡a ¡choice ¡each ¡time ¡ � But ¡we ¡only ¡receive ¡information ¡about ¡the ¡ chosen ¡action ¡ ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 11 ¡

� Policy : ¡a ¡strategy/rule ¡that ¡in ¡each ¡iteration ¡ tells ¡me ¡which ¡arm ¡to ¡pull ¡ ¡ � Hopefully ¡policy ¡depends ¡on ¡the ¡history ¡of ¡rewards ¡ ¡ � How ¡to ¡quantify ¡performance ¡of ¡the ¡ algorithm? ¡Regret! ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 12 ¡

� Let ¡be ¡ � � ¡ the ¡mean ¡of ¡ � � ¡ � Payoff/reward ¡of ¡ best ¡arm : ¡ � � � �� ¡ � � Let ¡ � � � � � � � � ¡be ¡the ¡sequence ¡of ¡arms ¡pulled ¡ � Instantaneous ¡ regret ¡at ¡time ¡ � : ¡ � � � � � � � � ¡ � Total ¡regret: ¡ ¡ ¡ � � � � � � � ¡ �� Typical ¡goal: ¡Want ¡a ¡policy ¡(arm ¡allocation ¡ � � � � � ¡as ¡ � � � ¡ strategy) ¡that ¡guarantees: ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 13 ¡

� If ¡we ¡knew ¡the ¡payoffs, ¡which ¡arm ¡would ¡we ¡ pull? ¡ ¡ �� ¡ � ¡ � What ¡if ¡we ¡only ¡care ¡about ¡estimating ¡ ¡ payoffs ¡ � � ? ¡ � � Pick ¡each ¡arm ¡equally ¡often: ¡ � ¡ � � � � � � Estimate: ¡ � � � � � �� ¡ �� Regret: ¡ � � � ¡ � 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 14 ¡

� Regret ¡is ¡defined ¡in ¡terms ¡of ¡average ¡reward ¡ � So ¡if ¡we ¡can ¡estimate ¡avg. ¡reward ¡we ¡can ¡ minimize ¡regret ¡ � Consider ¡algorithm: ¡Greedy ¡ Take ¡the ¡action ¡with ¡the ¡highest ¡avg. ¡reward ¡ � Example: ¡ Consider ¡2 ¡actions ¡ � A1 ¡reward ¡1 ¡with ¡prob. ¡0.3 ¡ ¡ � A2 ¡has ¡reward ¡1 ¡with ¡prob. ¡0.7 ¡ � Play ¡ A1 , ¡get ¡reward ¡1 ¡ � Play ¡ A2 , ¡get ¡reward ¡0 ¡ � Now ¡avg. ¡reward ¡of ¡ A1 ¡will ¡never ¡drop ¡to ¡0, ¡ ¡ and ¡we ¡will ¡never ¡play ¡action ¡ A2 ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 15 ¡

� The ¡example ¡illustrates ¡a ¡classic ¡problem ¡in ¡ ¡ decision ¡making: ¡ � We ¡need ¡to ¡trade ¡off ¡ exploration ¡(gathering ¡data ¡ about ¡arm ¡payoffs) ¡and ¡ exploitation ¡(making ¡ decisions ¡based ¡on ¡data ¡already ¡gathered) ¡ ¡ � The ¡Greedy ¡does ¡not ¡explore ¡ �� ¡ � Exploration: ¡Pull ¡an ¡arm ¡we ¡never ¡pulled ¡before ¡ � Exploitation: ¡Pull ¡an ¡arm ¡for ¡which ¡we ¡currently ¡ have ¡the ¡highest ¡estimate ¡of ¡ � � ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 16 ¡

� The ¡problem ¡with ¡our ¡ Greedy ¡algorithm ¡is ¡ that ¡it ¡is ¡ too ¡certain ¡in ¡the ¡estimate ¡of ¡ � � ¡ � When ¡we ¡have ¡seen ¡a ¡single ¡reward ¡of ¡0 ¡we ¡ �� conclude ¡the ¡average ¡reward ¡is ¡0 ¡ ¡ � Greedy ¡does ¡not ¡explore ¡ �� ¡ ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 17 ¡

Algorithm: ¡Epsilon-‑Greedy ¡ � For ¡t=1:T ¡ � Set ¡ � � � �� ¡ � With ¡prob. ¡ � � : ¡ Explore ¡by ¡picking ¡an ¡arm ¡chosen ¡ uniformly ¡at ¡random ¡ � With ¡prob. ¡ � � � � : ¡ Exploit ¡by ¡picking ¡an ¡arm ¡with ¡ highest ¡empirical ¡mean ¡payoff ¡ � �� ¡ For ¡suitable ¡choice ¡of ¡ � � ¡it ¡holds ¡that ¡ � � � �� ¡ � 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 18 ¡

� What ¡are ¡some ¡issues ¡with ¡ Epsilon ¡Greedy ? ¡ � �� : ¡Algorithm ¡explicitly ¡distinguishes ¡ between ¡exploration ¡and ¡exploitation ¡ ¡ � More ¡importantly: ¡Exploration ¡makes ¡ suboptimal ¡ choices ¡ (since ¡it ¡picks ¡any ¡arm ¡equally ¡likely) ¡ ¡ � Idea: ¡When ¡exploring/exploiting ¡we ¡need ¡to ¡ compare ¡ arms ¡ 3/7/2013 ¡ Jure ¡Leskovec, ¡Stanford ¡CS246: ¡Mining ¡Massive ¡Datasets, ¡http://cs246.stanford.edu ¡ 19 ¡

http://cs246.stanford.edu Web advertising We discussed how to - PowerPoint PPT Presentation

CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu Web advertising We discussed how to match advertisers to queries in real-time

http://cs246.stanford.edu Web pages are not equally important www.joe-schmoe.com vs.

http://cs246.stanford.edu Instructor: Jure Leskovec TAs: Aditya Parameswaran

http://cs246.stanford.edu CPU Machine Learning, Statistics Memory Classical Data Mining

http://cs246.stanford.edu High dim. Graph Infinite Machine Apps data data data learning

http://cs246.stanford.edu More algorithms for streams: (1) Filtering a data stream: Bloom

http://cs246.stanford.edu High-dimension == many features Find concepts/topics/genres:

http://cs246.stanford.edu High dim. High dim. Graph Graph Infinite Infinite Machine Machine Apps

http://cs246.stanford.edu Classic model of algorithms You get to see the entire input, then

http://cs246.stanford.edu Rank nodes using link structure PageRank: Link voting: P

http://cs246.stanford.edu Web advertising Weve learned how to match advertisers to

http://cs246.stanford.edu Web advertising We discussed how to match advertisers to queries

http://cs246.stanford.edu Web advertising We discussed how to match advertisers to

http://cs246.stanford.edu Training data 100 million ratings, 480,000 users, 17,770 movies

http://cs246.stanford.edu TAs : Bahman Bahmani Juthika Dabholkar Pierre Kreitmann

http://cs246.stanford.edu High dimensional == many features Find

http://cs246.stanford.edu Supermarket shelf management Market-basket model: Goal: Identify

1. If no action is taken by MCA, such exclusivity shall be limited as follows: i. East of the

New State Laws and Proposed Federal Legislation Rache hel A l Ande nderson, son, Data ta

The first-ever Sitecore development accelerator Adam Najmanowicz Cognifide adam@cognifide.com

State Board Meeting August 26, 2020 9:00 to 12:40 Meeting to be held electronically by ZOOM

Building profitable business together

Changing with the times Tatton Group Forums Ryan Seaton www.tattoninvestments.com | Tel:

FINDING A POSTDOCTORAL POSITION IN MATHEMATICS. LAUREN WILLIAMS The process of applying for a

Advice Complexity of Adaptive Priority Algorithms: Part 2 Lower Bounds Joan Boyar 1 , Kim