Diffusion of User Tracking Data in the Online Advertising Ecosystem
Muhammad Ahmad Bashir and Christo Wilson
Diffusion of User Tracking Data in the Online Advertising Ecosystem - - PowerPoint PPT Presentation
Diffusion of User Tracking Data in the Online Advertising Ecosystem Muhammad Ahmad Bashir and Christo Wilson Your Digital Privacy Footprint 2 Your Digital Privacy Footprint 2 Your Digital Privacy Footprint 2 Your Digital Privacy
Muhammad Ahmad Bashir and Christo Wilson
2
2
2
2
2
2
2
2
3
3
Ad Space Content User
3
Ad Space Content User 1
3
Ad Space Content User 1 Supply Side Platforms (SSPs) 2
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3 4 Demand Side Platforms (DSPs)
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3 4 Demand Side Platforms (DSPs) Advertisers 5
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3 4 Demand Side Platforms (DSPs) 6 Advertisers 5
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3 4 Demand Side Platforms (DSPs) 6 Advertisers 5
Although RightMedia does not win the auction, it still sees the impression
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3 4 Demand Side Platforms (DSPs) 6 Advertisers 5
Although RightMedia does not win the auction, it still sees the impression
3
Ad Space Content User 1 Supply Side Platforms (SSPs) Ad Exchanges 2 3 4 Demand Side Platforms (DSPs) 6 Advertisers 5
Although RightMedia does not win the auction, it still sees the impression
4
4
4
4
4
4
4
5
Model the Diffusion of Impressions in the Advertising Ecosystem
5
Model the Diffusion of Impressions in the Advertising Ecosystem
1. What fraction of user impressions are viewed by ad companies? 2. How much ad and tracker blocking extensions help?
5
Model the Diffusion of Impressions in the Advertising Ecosystem
1. What fraction of user impressions are viewed by ad companies? 2. How much ad and tracker blocking extensions help?
5
Model the Diffusion of Impressions in the Advertising Ecosystem
Key Terms:
6
7
retargeted ads through 150 top publishers.
7
retargeted ads through 150 top publishers.
7
retargeted ads through 150 top publishers.
7
Example Inclusion Chain
8
8
Inclusion Chains
8
Publishers A&A (Advertising and Analytics)
Inclusion Chains
8
Publishers A&A (Advertising and Analytics)
Graph Representation Inclusion Chains
8
Publishers A&A (Advertising and Analytics)
Graph Representation Inclusion Chains
8
1 1
Publishers A&A (Advertising and Analytics)
Graph Representation
1
Inclusion Chains
8
1 1
Publishers A&A (Advertising and Analytics)
Graph Representation
1
Inclusion Chains
8
1 1 1 1
Publishers A&A (Advertising and Analytics)
Graph Representation
1
Inclusion Chains
8
1 1 1 1 2
Publishers A&A (Advertising and Analytics)
Graph Representation Inclusion Chains
8
1 1 1 1 2
Publishers A&A (Advertising and Analytics)
Graph Representation
Nodes: Publishers or A&A domains Edges: Publisher —> A&A A&A —> A&A
Inclusion Chains
8
1 1 1 1 2
Publishers A&A (Advertising and Analytics)
Graph Representation
Nodes: Publishers or A&A domains Edges: Publisher —> A&A A&A —> A&A
Inclusion Chains
Nodes:
Edges:
9
9
9
9
10
Model the Diffusion of Impressions in the Advertising Ecosystem
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1].
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1]. 1. User generates an impression on N selected publishers.
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1]. 1. User generates an impression on N selected publishers. 2. Impressions are forwarded to A&A domains via:
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1]. 1. User generates an impression on N selected publishers. 2. Impressions are forwarded to A&A domains via:
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1]. 1. User generates an impression on N selected publishers. 2. Impressions are forwarded to A&A domains via:
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1]. 1. User generates an impression on N selected publishers. 2. Impressions are forwarded to A&A domains via:
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
We simulate browsing traces for 200 users using method from [1]. 1. User generates an impression on N selected publishers. 2. Impressions are forwarded to A&A domains via:
3. RTB winner is decided based on probability (function of edge weights).
11
[1]. Burken et al. User centric walk: An integrated approach for modeling the browsing behavior of users on the web. ASS 2005
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 e1
1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 e1 a2 a1
1 1 1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 e1 a2 a1 a5 a4
1 1 1 1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 p2 e1 a2 a1 a5 a4
1 1 1 1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 p2 e2 e1 a2 a1 a5 a4
1 1 1 1 1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 p2 e2 e1 a3 a2 a1 a5 a4
1 1 1 1 1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 p2 e2 e1 a3 a2 a1 a5 a4
1 1 1 2 1 1
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
12
Publisher Exchange DSP/Advertiser
Node Type
Direct Indirect
Activation
p1 p2 e2 e1 a3 a2 a1 a5 a4 p1 p2 e2 e1 a3 a2 a1 a5 a4
1 1 1 2 1 2
We manually select 37 exchanges which are allowed to forward indirect impressions to solicit bids during RTB
100 100 10 90 80 20 5 95
13
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Cookie Matching-Only RTB Constrained RTB Relaxed
We have 3 simulation models:
13
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Cookie Matching-Only RTB Constrained RTB Relaxed
We have 3 simulation models:
13
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Cookie Matching-Only RTB Constrained RTB Relaxed
We have 3 simulation models:
13
RTB-Relaxed
impressions in RTB-Constrained
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Cookie Matching-Only RTB Constrained RTB Relaxed
We have 3 simulation models:
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
14
DoubleClick OpenX PubMatic
15
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Disconnect Ghostery AdBlock Plus No Removal
15
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Disconnect Ghostery AdBlock Plus No Removal
15
Acceptable Ads program.
A&A domains even with blocking extensions.
0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 Fraction of A&A Domains (CDF) Fraction of Impressions Observed Disconnect Ghostery AdBlock Plus No Removal
16
16
EasyList Disconnect Ghostery AdBlock Plus No Blocking
Impressions Observed (%)
10 20 30 40 50 60 70 80 90 100
16
EasyList Disconnect Ghostery AdBlock Plus No Blocking
Impressions Observed (%)
10 20 30 40 50 60 70 80 90 100
16
EasyList Disconnect Ghostery AdBlock Plus No Blocking
Impressions Observed (%)
10 20 30 40 50 60 70 80 90 100
Domain Impression % google-analytics 97.0 youtube 91.7 quantserve 91.6 scorecardresearch 91.6 skimresources 91.3 twitter 91.1 pinterest 91.0 addthis 90.0 criteo 90.0 bluekai 90.8
Top 10 domains with most observed impressions under AdBlock Plus
(December 2015)
17
realistic conditions.
realistic conditions.
ahmad@ccs.neu.edu
http://personalization.ccs.neu.edu/Projects/AdGraphs/
20
21
22
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1
Inclusion of resources
a2
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1 a2
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1 a2
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1 a3 a2
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1 a3 a2
Inclusion of resources
22
<html> <body> <script src=“a1.com/cookie-match.js” </script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=“a2.com/pixel.jpg” /> <iframe src=“a3.com/banner.html”> <script src=“a4.com/ad.js”> </script> </iframe> </body> </html>
DOM Tree for http://p.com/index.html
p a1 a3 a2 a4
Inclusion of resources
23
(a) DOM Tree for http://p.com/index.html
<html> <body> <script src=”a1.com/cookie-match.js”></script> <!-- Tracking pixel inserted dynamically by cookie-match.js --> <img src=”a2.com/pixel.jpg”/> <iframe src=”a3.com/banner.html”> <script src=”a4.com/ads.js”></script> </iframe> </body> </html>
(d) Referer Graph (c) Inclusion Graph a1 a2 a4 a1 a2 a4 a3 (b) Inclusion Tree
p.com/index.html a1.com/cookie-match.js a2.com/pixel.jpg a3.com/banner.html a4.com/ads.js
p a3 p Publisher A&A
24
0.2 0.4 0.6 0.8 1
0.1 0.2 0.3 0.4 0.5 0.6 0.7 CDF Impression Fraction Difference (Burken - Random) Per A&A Node RTB Relaxed-Top 10% RTB Relaxed-Disconnect RTB Relaxed-Ghostery RTB Relaxed-Random 30% RTB Relaxed-No Blocking