Annoyed Users: Ads and Ad-Block Usage in the Wild Enric Pujol - - PowerPoint PPT Presentation

annoyed users ads and ad block usage in the wild
SMART_READER_LITE
LIVE PREVIEW

Annoyed Users: Ads and Ad-Block Usage in the Wild Enric Pujol - - PowerPoint PPT Presentation

Annoyed Users: Ads and Ad-Block Usage in the Wild Enric Pujol Oliver Hohlfeld Anja Feldmann TU Berlin RWTH Aachen TU Berlin IMC15 Tokyo, Japan 2 http://www.journalism.org/2015/04/29/digital-news-revenue-fact-sheet Page Fair and Adobe


slide-1
SLIDE 1

Annoyed Users: Ads and Ad-Block Usage in the Wild

IMC’15 Tokyo, Japan

Oliver Hohlfeld RWTH Aachen Anja Feldmann TU Berlin Enric Pujol TU Berlin

slide-2
SLIDE 2

2

slide-3
SLIDE 3

3

http://www.journalism.org/2015/04/29/digital-news-revenue-fact-sheet Page Fair and Adobe report on Ad Blocking

slide-4
SLIDE 4

4

Mozilla Add-on statistics

slide-5
SLIDE 5

Publishers reaction

5 The Guardian

slide-6
SLIDE 6

6

(Why can I not see BILD.de?)

slide-7
SLIDE 7

Acceptable ads program (non-intrusive ads)

7

White-listing within AdBlockPlus (ABP)

slide-8
SLIDE 8

Our work

8

Online advertising Residential Broadband Network Methodology based on AdBlockPlus Characterize ad traffic Ad-blocker usage

slide-9
SLIDE 9

9

slide-10
SLIDE 10

10

slide-11
SLIDE 11

11

URL

||ads.example.com^ $ script, image, domain=bar.com

www.bar.com

#document HTML HEAD BODY … IMG DOM Tree Content-Type Base URL

Is this information available in HTTP header traces?

http://ads.example. com/ad.gif www.foo.com

Content Policy

slide-12
SLIDE 12

12

||ads.example.com ^ $ script, image, domain=foo.com

slide-13
SLIDE 13

13

||ads.example.com ^ $ script, image, domain=foo.com

HTTP logs Bro IDS

slide-14
SLIDE 14

14

||ads.example.com ^ $ script, image, domain=foo.com

HTTP logs Bro IDS Map Web Pages ReferrerMap

slide-15
SLIDE 15

15

||ads.example.com ^ $ script, image, domain=foo.com

Classify requests libadblockplus + filter lists HTTP logs Bro IDS Map Web Pages ReferrerMap

slide-16
SLIDE 16

Limitations

16

HTTPS traffic Content-Type HTTP logs Referrer Map Web page Classification Hidden ads (CSS) List sanity

slide-17
SLIDE 17

Ad-blocker usage indicators

17

Low ratio of ad requests Easylist updates

AdBlock Plus

Caveat: ad-free content

Any ad-blocker

Caveat: HTTPS + NATs

How to find a suitable threshold?

slide-18
SLIDE 18

Active measurements

Values for the complete set of Alexa top 1K

Browser Mode #HTTP ad-requests ratio Vanilla 57.8K 16 % Adblock Plus + all lists 48.5K <0.1 %

18 28.10.2015

slide-19
SLIDE 19

When can we apply ad-ratio indicator?

At least 1K requests. Set threshold to 5%

19 28.10.2015

Vanilla: no plugin AdBP-Pa: adblockPlus + all lists

slide-20
SLIDE 20

Data sets

Trace RBN-1 RBN-2 Duration 4 days 15 and ½ hours Subscribers 7.5 K 19.7 K HTTPbytes 18.8 TB 11.4 TB HTTPrequests 131.95 M 85.09 M Characterize ad traffic Assess ad-blocker prevalence

20 28.10.2015

slide-21
SLIDE 21

How much ad-related traffic?

21 28.10.2015

Ad-related traffic EasyList and derivatives (55 %) EasyPrivacy (35 %) Acceptable ads (10 %) 0 % 18 % 100 % Black-listed (57%)

“Loose filters” @@||gstatic.com^$document

slide-22
SLIDE 22

How many ABP-capable browsers?

Requests: 50 % (all) 72 % (ads)

22 28.10.2015

500K pairs <IP, UserAgent> 1.6K unique UserAgent strings 9.2 K users Heavy hitters Mobile Firefox Chrome Safari IE 33 % 24 % 15 % 21 % 7 % Known browsers

slide-23
SLIDE 23

23

Ad Ratio EasyList

✘ ✘ ✘ ✔ ✔ ✘ ✔ ✔

28.10.2015

Browsers with no evidence of an adblocker: 46% of the ad traffic

slide-24
SLIDE 24

Inconsistent indicators a) various users in the same household

24 28.10.2015

Ad Ratio EasyList

✘ ✘ ✘ ✔ ✔ ✘ ✔ ✔

slide-25
SLIDE 25

Inconsistent indicators a) different plugin, or b) ad-free content

25 28.10.2015

Ad Ratio EasyList

✘ ✘ ✘ ✔ ✔ ✘ ✔ ✔

slide-26
SLIDE 26

Evidence of Adblock Plus for 22% of the users Still generate ad traffic!

26 28.10.2015

Ad Ratio EasyList

✘ ✘ ✘ ✔ ✔ ✘ ✔ ✔

?

slide-27
SLIDE 27

27

Most AdBlockPlus users do not change default settings

82% 11 % 7% EasyPrivacy Acceptable ads EasyList No ad blocker EasyPrivacy Acceptable ads

99% 87%

ABP users

90% 70%

How do users configure ABP?

AGGREGATED

slide-28
SLIDE 28

Summary

Can we use existing functionality to classify ad traffic? Yes, we can! (but limitations apply)

28 28.10.2015

slide-29
SLIDE 29

Summary

Can we use existing functionality to classify ad traffic? Yes, we can! (but limitations apply) Which traffic is potentially affected by Adblock Plus? ~18% requests and ~1% HTTP bytes Google, CDNs, Cloud Providers and dedicated ASes 10% of the ad-related traffic is white listed

29 28.10.2015

slide-30
SLIDE 30

Summary

Can we use existing functionality to classify ad traffic? Yes, we can! (but limitations apply) Which traffic is potentially affected by Adblock Plus? ~18% requests and ~1% HTTP bytes Google, CDNs, Cloud Providers and dedicated ASes 10% of the ad-related traffic conforms Acceptable Ads Can we measure how end users adopt ad-block plugins? Yes, in our traces ~20% of users Out-of-the-box i.e., acceptable ads / no EasyPrivacy

30 28.10.2015

slide-31
SLIDE 31

Thank you!

Can we use existing functionality to classify ad traffic? Yes, we can! (but limitations apply) Which traffic is potentially affected by Adblock Plus? ~18% requests and ~1% HTTP bytes Google, CDNs, Cloud Providers and dedicated ASes 10% of the ad-related traffic conforms Acceptable Ads Can we measure how end users adopt ad-block plugins? Yes, in our traces ~20% of users Out-of-the-box i.e., acceptable ads / no EasyPrivacy

31 28.10.2015

slide-32
SLIDE 32

32 28.10.2015