SLIDE 1

Don’t Use Computer Vision For Web Security

Florian Tramèr CV-COPS August 28th 2020

SLIDE 2

Computer Vision For Web Security

> Ad-blocking
> Anti-phishing
> Content takedown

(Most) users ingest web content visually Detection of undesirable content can (partially) be framed as a computer vision problem

“Is this image an ad?” “Does this webpage look similar to Google.com?” “Is this a video of a terrorist attack?”

SLIDE 3

Act I

Don’t Use Computer Vision For Client-Side Web Security


ML model is run on the user’s machine

SLIDE 4

An illustrative example: Ad-Blocking

“AdVersarial: Perceptual Ad Blocking meets Adversarial Machine Learning”

(with Pascal Dupré, Gili Rusak, Giancarlo Pellegrino and Dan Boneh) ACM CCS 2019, https://arxiv.org/abs/1811.03194


SLIDE 5

Why use CV for Ad-Blocking?

Humans should be able to recognize ads

SLIDE 6

Detecting ad-disclosures programmatically is hard!


Why use CV for Ad-Blocking?

SLIDE 7

Ad Highlighter [Storey et al., 2017]

> Traditional vision techniques (image hash, OCR)

Sentinel by Adblock Plus [Paraska, 2018]

> Locates ads in screenshots using neural networks

Percival by Brave [Din et al., 2019]

> CNN embedded in Chromium’s rendering pipeline

Perceptual Ad-Blocking

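The "traditional vision techniques" used by Ad Highlighter can be sketched in a few lines of numpy. This is a minimal, illustrative average-hash matcher, not Ad Highlighter's actual code; the function names and the random "icon" are made up for this sketch:

```python
import numpy as np

def average_hash(img: np.ndarray, size: int = 8) -> np.ndarray:
    """Downscale a grayscale image to size x size by block averaging,
    then threshold each cell against the mean: a 64-bit visual signature."""
    h, w = img.shape
    img = img[: h - h % size, : w - w % size]  # crop so blocks divide evenly
    blocks = img.reshape(size, img.shape[0] // size, size, img.shape[1] // size)
    small = blocks.mean(axis=(1, 3))
    return (small > small.mean()).flatten()

def hamming(a: np.ndarray, b: np.ndarray) -> int:
    return int(np.sum(a != b))

# An ad-blocker could match page images against stored hashes of known
# ad-disclosure logos, flagging anything within a small Hamming distance.
rng = np.random.default_rng(0)
icon = rng.random((64, 64))                  # stand-in for a disclosure logo
noisy = np.clip(icon + 0.01 * rng.standard_normal((64, 64)), 0, 1)
assert hamming(average_hash(icon), average_hash(noisy)) <= 16  # robust to noise
```

The matcher tolerates benign rendering noise, which is exactly why it is attackable: an adversary can perturb an ad disclosure just enough to push its hash outside the matching radius, or craft benign content that collides with it.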

SLIDE 8

The Problem: Adversarial Examples

Biggio et al. 2014, Szegedy et al. 2014, Goodfellow et al. 2015, ...
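On a linear toy model, the gradient-based evasion these papers describe can be shown exactly. A minimal numpy sketch of a gradient-sign (FGSM-style) attack on a hypothetical linear "ad detector"; the model and every name here are illustrative, not any deployed ad-blocker:

```python
import numpy as np

# Toy stand-in for a perceptual ad classifier: a linear score w.x + b.
rng = np.random.default_rng(1)
w = rng.standard_normal(100)
b = 0.0

def is_ad(x: np.ndarray) -> bool:
    return float(x @ w + b) > 0

def evade(x: np.ndarray) -> np.ndarray:
    """Gradient-sign step (Goodfellow et al. 2015). For a linear model the
    gradient of the score w.r.t. x is just w, so we can pick the smallest
    L-infinity budget eps that is guaranteed to flip the decision."""
    score = x @ w + b
    eps = 1.01 * score / np.abs(w).sum()
    return x - eps * np.sign(w)

x = rng.standard_normal(100)
if not is_ad(x):
    x = -x                      # start from an input the model flags as an ad
adv = evade(x)
assert is_ad(x) and not is_ad(adv)     # decision flipped...
assert np.max(np.abs(adv - x)) < 1.0   # ...by a small per-feature change
```

Against a neural network the gradient is computed by backpropagation rather than read off directly, but the attack loop is the same: follow the gradient of the detector's score until the decision flips, while keeping the perturbation imperceptible.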

SLIDE 9

How Secure is Perceptual Ad-Blocking?

SLIDE 10

How (In)secure is Perceptual Ad-Blocking?

Jerry uploads malicious content … so that Tom’s post gets blocked

SLIDE 11

How? Adversarial Examples (aka gradient descent)

> Nothing too special here

Why? Ad-blocking is the perfect threat model for adversarial examples

> This is the cool part!


Attacking Perceptual Ad-Blocking

SLIDE 12
  • 1. (There’s an adversary)
  • 2. Adv. cannot change the distribution of inputs

> Otherwise, Adv could just use a “test-set attack” (Gilmer et al. 2018)

  • 3. Adv. can only use “small” perturbations

> Otherwise, Adv could just change the class semantics

  • 4. Adv. has access to model weights or query API


The Adversarial Examples Threat Model

SLIDE 13
  • 1. There’s an adversary
  • 2. Adv. cannot change the distribution of inputs
  • 3. Adv. can only use “small” perturbations
  • 4. Adv. has access to model weights or query API


The Adversarial Examples Threat Model

Challenge: find a setting where this threat model is realistic

SLIDE 14
  • 1. There’s an adversary

> Web publishers and ad networks have a financial incentive to evade ad-blocking

  • 2. Adv. cannot change the distribution of inputs

> Ad campaigns are meticulously designed to maximize user engagement

  • 3. Adv. can only use “small” perturbations

> Website users should be unaffected and still click on ads!

  • 4. Adv. has access to model weights or query API

> Ad-blocker is run client-side so the model weights are public


The Ad-Blocking Threat Model

New challenge: find a setting other than ad-blocking where this threat model is realistic

SLIDE 15

Near-impossible to resist dynamic/adaptive attacks

True beyond ad-blocking:

> Don’t do client-side visual anti-phishing!

True beyond computer vision:

> Don’t use client-side ML models to detect spam or malware


Client-Side Web-Security is Hard

SLIDE 16
  • 1. Client-side black-lists:

> Signatures of known malware
> List of known phishing domains (e.g., Google Safe Browsing)
> Ad-blocking filter lists

  • 2. Server-side ML:

> Real-time spam & malware detection
> Content takedown
> What about computer vision?


So What Can We Do?

> Efficiency
> More features
> “Security by obscurity”
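The first option, client-side blacklists, can be sketched concretely: clients hold only short hash prefixes of bad URLs and consult the server only on a prefix hit. This is the rough idea behind Google Safe Browsing's update-based API; the sketch below is a simplification (the real protocol also canonicalizes URLs and confirms hits with a full-hash request), and the URLs and helper names are invented:

```python
import hashlib

# Clients store only 4-byte SHA-256 prefixes of blacklisted URLs; a prefix
# hit would trigger a full-hash confirmation with the server (not shown).
def url_prefix(url: str) -> bytes:
    return hashlib.sha256(url.encode()).digest()[:4]

PREFIX_BLACKLIST = {url_prefix(u) for u in [
    "http://evil.example/login",      # hypothetical phishing pages
    "http://phish.example/paypal",
]}

def might_be_malicious(url: str) -> bool:
    return url_prefix(url) in PREFIX_BLACKLIST

assert might_be_malicious("http://evil.example/login")
assert not might_be_malicious("https://en.wikipedia.org/")
```

Unlike a client-side ML model, the list reveals nothing useful to an attacker probing it (prefixes of known-bad URLs are not a gradient to descend), and it leaks little about the user: the server learns only prefix hits, not every page visited.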

SLIDE 17

Act II

Computer Vision In Server-Side Web Security: A Privacy Nightmare


SLIDE 18

Server-side ML = Server-side Data


The Problem

SLIDE 19

Does content security warrant sharing our...

  • Emails?

> It seems so

  • Downloaded apps?

> Google / Apple / ... already know this anyway

  • Website screenshots for ad-blocking or anti-phishing?

> That seems excessive...


Privacy vs Security: Choose One

SLIDE 20

Screenshot Sharing For Security is a Thing!

source: https://www.phish.ai/

SLIDE 21

Is visual anti-phishing secure?

> Can computer vision achieve low-enough false positives?
> Do phishing websites have to look similar to legitimate websites?
> Automated black-box attacks?

Is it private?

> Can browser extensions be tricked into screenshotting sensitive data?
> Can this data be extracted from trained neural nets?


Some Research Questions

SLIDE 22
  • 1. Don’t Use Computer Vision

Machine Learning For Client-Side Web Security

  • 2. Don’t collect screenshots from my browser!

⇒ Don’t Use Computer Vision For Web Security


Conclusion

“In fact, it’s better if you don’t use ML at all”

Questions? tramer@cs.stanford.edu