MetaCAPTCHA: A Metamorphic Throttling Service for the Web Akshay - - PowerPoint PPT Presentation

metacaptcha a metamorphic throttling service for the web
SMART_READER_LITE
LIVE PREVIEW

MetaCAPTCHA: A Metamorphic Throttling Service for the Web Akshay - - PowerPoint PPT Presentation

Introduction System Architecture Evaluations References MetaCAPTCHA: A Metamorphic Throttling Service for the Web Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng { akshay, buithai, letien, nhhuyng, wuchang } @cs.pdx.edu Portland


slide-1
SLIDE 1

Introduction System Architecture Evaluations References

MetaCAPTCHA: A Metamorphic Throttling Service for the Web

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng

{akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu Portland State University

October 21, 2014

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 1/30

slide-2
SLIDE 2

Introduction System Architecture Evaluations References

1 Introduction

The problem Current Prevention Methods Our Approach

2 System Architecture

System Overview Variable Cost Function Puzzles

3 Evaluations

Experimental Setup Defense-in-Depth Conclusions

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 2/30

slide-3
SLIDE 3

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Traditional email spam

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 3/30

slide-4
SLIDE 4

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

The market is moving to social Spam

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 4/30

slide-5
SLIDE 5

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Difficult to detect kind of spam

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 5/30

slide-6
SLIDE 6

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

As a result

Email spam is reducing but social spam is edging up, with

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 6/30

slide-7
SLIDE 7

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

As a result

Email spam is reducing but social spam is edging up, with Large volume Four million Facebook users receive spam each day in 2011 [5, 9]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 6/30

slide-8
SLIDE 8

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

As a result

Email spam is reducing but social spam is edging up, with Large volume Four million Facebook users receive spam each day in 2011 [5, 9] Fast growth Cost businesses $20.5 billion annually and projected to $198 billion in the next four years [12]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 6/30

slide-9
SLIDE 9

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

As a result

Email spam is reducing but social spam is edging up, with Large volume Four million Facebook users receive spam each day in 2011 [5, 9] Fast growth Cost businesses $20.5 billion annually and projected to $198 billion in the next four years [12] High conversion rate The “clickthrough” rate of spam URLs on Twitter was almost two times higher than email spam in 2010 [6]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 6/30

slide-10
SLIDE 10

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Two main methods

CAPTCHAs proof-of-work

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 7/30

slide-11
SLIDE 11

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

CAPTCHAs

  • can prevent bots effectively ..

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 8/30

slide-12
SLIDE 12

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

CAPTCHAs

  • can prevent bots effectively .. as long as there aren’t OCR

algorithms that can solve it [13]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 8/30

slide-13
SLIDE 13

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

CAPTCHAs

  • can prevent bots effectively .. as long as there aren’t OCR

algorithms that can solve it [13]

  • no way to have variable cost of solving [10, 11]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 8/30

slide-14
SLIDE 14

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

CAPTCHAs

  • can prevent bots effectively .. as long as there aren’t OCR

algorithms that can solve it [13]

  • no way to have variable cost of solving [10, 11]
  • can only use for infrequent transactions due to the usability

burden [14]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 8/30

slide-15
SLIDE 15

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

CAPTCHAs

  • can prevent bots effectively .. as long as there aren’t OCR

algorithms that can solve it [13]

  • no way to have variable cost of solving [10, 11]
  • can only use for infrequent transactions due to the usability

burden [14] = annoying

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 8/30

slide-16
SLIDE 16

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-17
SLIDE 17

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-18
SLIDE 18

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues
  • can be used in frequent transactions

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-19
SLIDE 19

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues
  • can be used in frequent transactions
  • thus, can have variable cost of solving

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-20
SLIDE 20

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues
  • can be used in frequent transactions
  • thus, can have variable cost of solving

= nice right?

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-21
SLIDE 21

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues
  • can be used in frequent transactions
  • thus, can have variable cost of solving

= nice right? but

  • many proposed systems do not have an accurate user

reputation

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-22
SLIDE 22

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues
  • can be used in frequent transactions
  • thus, can have variable cost of solving

= nice right? but

  • many proposed systems do not have an accurate user

reputation

  • or, are too tightly integrated with a given application [3]

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-23
SLIDE 23

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

Proof-of-work

  • does not have CAPTCHA’s usability issues
  • can be used in frequent transactions
  • thus, can have variable cost of solving

= nice right? but

  • many proposed systems do not have an accurate user

reputation

  • or, are too tightly integrated with a given application [3]

= boo :(

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 9/30

slide-24
SLIDE 24

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

”License” and ”tax” spam

Combines the strength of CAPTCHA and proof-of-work as puzzles

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 10/30

slide-25
SLIDE 25

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

”License” and ”tax” spam

Combines the strength of CAPTCHA and proof-of-work as puzzles Variable Cost Function The more you spam the “harder“ puzzles you have to solve. Uses a Bayesian reputation system

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 10/30

slide-26
SLIDE 26

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

”License” and ”tax” spam

Combines the strength of CAPTCHA and proof-of-work as puzzles Variable Cost Function The more you spam the “harder“ puzzles you have to solve. Uses a Bayesian reputation system Secure The solver code is metamorphic: changing code randomly in each transaction

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 10/30

slide-27
SLIDE 27

Introduction System Architecture Evaluations References The problem Current Prevention Methods Our Approach

”License” and ”tax” spam

Combines the strength of CAPTCHA and proof-of-work as puzzles Variable Cost Function The more you spam the “harder“ puzzles you have to solve. Uses a Bayesian reputation system Secure The solver code is metamorphic: changing code randomly in each transaction Easy to use Easy to install & manage allowing the addition or removal of ”ineffective” puzzles

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 10/30

slide-28
SLIDE 28

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Communication Protocol

Authentication = Kerberos model

Figure 1: MetaCAPTCHA puzzle delivery and solution verification

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 11/30

slide-29
SLIDE 29

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Using Bayesian model

Training Data

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 12/30

slide-30
SLIDE 30

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Using Bayesian model

Training Data → Naive Bayes classifier

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 12/30

slide-31
SLIDE 31

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Using Bayesian model

Training Data → Naive Bayes classifier → Reputation score r between 0 and 1

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 12/30

slide-32
SLIDE 32

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Using Bayesian model

Training Data → Naive Bayes classifier → Reputation score r between 0 and 1 → Puzzle difficult t

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 12/30

slide-33
SLIDE 33

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Using Bayesian model

Training Data → Naive Bayes classifier → Reputation score r between 0 and 1 → Puzzle difficult t → Random generated puzzles

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 12/30

slide-34
SLIDE 34

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Using Bayesian model

Training Data → Naive Bayes classifier → Reputation score r between 0 and 1 → Puzzle difficult t → Random generated puzzles Client solves the puzzles until reaches the total amount of time t

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 12/30

slide-35
SLIDE 35

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Non-interactive, interactive or both

Web apps determine what puzzle types to protect their websites

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 13/30

slide-36
SLIDE 36

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Non-interactive, interactive or both

Web apps determine what puzzle types to protect their websites Puzzle types

  • Non-interactive puzzles

◮ Targeted Hash-Reversal [4] ◮ Modifed Time-Lock [3] Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 13/30

slide-37
SLIDE 37

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Non-interactive, interactive or both

Web apps determine what puzzle types to protect their websites Puzzle types

  • Non-interactive puzzles

◮ Targeted Hash-Reversal [4] ◮ Modifed Time-Lock [3]

  • Interactive puzzles

◮ reCAPTCHA Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 13/30

slide-38
SLIDE 38

Introduction System Architecture Evaluations References System Overview Variable Cost Function Puzzles

Non-interactive, interactive or both

Web apps determine what puzzle types to protect their websites Puzzle types

  • Non-interactive puzzles

◮ Targeted Hash-Reversal [4] ◮ Modifed Time-Lock [3]

  • Interactive puzzles

◮ reCAPTCHA

  • Hybrid

◮ CAPTCHA+: reCAPTCHA and Modifed Time-Lock

More puzzle types can be added/removed with no changes to the web application

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 13/30

slide-39
SLIDE 39

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

Environment & Dataset

  • Deployed MetaCAPTCHA on a live discussion forum active

for about two months in 2012

  • Had ≈ 2000 messages, ≈ 500 users, ≈ 100 sub-forums with

≈ 1000 threads

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 14/30

slide-40
SLIDE 40

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

F-measure of different features

10-fold cross-validation to train on ≈ 1500 messages and test the classifier on multiple features

Spam Ham 0.2 0.4 0.6 0.8 1 0.013 0.935 0.983

DShield Blacklist Language GEOIP SA Score Thanks Account Age Akismet Total F-Measure

Figure 2: Using multiple features is better than using one or a few

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 15/30

slide-41
SLIDE 41

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

Puzzle Difficulty Accuracy

0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 0.0 0.2 0.4 0.6 0.8 1.0 Spammer Mixed Non-Spammer Puzzle Difficulty (hrs) Probability

0.14 5.18

Figure 3: CDF solving time of spammers, non-spammers and mixed users

  • ≈ 90% of spammers solved a puzzle over 6 hrs long
  • ≈ 95% of non-spammers solved no puzzles at all and ≈ 5%

spent between 7.2 secs to 8.4 minutes

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 16/30

slide-42
SLIDE 42

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

Mixed users posted more ham than spam

1 2 3 4 5 6 7 8 9 10 11 12 5 10 15 20 25 30

138 131 141 502

Ham Spam User Message Posts (count)

Figure 4: Distribution of spam and ham sent by mixed users. Mixed users sent very little spam (between 1 and 8) when compared to the total messages they posted. X-axis inidicates User ID

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 17/30

slide-43
SLIDE 43

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

MetaCAPTCHA can really hurt spammers

  • Slow down 90% of ”spammers” significantly so they don’t

spam others

  • No impact on 95% of honest users

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 18/30

slide-44
SLIDE 44

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

MetaCAPTCHA can really hurt spammers

  • Slow down 90% of ”spammers” significantly so they don’t

spam others

  • No impact on 95% of honest users
  • Some improvement on those ”unlucky users” who blamed

their computers were too slow

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 18/30

slide-45
SLIDE 45

Introduction System Architecture Evaluations References Experimental Setup Defense-in-Depth Conclusions

MetaCAPTCHA can really hurt spammers

  • Slow down 90% of ”spammers” significantly so they don’t

spam others

  • No impact on 95% of honest users
  • Some improvement on those ”unlucky users” who blamed

their computers were too slow Future Works

  • Using spammer’s computing resources for volunteer

computing (e.g. SETI@Home)

  • Bitcoin as proof-of-work; turning spammers into miners

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 18/30

slide-46
SLIDE 46

Acknowledgment This work as supported by the National Science Foundation under Grant Number CNS-1017034

Question?

slide-47
SLIDE 47

Introduction System Architecture Evaluations References

References I

[1]

  • A. Back et al. Hashcash-a denial of service counter-measure. URL: http://www. hashcash.
  • rg/papers/hashcash. pdf, 2002.

[2]

  • C. Dwork and M. Naor. Pricing via processing or combatting junk mail. In Advances in

CryptologyCRYPTO92, pages 139–147. Springer, 1993. [3]

  • W. Feng and E. Kaiser. kapow webmail: Effective disincentives against spam. Proc. of 7th CEAS, 2010.

[4] W.-c. Feng and E. Kaiser. The case for public work. In IEEE Global Internet Symposium, 2007, pages 43–48. IEEE, 2007. [5] Geoffrey A. Fowler, Shayndi Raice, Amir Efrati. Facebook, Twitter battle ’social’ spam. http://www.theaustralian.com.au/business/wall-street-journal/ facebook-twitter-battle-social-spam/story-fnay3ubk-1226237108998, Jan 2012. [6]

  • C. Grier, K. Thomas, V. Paxson, and M. Zhang. @spam: the underground on 140 characters or less. In

Proceedings of the 17th ACM conference on Computer and communications security, CCS ’10, pages 27–37, New York, NY, USA, 2010. ACM. [7]

  • P. Heymann, G. Koutrika, and H. Garcia-Molina. Fighting spam on social web sites: A survey of approaches

and future challenges. Internet Computing, IEEE, 11(6):36–45, 2007. [8]

  • B. Laurie and R. Clayton. Proof-of-work proves not to work. In The Third Annual Workshop on Economics

and Information Security, 2004. [9] Mark Risher. Social Spam and Abuse — Annual Trend Review. http://blog.impermium.com/2012/01/13/social-spam-and-abuse-the-year-in-review/, Jan 2012. [10]

  • M. Motoyama, K. Levchenko, C. Kanich, D. McCoy, G. M. Voelker, and S. Savage. Re:

Captchas–understanding captcha-solving services in an economic context. In USENIX Security Symposium, volume 10, 2010. Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 20/30

slide-48
SLIDE 48

Introduction System Architecture Evaluations References

References II

[11]

  • M. Motoyama, D. McCoy, K. Levchenko, S. Savage, and G. M. Voelker. Dirty jobs: The role of freelance

labor in web service abuse. In Proceedings of the 20th USENIX conference on Security, pages 14–14. USENIX Association, 2011. [12] SPAM LAWS. Spam Statistics and Facts. http://www.spamlaws.com/spam-stats.html, 2011. [13]

  • O. R. Team. List of weaknesses. http://ocr-research.org.ua/list.html.

[14]

  • J. Yan and A. El Ahmad. Usability of captchas or usability issues in captcha design. In Proceedings of the

4th symposium on Usable privacy and security, pages 44–52. ACM, 2008. Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 21/30

slide-49
SLIDE 49

Introduction System Architecture Evaluations References

MetaCAPTCHA’s system model

MetaCAPTCHA Web App

Submit Message Need Proof-of-Work Solve Puzzle Get Proof-of-Work 1 2 4 3 Show Proof-of-Work 5 Initial Setup

Client

Browser + User

Figure 5: User’s browser must show proof-of-work before the web application accepts the user’s message. The dotted line indicates initial setup performed by the web application to use the MetaCAPTCHA service.

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 22/30

slide-50
SLIDE 50

Introduction System Architecture Evaluations References

Using Bayesian model

MetaCAPTCHA calcualtes a reputation score r between 0 and 1

  • probability that a given message is spam as determined by a

Naive Bayes classifier The reputation score r is translated to the puzzle difficulty t which is the amount of time a client must be kept busy solving puzzles 1 t = (tmax + 1)r − 1, tmax = tp sp(1 − δ) where, δ is the reduction in spam the web application is seeking (e.g. 10%) from an average amount sp of spam messages received in time period tp

1inspired by Laurie and Clayton’s work on proof-of-work [8] Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 23/30

slide-51
SLIDE 51

Introduction System Architecture Evaluations References

Solving Puzzles

Puzzles are randomly generated based on the list that is configured

  • Must be solved by the user’s browsers or the users
  • If the solution is returned in time t′ < t, then a new puzzle is

chosen and issued

  • This process is repeated until the client has computed for at

least t amount of time The idea behind issuing several puzzles is to ensure that no user can complete an online transaction unless they have computed for a length of time ≥ t

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 24/30

slide-52
SLIDE 52

Introduction System Architecture Evaluations References

Puzzles = CAPTCHAs + Proof-of-work

Proof-of-work

  • First proposed by Dwork and Naor [2] to combat email spam
  • Non-interactive
  • Difficult to solve in terms of time & complexity, but easy to

verify answers A famous example is Hashcash [1] - a computational challenge where the computer has to find a k-bit partial hash collision on string x, given a hash function H and string y, such that the first k bits of H(x) and H(y) are equal

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 25/30

slide-53
SLIDE 53

Introduction System Architecture Evaluations References

More experimental setup

  • forum users divided into three categories, (i) spammers: (ii)

non-spammers, and (ii) mixed: those who sent both spam and ham

  • Here, ’users’ implies the senders of messages included in

ground-truth information provided by the forum.

  • After the categorization, there were 99 messages sent by

non-spammers, 240 messages sent by spammers, and 151 messages sent by mixed users in the test set (34% of ground-truth data picked uniformly at random).

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 26/30

slide-54
SLIDE 54

Introduction System Architecture Evaluations References

Puzzle difficulty parameters

  • tmax = 6.82 hrs based on time period tp = 1 month
  • number of spam messages sp seen in that month is 1442, and

a spam reduction factor δ = 0.6

  • ≈ 90% of spammers solved a puzzle over 6 hrs long
  • ≈ 5% of non-spammers solved a puzzle between 7.2 secs and

8.4 minutes long

  • ≈ 95% of non-spammers solved no puzzles at all.

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 27/30

slide-55
SLIDE 55

Introduction System Architecture Evaluations References

Solving time of spammers, non-spammers and mixed users

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Spammer Mixed Non-Spammer Reputation Score Probability

0.88 0.06

Figure 6: CDF of reputation scores assigned to spammers, non-spammers, and mixed users (those that sent at least 1 spam and 1 ham)

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 28/30

slide-56
SLIDE 56

Introduction System Architecture Evaluations References

Reputation Score Evaluations

  • ≈ 90% of spammers have reputation scores over 0.95. ≈ 99%
  • f non-spammers got a reputation of 0.065 or less.
  • Only one honest user suffered the ill fate of being assigned a

reputation of 0.88, whereas 94% were assigned a reputation of zero — implying that they did not solve a puzzle at all!

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 29/30

slide-57
SLIDE 57

Introduction System Architecture Evaluations References

Majority of mixed users posted more ham than spam

1 2 3 4 5 6 7 8 9 10 11 12 5 10 15 20 25 30

138 131 141 502

Ham Spam User Message Posts (count)

Figure 7: Distribution of spam and ham sent by mixed users. Mixed users sent very little spam (between 1 and 8) when compared to the total messages they posted. X-axis inidicates User ID

Akshay Dua, Thai Bui, Tien Le, Nhan Huynh, Wu-chang Feng {akshay, buithai, letien, nhhuyng, wuchang}@cs.pdx.edu MetaCAPTCHA: A Metamorphic Throttling Service 30/30