BotGraph: Large Scale Spamming Botnet Detec5on
Yao Zhao
Yinglian Xie*, Fang Yu*, Qifa Ke*, Yuan Yu*, Yan Chen and Eliot Gillum‡ EECS Department, Northwestern University MicrosoK Research Silicon Valley* MicrosoK Coopera5on ‡
1
BotGraph: Large Scale Spamming Botnet Detec5on Yao Zhao Yinglian - - PowerPoint PPT Presentation
BotGraph: Large Scale Spamming Botnet Detec5on Yao Zhao Yinglian Xie * , Fang Yu * , Qifa Ke * , Yuan Yu * , Yan Chen and Eliot Gillum EECS Department, Northwestern University MicrosoK Research Silicon Valley * MicrosoK Coopera5on 1
Yinglian Xie*, Fang Yu*, Qifa Ke*, Yuan Yu*, Yan Chen and Eliot Gillum‡ EECS Department, Northwestern University MicrosoK Research Silicon Valley* MicrosoK Coopera5on ‡
1
2
Zombie (Compromised host)
Spammer’s Server Captcha solver
RDSXXTD3 User/Pwd
3
4
5
Login data Login graph
Graph generation
Random graph based clustering Verification & prune Sendmail data Spamming botnets Suspicious clusters Signup data
EWMA based change detection
Aggressive signups Verification & prune Signup botnets
DryadLINQ clusters
(ID, IP, time) (ID, time, # of recipients) (ID, IP, time)
6
Large predic5on error Back to normal Date
Number of Signup Accounts
25 20 15 10 5 1-Jul 2-Jul 3-Jul 4-Jul 5-Jul 6-Jul 7-Jul 8-Jul 9-Jul Signup Count EWMA Prediction
7
Login data Login graph
Graph generation
Random graph based clustering Verification & prune Sendmail data Spamming botnets Suspicious clusters Signup data
EWMA based change detection
Aggressive signups Verification & prune Signup botnets
(ID, IP, time) (ID, time, # of recipients) (ID, IP, time)
8
9
– Consider edges with weight>1
10
2 ASes 3 ASes 5 ASes 1 AS 4 ASes User1 User2 User3 User4 User5 User6
11
12
1st group 2nd group 3rd group
13
14
Login data Login graph
Graph generation
Random graph based clustering Verification & prune Sendmail data Spamming botnets Suspicious clusters Signup data
EWMA based change detection
Aggressive signups Verification & prune Signup botnets
(ID, IP, time) (ID, time, # of recipients) (ID, IP, time)
17
18
– 500M users and 200~300GB data per month
19
20
21
22
23
24
25