FBS-Radar: Uncovering Fake Base Stations at Scale in the Wild Zhenhua - - PowerPoint PPT Presentation

fbs radar uncovering fake base stations at scale in the
SMART_READER_LITE
LIVE PREVIEW

FBS-Radar: Uncovering Fake Base Stations at Scale in the Wild Zhenhua - - PowerPoint PPT Presentation

Feb. 26 Mar. 1 FBS-Radar: Uncovering Fake Base Stations at Scale in the Wild Zhenhua Li Weiwei Wang Chen Qian Christo Wilson Jian Chen Yunhao Liu Taeho Jung Lan Zhang Kebin Liu Xiangyang Li lizhenhua1983@gmail.com


slide-1
SLIDE 1

FBS-Radar: Uncovering Fake Base Stations at Scale in the Wild

lizhenhua1983@gmail.com http://www.greenorbs.org/people/lzh/

  • Mar. 1st, 2017

Zhenhua Li Christo Wilson Chen Qian

1

  • Feb. 26 – Mar. 1

Weiwei Wang Jian Chen Lan Zhang Kebin Liu Yunhao Liu Xiangyang Li Taeho Jung

slide-2
SLIDE 2

Outline

Background State of the Art Our System Locating FBSes Summary

2

slide-3
SLIDE 3

3

Story 1

From 95599 From 95599 (Agriculture

(Agriculture Bank of China) Bank of China):

: We’re processing the student loan you’ve applied for, and now requiring you to transfer a deposit of ¥9900 (≈ $1500) to the bank account XXXXXXXXX. SMS Text SMS Text Message Message

* Note: This is a simplified version of the actual story which involves more complex details.

slide-4
SLIDE 4

4

Story 2

From 95566 From 95566 (Bank of (Bank of China): China): We’re processing the house mortgage for

  • you. Please prepare

¥17,600,000 (≈ $2,600,000) ... SMS Text SMS Text Message Message

* Note: This is a simplified version of the actual story which involves more complex details.

Fake Base Stations

slide-5
SLIDE 5

5

GSM (Global System for Mobile Communication)

Bi Birt rth Ye Year Use ser r Sca Scale Sp Speed Se Secu curi rity

2G – – GSM SM 1990 1990

> 1 billion 1 billion Low Low

Po Poor

3G – CDMA 2008 < 2 billion Middle Middle 4G – LTE 2009 ≈ 3 billion High Fine

authentica cation

X

Fake Base Stations

slide-6
SLIDE 6

6

FBS Carrier

Very high signal strength

slide-7
SLIDE 7

7

Fake Base Station (FBS)

Legitimate BS FBS

Wireless Transceiver Engineering Laptop Engineering Cellphone

USB Cable

FBS

slide-8
SLIDE 8

8

FBS Attack on GSM Phones

Current Connection Location Update Which BS has the highest signal strength?

  • 70 dBm
  • 30 dBm
  • 60 dBm
  • 100 dBm

I may have to switch my BS connec8on … GSM

slide-9
SLIDE 9

9

FBS Attack on GSM Phones

New Connection

  • 70 dBm
  • 30 dBm
  • 60 dBm
  • 100 dBm

GSM

slide-10
SLIDE 10

10

FBS Can Also Impact 3G/4G Phones

Jamming Signal GSM 3G/4G

Degrade

GSM

GSM has existed for many years, so abandoning GSM also needs many years …

slide-11
SLIDE 11

11

FBS Attack Is NOT Hypothetical

US China China India Russia UK

Year # FBS Msgs 2013 >> 2.9 billion 2014 >> 4.2 billion 2015 >> 5.7 billion

N * billion

slide-12
SLIDE 12

12

FBS Industry in China

Device: $400 Daily income: $40 Device: $1000 Daily income: $70 Device: $700 Daily income: up to $1400

slide-13
SLIDE 13

13

FBS Industry in China

Device: $400 Daily income: $40 Device: $1000 Daily income: $70 Device: $700 Daily income: up to $1400

slide-14
SLIDE 14

State of the Art

14

slide-15
SLIDE 15

15

Electronic Fence

Huge infrastructure costs à Poor scalability

slide-16
SLIDE 16

16

FBS-signal Detection Car

Random walk à Limited coverage & “dull”

slide-17
SLIDE 17

17

User Reporting

Most users don’t realize the existence of FBSes Dial 12321

slide-18
SLIDE 18

18

Client-side Tools

Do they really work in large-scale practice? …

slide-19
SLIDE 19

Our System: FBS-Radar

19

slide-20
SLIDE 20

20

Baidu PhoneGuard Users Opt-in

Report multiple fields of su susp spici cious SMS messages

p Sender’s number is not in the recipient’s contact list p Sender’s number is an authoritative number

slide-21
SLIDE 21

21

Five Methods

  • 1. Signal

Strength Examina8on

  • 2. BS ID Syntax

Checking

  • 3. Message

Content Mining

  • 4. BS-WiFi

Loca8on Analysis

  • 5. BS-Handover

Speed Es8ma8on

0.23%

0.15% 0.16%

4.1%

0.39% ~100M users

slide-22
SLIDE 22

22

3.1 Signal Strength Examination

  • 40 dBm

FBS > -40 dBm

0.23% of user- reported suspicious SMS messages

slide-23
SLIDE 23

23

3.2 BS ID Syntax Checking

BS ID = MCC + MNC + LAC + CID

p MCC: Mobile Country Code, 3 digits p MNC: Mobile Network Code, 2 digits p LAC: Location Area Code, 16 bits p CID: Cell Identity, 16 bits for 2G/3G and 28 bits for 4G

0.15% of suspicious messages were sent by BSes with syntactically invalid IDs

slide-24
SLIDE 24

24

3.3 Message Content Mining

p Bag-of-words SVM (Support Vector Machine) classifier trained on 200,000 hand-labeled SMS messages

① Labelling suspicious messages; ② Word segmenta8on; ③ Feature extrac8on; ④ Quan8zing the feature vector; ⑤ Training the SVM model; ⑥ Preprocessing the test set; ⑦ SVM classifica8on of the test set.

0.16% of suspicious messages came from authoritative phone numbers and were determined to contain fraud text content

l Computa8on intensive l Viola8on of user privacy

slide-25
SLIDE 25

25

3.4 BS-WiFi Location Analysis

BS Location User WiFi Location

4.1% 4.1% of suspicious messages were sent by BSes

that were not in their correct geolocation, i.e., they were spoofing the ID of a legitimate but distant BS.

slide-26
SLIDE 26

26

3.4 Counterfeiting a Nearby BS ID

Current Connection Location Update If I counterfeit a nearby BS ID …

  • 70 dBm
  • 30 dBm
  • 60 dBm
  • 100 dBm

My loca8on does not change a lot, so I needn’t switch to a new BS J

slide-27
SLIDE 27

27

3.5 BS-Handover Speed Estimation

p For BS-WiFi location analysis, what if the WiFi location information is not available?

slide-28
SLIDE 28

28

4.5 BS-Handover Speed Estimation

>> 0.39% of suspicious SMS messages come from FBSes

slide-29
SLIDE 29

29

Detection Performance

p > 4.7% 7% of suspicious messages should have come from FBSes

  • False positive rate is only 0.05% (according to user feedback),

mainly due to the inaccuracy of our WiFi database p Set-3 (by message content mining) is >98% covered by the

  • ther 4 sets
  • No need to collect the text content of users’ messages!
  • No need to collect the text content of users’ messages!
slide-30
SLIDE 30

30

Arresting FBS Operators

p With the help of FBS-Radar, the police have arrested tens to hundreds of FBS operators every month

slide-31
SLIDE 31

Locating FBSes

31

slide-32
SLIDE 32

32

Locating FBSes based on User Device Locations

Time BS ID1

Ć Ć

Window BS ID2

Ć Ć

p FBSes frequently move and change their IDs Ø We take both temporal temporal and spatial spatial locality into account

Only those FBS messages

1) using the same BS ID, 2) happening in the same time

time window window,

and 3) located in the same spatial cluster

can be attributed to one FBS.

slide-33
SLIDE 33

33

Locating FBSes based on User Device Locations

FBS deviation distance

p The centroid of every

every cluster is

the estimated location of an FBS.

This loca8on accuracy is sufficient for us to track FBSes!

slide-34
SLIDE 34

34

Real-time Locations of FBSes

Public URL à http://shoujiweishi.baidu.com/static/map/pseudo.html

slide-35
SLIDE 35

35

l Using extensive crowdsourced data, we evaluate five

five different different methods methods for for detect detecting ing FBSes FBSes in the wild,

and find that FBSes can be precisely identified without sacrificing user privacy. l We present a reasonable method for locating

locating FBSes FBSes

with an acceptable accuracy.

l FBS-Radar FBS-Radar is is currently currently in in use use by by ~100 ~100M people

  • people. It protects users from millions of malicious

messages from FBSes every day, and has helped the authorities arrest numerous FBS operators every month.

Summary

slide-36
SLIDE 36

Backup slides

slide-37
SLIDE 37

37

FBS Attack: Passive vs. Active

Passive: IMSI-catcher Active: Push spam/fraud SMS

messages with spoofed phone numbers Year # FBS Msgs 2013 >> 2.9 billion 2014 >> 4.2 billion 2015 >> 5.7 billion

Rarely reported in China, but sometimes reported in the US

slide-38
SLIDE 38

38

Ground Truth

p Our ONLY ground truth comes from users’ feedback

We think this message comes from an FBS. What do you think?

p Yes: 99.95% p No: 0.05%

Manual double-check

slide-39
SLIDE 39

39

Why not use GPS?

p Most people turn GPS off in most time to save battery, so we have to ask users for GPS privilege

Locajon accuracy increases by 20%? User scale decreases by 20%? for harassment …

slide-40
SLIDE 40

40

Localizing User Devices based on WiFi Information

Dominant Cluster

Centroid deviation distance

p The centroid of the dominant

dominant

cluster is the estimated location of the user device k-means

DBSCAN DBSCAN

slide-41
SLIDE 41

41

Spam and Fraud SMS Messages

Spam (Ads)

“We are selling excellent, cheap goods and food from

  • Jul. to Aug. 2016. Visit our shops at the People’s

Square as soon as possible!” --- sent from a (usually not well-known) mart or grocery. “We provide very cheap and legal invoices that can help you quickly make a big fortune. Don’t hesitate, dial us via the phone number: 010-61881234!” --- sent from a (usually not well-known) company.

Fraud

“Dear user, you are lucky to be the winner of this month’s big award! You will be offered 10-GB FREE 4G traffic by clicking on this URL: http://www.10086award.com.” --- sent from 10086 (China Mobile). “Dear customer, you have failed to pay for this year’s management fee of 100 dollars. If you do not pay for it before Jul. 30th, you will face a fine of 500 dollars. You should pay it by transferring money to the following bank account: ...” --- sent from 95533 (Bank of China).

Spoofed phone numbers Spoofed phone numbers

slide-42
SLIDE 42

42

FBS-Radar: 4-fold Design Goals

p Detect as many FBSes as possible with very few false positives, without specialized hardware p Automatically filter spam/fraud FBS messages from user devices with a high precision p Provide actionable intelligence about geolocations

  • f FBSes to aid law enforcement agencies

p Use minimal resources on client side, minimize collection of sensitive data, and not require root.

slide-43
SLIDE 43

43

FBS-Radar & Baidu PhoneGuard

Content-free Analysis BS-location Database WiFi-location Database Content Analysis User Report Authoritative Phone Number List Message Logs SVM Machine Learning Cluster

Baidu PhoneGuard

http:// shoujiweishi.baidu.com

Crowdsourced data from ~100 Million Users

slide-44
SLIDE 44

44

Database and List

Content-free Analysis BS-location Database WiFi-location Database Content Analysis User Report Authoritative Phone Number List Message Logs SVM Machine Learning Cluster

BS ID à <lat, lon, radius, tag> WiFi MAC à <lat, lon, tag> ≈ 1500 phone numbers

slide-45
SLIDE 45

45

FBS-Radar: Timeline

2014.01-07 Design & Implementa8on 2014.08 Online released 2015 ~17.5 million suspicious SMS messages reported per day 2016 ~32 million suspicious SMS messages reported per day

slide-46
SLIDE 46

46

Informed Consent from Users

slide-47
SLIDE 47

47

Opt-in Options for Users

Detection Rules (for FBS-Radar) Intelligent Detection ON/OFF Cloud-side Detection ON/OFF Content Detection of Suspicious SMS Messages ON/OFF Suspicious Voice Call & SMS Message Detection ON/OFF Contacts’ Voice Call & SMS Message Detection ON/OFF

Baidu Baidu PhoneGuard PhoneGuard App App