Furious MAC Decomposition of MAC Address Structure for Granular Device Inference
Jeremy Martin∗, Erik C. Rye∗, Robert Beverly+
∗US Naval Academy
Annapolis, MD
+US Naval Postgraduate School
Monterey, CA
December 9, 2016
1 / 24
Decomposition of MAC Address Structure for Granular Device Inference - - PowerPoint PPT Presentation
Furious MAC Decomposition of MAC Address Structure for Granular Device Inference Jeremy Martin , Erik C. Rye , Robert Beverly + US Naval Academy Annapolis, MD + US Naval Postgraduate School Monterey, CA December 9, 2016 1 / 24
∗US Naval Academy
+US Naval Postgraduate School
1 / 24
2 / 24
◮ FuriousMAC: can we trust the first 3 bytes alone?
◮ Contiguous? ◮ Sequential? ◮ Predictable? e.g., fine-grained make and model? 3 / 24
◮ FuriousMAC: can we trust the first 3 bytes alone?
◮ Contiguous? ◮ Sequential? ◮ Predictable? e.g., fine-grained make and model? 3 / 24
◮ FuriousMAC: can we trust the first 3 bytes alone?
◮ Contiguous? ◮ Sequential? ◮ Predictable? e.g., fine-grained make and model? 3 / 24
◮ FuriousMAC: can we trust the first 3 bytes alone?
◮ Contiguous? ◮ Sequential? ◮ Predictable? e.g., fine-grained make and model? 3 / 24
4 / 24
5 / 24
◮ Management frames containing WPS-enriched data fields ◮ Discovery protocols, primarily mDNS ◮ Easily extensible 6 / 24
◮ Access Points (Beacons and Probe Responses), client devices (Probe Requests)
◮ Advantages: Unencrypted, non-associated state, low data-rates, wide range of
◮ Disadvantage: Not used by all devices (iOS, Ubiquiti, etc.)
◮ mDNS data field, dns.txt: reveals a model identification key-value pair,
◮ Advantages: Fills in some high profile gaps → iOS!! ◮ Disadvantages: Layer-2 encryption, associated state, often higher data-rate, not
7 / 24
◮ Two billion frames ◮ 2.8 million unique devices across a spectrum of IoT devices ◮ January 2015 – May 2016 ◮ IRB exemption: Only examine MACs, management frames, and discovery
8 / 24
9 / 24
0 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
MacBookPro9,2 iPad Mini 2 (Cellular) iPhone 5c (GSM) iPad Mini 2 (WiFi)
10 / 24
11 / 24
12 / 24
Top 10 Manufacturers - Clients WPS Count % non-WPS Count % LGE 11,184 22.60 Apple 231,214 44.36 Ralink 4,279 8.64 Samsung 48,617 9.33 Motorola 3,260 6.58 Murata 48,246 9.26 HTC 3,256 6.57 Intel 25,734 4.95 Prosoft 2,234 4.50 HP 15,287 2.94 Amazon 2,222 4.49 Microsoft 13,949 2.68 Huawei 1,905 3.83 Ezurio 12,385 2.38 Asus 1,659 3.34 Epson 6,839 1.32 ZTE 1,619 3.25 Lexmark 5,289 1.01 Alco 1,036 2.10 Sonos 4,542 .09 Other 16,859 34.10 Other 109,271 20.96
13 / 24
14 / 24
0 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
MacBookPro9,2 iPad Mini 2 (Cellular) iPhone 5c (GSM) iPad Mini 2 (WiFi)
15 / 24
0 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
LGL39C LG-E460 LG-P659 VS870 4G LG-E440 LG-F200S LG-P769 LG-E451g Nexus 4 LG-D410 LG-P760 LG-LS720 LGMS659 LGMS500 LG-D500 LG-E455 LG-D680 LG-E465f LG-P655H LG-D686 LG-D520 LG-E470f LG-V510 LG-E467f
16 / 24
0 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
BLU STUDIO 7.0 O+ Ultra Micromax Q380 i-mobile i-STYLE 218 A3-A20 Windows DASH JR K T07 T06 BLU STUDIO C Micromax Q391 Archos 35b Titanium i-mobile_IQ_BIG2 irisX8 Micromax A316 DOOV L1M Micromax AQ5001
17 / 24
0 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0
OC810 RC8021 H560N RC8025 Broadcom OpenRG Platform AD1018 iCamera Ralink Wireless Linux Client OC821D WAP-PLUS WAP
18 / 24
◮ Captured from Italy in 2013; do not appear in our corpus ◮ Anonymized data, to include MAC addresses
◮ 1,746 global addresses recovered (test data), find closest MAC
◮ If CRAWDAD manufacturer/model matches corpus closest-match
◮ Validation achieves 81.3% accuracy
An Analysis of Wi-Fi Network Discovery Mechanisms. In ACM AsiaCCS, 2016. 19 / 24
20 / 24
◮ Compare using simple distance (48-bit integer representation)
◮ Manufacturer/model in test set compared to manufacturer/model
◮ Each set is used once as test data against the remaining four sets
◮ ∼90.95% (lexicographical distance) vs ∼91.16% (simple distance) ◮ ∼10% improvement over the accuracy we obtain when testing
◮ ∼3% improvement over our validation using ground truth devices 21 / 24
10-7 10-6 10-5 10-4 10-3 10-2 10-1 100
Density of Inferred Block
0.0 0.2 0.4 0.6 0.8 1.0
CDF of Test MAC Addresses
Correct Inference (CRAWDAD) Incorrect Inference (CRAWDAD) Correct Inference (Apple) Correct Inference (Samsung) Incorrect Inference (Samsung)
◮ 55% of correct inferences within non-trivial block density ◮ 85% of incorrect inferences fall outside of any block (density of 0) ◮ Only 1 incorrect Apple inference falls inside a block 22 / 24
23 / 24
◮ management and discovery protocols allow significant privacy leaks ◮ the allocation of MAC addresses lends itself to device fingerprinting
◮ Improved granularity of MAC-based fingerprinting ◮ Complexity and variety of allocation policies causes simpler
◮ Resilient, other methods rely on user-configurable data 24 / 24