

SLIDE 1

Deep Packet Inspection Using GPUs

Qian Gong, Wenji Wu, Phil DeMar
GPU Technology Conference 2017, May 2017

SLIDE 2

Background

  • Main uses for network traffic analysis
    – Operations & management
    – Capacity planning
    – Performance troubleshooting
  • Levels of network traffic analysis
    – Device counter level (SNMP data)
    – Traffic flow level (flow data)
    – Packet level (the focus of this work)
  • Network security
  • Application performance analysis
  • Traffic characterization studies

5/11/2017 Deep Packet Inspection Using GPUs, GTC’17 2

SLIDE 3

Background (cont.)

Characteristics of packet-based network traffic analysis applications:

  • Time constraints on packet processing
  • Computing- and I/O-throughput-intensive
  • High levels of data parallelism
    – Packet parallelism: each packet can be processed independently
    – Flow parallelism: each flow can be processed independently
  • Extremely poor temporal locality for data
    – Typically, data is processed once in sequence and rarely reused

SLIDE 4

The Challenges

Packet-based traffic analysis tools face performance and scalability challenges within high-performance networks.

  • High-performance networks:
    – 40GE/100GE link technologies
    – Servers are 10GE-connected by default
    – 400GE backbone links and 100GE host connections loom on the horizon
  • Millions of packets generated and transmitted per second

SLIDE 5

Packet-based Traffic Analysis Tool Platform (I)

  • Requirements on a computing platform for high-performance network traffic analysis applications:
    – High compute power
    – Ample memory & I/O bandwidth
    – Capability of handling the data parallelism inherent in network data
    – Easy programmability

SLIDE 6

Packet-based Traffic Analysis Tool Platform (II)

  • Three types of computing platforms:
    – NPU/ASIC, CPU, GPU

Architecture Comparison

    Features                        NPU/ASIC   CPU   GPU
    High compute power              Varies     ✖     ✔
    High memory bandwidth           Varies     ✖     ✔
    Easy programmability            ✖          ✔     ✔
    Data-parallel execution model   ✖          ✔     ✔

NVidia K80 vs. Intel E7-8890

    Features        Cores   Bandwidth   DP        SP        Power   Price
    NVidia K80      4992    480 GB/s    2.91 TF   8.73 TF   300 W   $4,349
    Intel E7-8890   18      102 GB/s    0.72 TF   1.44 TF   165 W   $7,174

SLIDE 7

Our Solution

Network traffic analysis using GPUs. Highlights of our work:

  • Demonstrated that GPUs can significantly accelerate network traffic analysis
  • Designed and implemented a generic I/O architecture to capture and move network traffic from the wire into the GPU domain
  • Implemented a GPU-accelerated library for network traffic analysis

SLIDE 8

GPU-based Network Traffic Analysis Framework

Running modes
  • Online analysis (live traffic capture)
  • Offline analysis

GPU-based analysis
  • Header analysis
  • Payload analysis

Applications
  • Network monitoring
  • IPS/IDS
  • Traffic engineering
  • And more

[Framework diagram: online traffic enters from the network via the WireCAP packet capture engine; offline traffic is read from storage via the libpcap library. Inside the GPU domain, packets pass through a filter (BPF) and header parser, and flows are tracked in a flow table keyed on (SrcIP, DstIP, SrcPort, DstPort, Proto). Header parsing and/or assembly of suspicious packets feeds pattern matching, traffic summarization, and abnormal-activity warnings, which serve network monitoring, IPS/IDS, and traffic engineering applications. Analyzer and system configuration are supplied in standard JSON format.]

SLIDE 9

System Architecture – Online Analysis

Four types of logical entities:

  1. Traffic Capture
  2. Preprocessing
  3. GPU-based Analysis
  4. Output (in JSON format)

[Architecture diagram: packets captured from the NICs land in packet buffers in user space, are preprocessed into packet chunks, are analyzed by traffic analysis kernels in the GPU domain, and the results are emitted as output.]
SLIDE 10

WireCAP Packet Capture Engine

  • An advanced packet capture engine for commodity network interface cards (NICs) in high-speed networks
    – Lossless zero-copy packet capture and delivery
    – Zero-copy packet forwarding
    – A libpcap-compatible interface for low-level network access
  • WireCAP project website
    – http://wirecap.fnal.gov (source code is available)

SLIDE 11

GPU-based Network Traffic Analysis

  • A GPU-accelerated library for network traffic analysis
    – Dozens of CUDA kernels
    – Kernels can be combined in a variety of ways to perform the intended analysis operations
  • Two types of GPU-based network traffic analysis
    – Header analysis (see our GTC’13 talk):
      http://on-demand.gputechconf.com/gtc/2013/presentations/S3146-Network-Traffic-Monitoring-Analysis-GPUs.pdf
    – Packet payload analysis: deep packet analysis of TCP streams

SLIDE 12

Challenges in Stream Reassembly (I) -- Parallelism

Why stream reassembly?

  • Payloads of packets belonging to the same TCP stream need to be reassembled before matching against pre-defined patterns

However…

  • Stream reassembly via a parallel hash table requires an atomic lock on each hash key (TCP 4-tuple)
  • Data parallelism is limited when few simultaneous TCP connections are present

[Diagram: out-of-order payload bytes are reordered and normalized into a contiguous stream, which is then matched against the patterns.]

SLIDE 13

Challenges in Stream Reassembly (II) -- Denial-of-Service Attacks

  • To handle out-of-order packets, one widely adopted approach is packet buffering and stream reassembly, i.e., buffering all packets that follow a missing one until the stream becomes in-sequence again.
  • This approach is intuitive but vulnerable to denial-of-service (DoS) attacks, whereby attackers exhaust the packet buffer capacity by sending long segments of out-of-order packets.

[Diagram: a TCP stream consisting of already received and forwarded data, buffered out-of-order data, and newly arriving data.]

SLIDE 14

GPU-based Deep Packet Analysis Pipeline

Pipeline stages: packets → packet statistics analysis / flow classification → per-flow TCP data reassembly → automaton-based pattern matching (stream processing).

  • Connection state: a connection table of TCBs indexed by hash(4-tuple); colliding connection records chain to the next hash bucket
  • Intra-batch TCP packet reordering & assembly
  • Inter-batch split detection
  • Hybrid pattern matching pipeline: pattern matching without buffering or dropping out-of-order packets
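The flow-classification stage keys each packet on its TCP 4-tuple. A minimal CPU-side sketch in Python (with hypothetical packet field names; the actual pipeline does this with CUDA kernels and an explicit chained hash table):

```python
from collections import defaultdict

def flow_key(pkt):
    """The TCP 4-tuple used as the connection-table hash key."""
    return (pkt["src_ip"], pkt["dst_ip"], pkt["src_port"], pkt["dst_port"])

def classify(packets):
    """Group a batch of packets into per-connection lists.
    (A Python dict stands in for the hash table with bucket chaining.)"""
    table = defaultdict(list)
    for pkt in packets:
        table[flow_key(pkt)].append(pkt)
    return dict(table)

batch = [
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.2", "src_port": 1234, "dst_port": 80, "seq": 1},
    {"src_ip": "10.0.0.3", "dst_ip": "10.0.0.2", "src_port": 5678, "dst_port": 80, "seq": 1},
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.2", "src_port": 1234, "dst_port": 80, "seq": 2},
]
flows = classify(batch)
assert len(flows) == 2
assert len(flows[("10.0.0.1", "10.0.0.2", 1234, 80)]) == 2
```

A real connection table would also hold per-connection TCB state (sequence numbers, timers) rather than raw packet lists.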

SLIDE 15

Key Mechanisms (I)

Observation 1

  • According to previous Internet traffic analysis reports, only 2%–5% of packets are affected by reordering
  • When processing packets in batches (~1e6 packets), only 0.1%–0.5% of TCP streams spread across batches

Mechanism 1 --- intra-batch stream reassembly
  + Load packets from the network to GPUs in batches
  + Reorder and reassemble packets within a batch via parallel sorting

SLIDE 16

GPU-based TCP Stream Reassembly

Packet reordering

  • Raw packets are sorted by (4-tuple | seq #), yielding packets in flow and sequence order
  • A filter + scan pass then derives a flow-identifier array and a next-packet array

Stream normalization

  • A scan over sequence numbers computes the bytes of overlapping data between adjacent packets (preferring new data)

[Diagram: an unsorted packet array (p3, p2, p1, p4, …) is sorted into per-flow sequence order; per-packet flow identifiers and next-packet indices are derived, and overlapping bytes are trimmed during normalization.]
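The sort-based reordering and normalization can be sketched on the CPU, treating sequence numbers as 0-based byte offsets within a flow (an assumption for illustration; the talk's version sorts by (4-tuple | seq #) in parallel on the GPU and resolves overlaps with a scan):

```python
def reassemble(packets):
    """Sort packets by (flow, seq), then copy payloads into per-flow buffers.
    Overlapping byte ranges: the later packet in sorted order wins, a
    simplified stand-in for the 'prefer new data' normalization policy."""
    streams = {}
    for pkt in sorted(packets, key=lambda p: (p["flow"], p["seq"])):
        buf = streams.setdefault(pkt["flow"], bytearray())
        start, data = pkt["seq"], pkt["payload"]
        end = start + len(data)
        if len(buf) < end:
            buf.extend(bytes(end - len(buf)))  # grow buffer with zero fill
        buf[start:end] = data                  # overlapping bytes overwritten
    return {flow: bytes(buf) for flow, buf in streams.items()}

pkts = [
    {"flow": 1, "seq": 4, "payload": b"WORLD"},  # arrives out of order
    {"flow": 1, "seq": 0, "payload": b"HELLO"},
    {"flow": 2, "seq": 0, "payload": b"HI"},
]
streams = reassemble(pkts)
assert streams[1] == b"HELLWORLD"  # byte 4 overlaps: 'W' replaces 'O'
assert streams[2] == b"HI"
```

On the GPU, the per-flow buffers are not materialized this way; the sorted packet order plus the flow-identifier and next-packet arrays let each thread find its neighbor and trim overlap in place.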

SLIDE 17

Key Mechanisms (II)

Observation 2

  • If a string S is matched across a list of packets P1 P2 … PN, then a suffix of P1 must match a prefix of S, a prefix of PN must match a suffix of S, and each of P2 … PN-1 must match a prefix of a suffix of S (an infix of S)

Mechanism 2 --- inter-batch split detection
  + Combine the Aho-Corasick (AC) and suffix-AC automatons to detect signatures spread over different batches

SLIDE 18

GPU-based Pattern Matching for Out-of-order Packets

Intra-batch: AC automaton

Keywords: X = {he, his, she, hers}

  • One thread per packet
  • Each thread scans an extra N bytes into its consecutive packet, so patterns straddling packet boundaries within a batch are not missed

[Diagram: the AC state-transition automaton for X, and the parallel execution mode in which threads k, k+1, k+2 each scan one packet plus the overlap region.]
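The AC automaton for X can be sketched in Python; this is the textbook Aho-Corasick construction, not the GPU table layout used in the talk:

```python
from collections import deque

def build_ac(patterns):
    """Build an Aho-Corasick automaton: goto trie, failure links, output sets."""
    goto, fail, out = [{}], [0], [set()]
    for pat in patterns:
        node = 0
        for ch in pat:
            if ch not in goto[node]:
                goto.append({}); fail.append(0); out.append(set())
                goto[node][ch] = len(goto) - 1
            node = goto[node][ch]
        out[node].add(pat)
    q = deque(goto[0].values())          # depth-1 nodes fail to the root
    while q:
        node = q.popleft()
        for ch, child in goto[node].items():
            q.append(child)
            f = fail[node]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[child] = goto[f].get(ch, 0)
            out[child] |= out[fail[child]]   # inherit matches ending here
    return goto, fail, out

def ac_search(text, goto, fail, out):
    """Scan text; return (final AC state, sorted (start index, pattern) hits)."""
    node, hits = 0, []
    for i, ch in enumerate(text):
        while node and ch not in goto[node]:
            node = fail[node]
        node = goto[node].get(ch, 0)
        for pat in out[node]:
            hits.append((i - len(pat) + 1, pat))
    return node, sorted(hits)

goto, fail, out = build_ac(["he", "his", "she", "hers"])
_, hits = ac_search("ushers", goto, fail, out)
assert hits == [(1, "she"), (2, "he"), (2, "hers")]
```

On the GPU, the goto/fail structure is typically flattened into dense state-transition tables so each thread can scan its packet (plus the N-byte overlap) with pure table lookups and no pointer chasing.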

SLIDE 19

GPU-based Pattern Matching for Out-of-order Packets (cont.)

Inter-batch: AC automaton & suffix-AC automaton

  • Suffix Pattern Tree (PST), built over the suffix set of X: {e, is, s, he, ers, rs}
  • A suffix string is recovered as the path to a PST state:

    struct {
        int  nextState[256];
        int  preState;
        char preChar;
    } PST;

  • A signature split between already received-and-forwarded data and newly arriving packets is detected from the AC state (case 1), from the suffix-AC state (case 2), or from both (case 3)
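The inter-batch idea can be illustrated without the automatons. The sketch below (an illustration, not the talk's suffix-AC implementation) records which pattern-prefixes end one batch and checks whether the head of the next batch supplies a matching pattern-suffix:

```python
PATTERNS = ["he", "his", "she", "hers"]
MAXLEN = max(len(p) for p in PATTERNS)

def tail_states(batch):
    """All suffixes of a batch that are proper prefixes of some pattern.
    (The end-of-batch AC state and its failure chain encode this set.)"""
    return [batch[-k:]
            for k in range(min(MAXLEN - 1, len(batch)), 0, -1)
            if any(p.startswith(batch[-k:]) and len(p) > k for p in PATTERNS)]

def cross_batch_hits(prev_batch, next_batch):
    """Signatures completed across the batch boundary: a saved pattern prefix
    from prev_batch plus a matching pattern suffix heading next_batch."""
    hits = set()
    for tail in tail_states(prev_batch):
        for p in PATTERNS:
            if (len(p) > len(tail) and p.startswith(tail)
                    and next_batch.startswith(p[len(tail):])):
                hits.add(p)
    return hits

assert cross_batch_hits("xxhe", "rsyy") == {"hers"}   # "hers" split as "he"+"rs"
assert cross_batch_hits("xysh", "e...") == {"she", "he"}
```

In the talk's scheme, the end-of-batch AC state roughly plays the role of tail_states, and the suffix-AC automaton replaces the startswith checks, letting pattern suffixes in newly arrived data be recognized even when the earlier data was already forwarded.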

SLIDE 20

Performance Evaluation

Traffic statistics

  • Traffic source: real traffic mirrored from the Fermilab gateway
  • Traffic pattern (average per batch):

      # of packets          1 million
      # of data packets     776,207
      Mean packet length    1415 bytes
      # of connections      15,500

Base system

  • Intel Xeon CPU E5-2650 @ 2.30 GHz, NVIDIA K40

Throughput (excluding memory transfer)

  • TCP reassembly: 72.96 Mpps (a 192× speedup over libnids on the CPU)
  • TCP state management: 286.85 Mpps
  • Pattern matching (AC & suffix-AC): 5.83 Mpps

SLIDE 21

Comparison to Existing Tools

  • Comparison to Snort [1], Split-Detect [2], and GASPP [3]

    Feature                             Snort               Split-Detect      GASPP                           Ours
    Computing platform                  CPU                 CPU               GPU                             GPU
    Method                              Stream reassembly   Split detection   Intra-batch stream reassembly   In-batch stream reassembly + inter-batch split detection
    Detection over OOO packets          ✔                   ✔                 limited                         ✔
    Resistance to fragmentation flood   N                   Y                 N                               Y
    Throughput                          Low                 Low               High                            High

[1] Roesch, Martin. "Snort: Lightweight intrusion detection for networks." LISA, vol. 99, no. 1, 1999.
[2] Varghese, George, J. Andrew Fingerhut, and Flavio Bonomi. "Detecting evasion attacks at high speeds without reassembly." ACM SIGCOMM Computer Communication Review, vol. 36, no. 4, 2006.
[3] Vasiliadis, Giorgos, et al. "Design and Implementation of a Stateful Network Packet Processing Framework for GPUs." IEEE/ACM Transactions on Networking, 2016.

SLIDE 22

Functionality Evaluation in the Presence of Adversaries

  • Robust stream reassembly in the face of out-of-order packets
  • Immune to SYN floods and "cold start" during normalization
  • Exempt from attacks on available buffer memory, thanks to timeout and connection-eviction mechanisms

SLIDE 23

Future Work

  • Extend the GPU-based deep packet analysis framework to work with regular expressions
  • Complement the header info with the payload detection results for a thorough inspection/analysis
  • Optimize and evolve the GPU-based network traffic analysis framework for 40GE/100GE networks

SLIDE 24

Questions?


qgong@fnal.gov, wenji@fnal.gov, demar@fnal.gov