DPICO: A High-Speed Deep Packet Inspection Engine Using Compact Finite Automata


  1. DPICO: A High-Speed Deep Packet Inspection Engine Using Compact Finite Automata
     Chris Hayes, Rensselaer Polytechnic Institute (formerly UMass Lowell)
     Yan Luo, University of Massachusetts Lowell

  2. Agenda
     • Baseline Design
     • Design with Compression
       • Content Addressable Memory
       • Interleaved Memory Banks
       • Data Packing
     • Memory Savings
     • Results

  3. Baseline Design and Motivation
     • Finite automata are the basis for many packet filtering techniques.
     • We propose a technique for implementing packet filtering in hardware.
     • Standard Moore-machine next-state memory architecture.
     • Advantage: speed.
     • Disadvantage: memory utilization. The next-state table can hold redundant information for a given finite automaton; this redundancy is what we seek to reduce.

  4. How to Improve the Utilization
     • We remove repeated information from the state transition table.
     • Many transitions of a state may share the same next-state pointer.
     • We combine these into a single transition by creating two types of transitions:
       • Labeled transition: followed if its label matches the input character.
       • Default transition: followed if no label matches the input character.
     • The most frequently repeated next-state pointer in each state becomes the default transition pointer (see the sketch below).
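
A minimal software sketch of this step, assuming the dense table is given as a 256-entry next-state row per state (the function name and input format are illustrative, not from the paper):

```python
from collections import Counter

def compress_state(next_states):
    """Compress one state's dense 256-entry next-state row into a
    default transition plus a list of labeled transitions.

    next_states: list of 256 next-state pointers, indexed by input byte.
    Returns (default_ptr, labeled), where labeled holds
    (label, next_state_ptr) pairs for every byte that does not
    follow the default.
    """
    # The most common next-state pointer becomes the default transition.
    default_ptr = Counter(next_states).most_common(1)[0][0]
    # Every other entry is kept as an explicit labeled transition.
    labeled = [(byte, ptr) for byte, ptr in enumerate(next_states)
               if ptr != default_ptr]
    return default_ptr, labeled
```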

  5. How to Realize the Compression
     • We take advantage of three technologies:
       • Content-addressable memory
       • Interleaved memory banks
       • Data packing

  6. Content Addressable Memory
     • A drawback of the baseline design is its fixed state size.
     • Adding default and labeled transitions gives us a mechanism to compress each state.
     • We use a CAM as the search mechanism that finds the next-state transition matching the input character.
     • Each state has its own CAM, since each state requires its own associative lookup.

  7. Content Addressable Memory (2)
     • Labeled transition entry: Label | Next State Pointer | End Locn.
     • Default transition entry: Match ID | Next State Pointer | End Locn.
     • A state in memory: N labeled transitions followed by exactly one default transition per state.
     • Issue: we need to search through each labeled transition to resolve the next state, which could take many clock cycles (illustrated below).
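
The issue can be seen in a behavioral sketch of the lookup (field layout is illustrative): a naive scan touches one labeled entry at a time, which in hardware would cost up to N clocks per input character.

```python
def next_state(default_ptr, labeled, input_char):
    """Naive lookup over one state's compressed record: scan the
    labeled transitions in order and fall back to the default.
    In hardware this sequential scan could take N clock cycles,
    which is the problem the CAM plus interleaved banks solve."""
    for label, ptr in labeled:
        if label == input_char:
            return ptr
    return default_ptr
```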

  8. Interleaved Memory
     • FPGAs can have hundreds of banks of memory.
     • Each bank can be read in parallel.
     • Read/write bandwidth is increased by a factor of n, where n is the number of banks.
     • Example with four banks:

              Bank 0        Bank 1        Bank 2        Bank 3
       Addr 3 Location 12   Location 13   Location 14   Location 15
       Addr 2 Location 8    Location 9    Location 10   Location 11
       Addr 1 Location 4    Location 5    Location 6    Location 7
       Addr 0 Location 0    Location 1    Location 2    Location 3

     • Note: by controlling the read address to each bank, we can read any 4 contiguous locations simultaneously. This lets us evaluate multiple transitions in a single clock cycle (see the sketch below).
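
A sketch of the per-bank address calculation, assuming locations are striped round-robin across the banks as in the layout above:

```python
def bank_read_addresses(start, n_banks):
    """For a read of n_banks consecutive locations beginning at
    `start`, return the address each bank must be given.
    Location L lives in bank L % n_banks at row L // n_banks,
    so all banks can be read in the same cycle."""
    addrs = [None] * n_banks
    for loc in range(start, start + n_banks):
        addrs[loc % n_banks] = loc // n_banks
    return addrs

# With the slide's four-bank layout, reading locations 6..9 drives
# banks 0 and 1 with row 2 (locations 8, 9) and banks 2 and 3 with
# row 1 (locations 6, 7).
print(bank_read_addresses(6, 4))  # [2, 2, 1, 1]
```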

  9. The Design
     • The current-state address is input to the N-bank interleaved memory.
     • The individual address is calculated for each RAM bank (Addr Calc blocks feeding Bank 0 .. Bank n-1).
     • The default and labeled transition info are read from the RAM.
     • We select the labeled transition whose label matches the input character. Simultaneously, we read the default transition information and output the Match ID.
     • Finally, a mux selects the next-state pointer from either the labeled-transition logic or the default-transition logic, depending on whether a labeled transition matched (modeled below).
     • Result: reduced storage while keeping a constant transition time.
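
In software terms, the select logic behaves roughly as follows (a behavioral sketch, not the actual RTL; the entry encoding is assumed for illustration):

```python
def resolve_transition(banks_out, input_char):
    """Behavioral model of the select logic. banks_out holds the
    entries read in parallel for the current state: all of its
    labeled transitions plus the single default transition.
    Each labeled entry is ('L', label, ptr); the default entry
    is ('D', match_id, ptr)."""
    # All label comparisons happen in parallel in hardware (the CAM).
    hit = next((ptr for kind, label, ptr in banks_out
                if kind == 'L' and label == input_char), None)
    # The default entry also yields the Match ID for this state.
    _, match_id, default_ptr = next(e for e in banks_out if e[0] == 'D')
    next_ptr = hit if hit is not None else default_ptr
    return next_ptr, match_id
```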

  10. Packing
     • Labeled transitions are likely to be smaller than default transitions.
     • We can pack the labeled transitions into memory so that much less memory is wasted.
     • Packing reduces the number of banks needed to accommodate the largest number of transitions per state.
     • Diagram: a state in memory with N labeled transitions plus a default transition; unpacked, memory is wasted after each entry, while packed, the entries occupy a small contiguous amount with little unused space. A sketch of the row count follows.
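
A rough model of why packing reduces the bank rows a state occupies (all widths are illustrative parameters; the slide does not give exact field sizes, and bit-granular packing is an assumption here):

```python
def rows_needed_packed(n_labeled, lbl_bits, def_bits, bank_bits, n_banks):
    """Bank rows needed for one state when its labeled transitions
    are packed back-to-back rather than one per bank word."""
    packed_bits = n_labeled * lbl_bits + def_bits
    words = -(-packed_bits // bank_bits)   # ceil division
    return -(-words // n_banks)            # rows across the banks

# Unpacked, each transition would occupy a full bank word, so the
# state would instead need ceil((n_labeled + 1) / n_banks) rows.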

  11. Packing (2)
     • Minimum size = N_T * (8 + lg N_T) + N_S * (8 + lg N_M + lg N_T), where:
       • N_T = number of transitions
       • N_S = number of states
       • N_M = number of Match IDs
     • We can evaluate the space-savings potential from the ratio of average transitions per state to the number of possible transitions:
       • Transition ratio r = (average transitions per state) / 256
     • As seen on the next slide, finite automata with transition ratios below 0.5 are a good fit for this method (see the code sketch below).
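
The formula translated directly into code (with lg taken as a ceiling so field widths are whole bits, an assumption the slide does not spell out):

```python
from math import ceil, log2

def minimum_size_bits(n_t, n_s, n_m):
    """Minimum storage from the slide's formula:
    N_T*(8 + lg N_T) + N_S*(8 + lg N_M + lg N_T)."""
    lg = lambda x: ceil(log2(x))
    return n_t * (8 + lg(n_t)) + n_s * (8 + lg(n_m) + lg(n_t))
```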

  12. Memory Savings Potential
     [figure: memory-savings-potential chart; image not preserved in this transcript]

  13. Results

  14. Results from Conv. Program

     Ruleset   # of    Baseline Memory   DPICO D²FA Unpacked   DPICO D²FA Minimum   Trans.      %
               Rules   Size (bits)       Memory Size (bits)    Memory Size (bits)   Ratio (r)   Savings
     imap      46      16,923,528        715,139               571,171              0.018       96.5%
     ftp       76      11,723,205        534,688               418,552              0.017       96.4%
     netbios   633     2,198,208         66,556                54,388               0.011       97.5%
     nntp      13      8,008,479         330,339               268,809              0.017       96.6%
     exploit   122     56,596,540        7,355,320             5,001,178            0.046       91.2%

  15. Size and Speed Results

     # of Banks   LUT     REG    f_max (MHz)   BW_max (Mbps)
     2            114     129    144.9         1159.2
     4            183     204    122.3         978.4
     8            320     352    106.9         855.2
     16           698     642    98.7          789.6
     32           1672    1252   84.8          678.4
     64           3541    2346   78.9          631.2
     128          7659    4810   74.0          592.0
     256          16052   9563   68.1          544.8

     • Baseline design: 267.7 MHz (2141.6 Mbps); no LUT/REG figures; non-pipelined design.
     • Implemented in a Xilinx Virtex 4 SX35.
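
The BW_max column is simply f_max * 8, since the engine consumes one 8-bit input character per clock; for example:

```python
# One input byte is consumed per clock, so BW_max = f_max * 8.
for f_max in (144.9, 122.3, 267.7):
    print(f"{f_max} MHz -> {f_max * 8:.1f} Mbps")
# 144.9 MHz -> 1159.2 Mbps, matching the 2-bank row above;
# 267.7 MHz -> 2141.6 Mbps, matching the baseline design.
```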

  16. Size and Speed Results (2)
     [figure: results chart; only a QuickTime image placeholder survives in this transcript]

  17. Conclusion
     • Reduced storage while keeping a constant transition time.
     • A great space solution when the ratio of average transitions to the number of possible transitions is well below 0.5.
     • Minimizing the maximum number of transitions for a state will increase the speed of the design.
     • Pipelined solutions can run at 250 MHz in contemporary parts (e.g. Xilinx Virtex 4) by time-multiplexing multiple data streams into one engine.
       • (250 MHz equates to a 2 Gbps input data rate.)
     • The design is scalable using the tradeoff between memory and speed (up to 17 Gbps).

  18. Questions?

  19. Backup Slides

  20. Results Based on Kumar et al.

     Projected DFA baseline vs. projected DPICO minimum:

     Ruleset    Projected Baseline   Projected DPICO Minimum   Trans.      %
                Memory Size (bits)   Memory Size (bits)        Ratio (r)   Savings
     Cisco590   68,195,050           44,757,032                0.34        34.4%
     Cisco103   80,979,350           36,008,373                0.23        55.5%
     Cisco7     14,190,060           8,438,994                 0.29        40.5%
     Linux56    50,091,270           16,629,394                0.17        66.8%
     Linux10    46,654,764           26,996,384                0.29        42.1%
     Snort10    171,990,900          15,295,048                0.05        91.1%
     Bro648     20,749,008           3,936,212                 0.09        81.0%

     Projected DFA D²FA baseline vs. projected DPICO minimum:

     Ruleset    Projected DFA D²FA       Projected DPICO Minimum   Trans.      %
                Baseline Memory (bits)   Memory Size (bits)        Ratio (r)   Savings
     Cisco590   68,195,050               1,071,299                 0.008       98.4%
     Cisco103   80,979,350               36,008,373                0.010       98.2%
     Cisco7     14,190,060               8,438,994                 0.026       95.3%
     Linux56    50,091,270               16,629,394                0.016       97.0%
     Linux10    46,654,764               26,996,384                0.086       83.3%
     Snort10    171,990,900              15,295,048                0.016       97.3%
     Bro648     20,749,008               3,936,212                 0.004       99.0%
