I N the past few years, network traffic characterization has - PDF document

Deterministic Finite Automaton for Scalable Traffic Identification: the Power of Compressing by Range Rafael Antonello, Stenio Fernandes, Djamel Sadok, Judith Géza Szabo Kelner Ericsson Traffic Lab Federal University of Pernambuco (UFPE) Budapest, Hungary Recife, Brazil Abstract — Deep Packet Inspection (DPI) systems have been and Operating Systems’ (OS) kernel can keep up with packets becoming an important element in traffic measurement ever arriving at wire-speed, the pattern-matching component of the since port-based classification was deemed no longer appropriate, DPI system may not be able to deal with all the incoming due to protocol tunneling and misuses of well-defined ports. packets without strangling the processor, thus incurring losses. Current DPI systems express application signatures using Currently, DPI systems express patterns using regular regular expressions and it is usual to perform pattern matching expressions [10]. Therefore, it is natural for them to perform through the use of Finite Automaton (FA). Although DPI systems are essentially more accurate, they are also resource-intensive pattern matching through the use of Finite Automaton (FA). and do not scale well with link speeds. Looking to this area of State-space explosion of Deterministic FAs (DFA) may interest, this paper proposes a novel Deterministic Finite require an unacceptable amount of memory space [10]. Automaton, called Ranged Compressed Deterministic Finite Decreasing the complexity of matching procedures and Automaton (RCDFA), that compresses transitions without reducing the memory consumption of DFAs are the main additional memory lookups. Experimental results show that goals of research studies in this field. This paper proposes and RCDFA yields space savings of 97% over the original DFA and up to 93% better compression when compared to the DFA’s evaluates a novel DFA that aims to decrease space state-of-the-art compression techniques. requirements when used to perform pattern matching in DPI systems. Index Terms — DFA Optimizations, Deep Packet Inspection, The contributions of this paper are two-fold: first, we have Performance Evaluation, Computer Networks proposed a novel Deterministic Finite Automaton, called Ranged Compressed Deterministic Finite Automaton I. I NTRODUCTION (RCDFA). RCDFA is based on the following key observation: I N the past few years, network traffic characterization has several consecutive transitions lead to the same destination become an important tool for accurate network management state. Smart transition representations result in huge space and traffic profiling. It is well known that port-based savings over a standard DFA. Second, we have developed an classification is inaccurate, due to traffic tunneling, for algorithm for converting FAs from the original DFA to applications that use other ports assigned to well-known RCDFA. This implies that previously developed and well- services in order to evade firewalls rules, such as P2P tested algorithms for parsing from a regular expression to applications [4][7][5]. For that reason, traffic classification Non-Deterministic FAs (NFA) and DFAs can be reutilized. techniques have been recently relying on Deep Packet We also evaluate and compare the performance of RCDFA to Inspection (DPI) engines. Such systems frequently perform a state-of-the-art DFA variations for traffic identification. set of time-critical operations to verify certain application The remainder of this paper is organized as follows. Section patterns or behaviors, while trying to minimize packet II presents related work. Section III presents our new processing delays. Although DPI systems are essentially more Automaton model. Section IV shows the methodology used on accurate, they frequently perform a set of time-critical RCDFA evaluation and Section V presents experimental operations and are consequently resource-intensive. Therefore, results. We discuss our findings in Section VI. Concluding remarks and suggestions for future work are presented in if not proper designed, they may not scale well with link Section VII. speeds. In general, a DPI system works as follows: first it has to collect packets from the network interface cards (NIC), II. R ELATED W ORK create a data structure to represent incoming packets as Although flexible and expressive, automata-evaluated network flows (usually as a hash table), and forward or store regular expressions traditionally are memory-greedy and the received packets for further processing. After that it severely limit performance in most platforms. Developing DPI searches for well-known patterns within the packet payload systems at multi-gigabit rates is a difficult task as they need to (i.e. application signatures) for each flow. Pattern matching achieve high processing speeds while limiting memory procedures in DPIs are usually performed at the user-space consumption or access. Research studies have been adding level and are highly processing intensive, which causes some features to the original automata formalism in order to significant packet losses. In other words, even though NICs meet such speed and memory consumption requirements. 978-1-4673-0269-2/12/$31.00 c � 2012 IEEE 155

I N the past few years, network traffic characterization has - PDF document

Deterministic Finite Automaton for Scalable Traffic Identification: the Power of Compressing by Range Rafael Antonello, Stenio Fernandes, Djamel Sadok, Judith Gza Szabo Kelner Ericsson Traffic Lab Federal University of Pernambuco (UFPE)

Pre-boot RAM acquisition and compression Martijn Bogaard Student of Master in System and

Combinatorial Testing Rick Kuhn NIST Computer Security Division NIST Combinatorial Testing

Single Letter Formulas for Quantized Compressed Sensing with Gaussian Codebooks Alon Kipnis

gzip, tar Purpose file archiving -compressing multiple files into one smaller file

Inforce Data Compression Methods for Actuarial Modeling I f D t C i M th d f A t i l M d li

FULL YEAR RESULTS 2018 Disclaimer The information contained in this presentation document (the

SSL, GONE IN 30 SECONDS b r e a c h A BREACH beyond CRIME SSL, GONE IN 30 SECONDS AGENDA

Using JPEG to Compress Still Pictures Tyler Genter December 17, 2010 Tyler Genter Using JPEG to

Computing Sparse Representations in O(NlogN) time May 3, 2013 Tsung-Han Lin and H.T. Kung

POI360 Panoramic Mobile Video Telephony over LTE Cellular Networks Xiufeng Xie Xinyu Zhang

With Our Partner: A Program By: Letters from Soldiers Thank you so much for the birthday cake

Headquartered in Tulsa, Oklahoma, TESCORP distributes, fabricates, and services its line of

Linear Algebra in File Compression: SVD and DCT By: Andrew Fraser How Are Images Stored?

Linearly Compressed Pages: A Main Memory Compression Framework with Low Complexity and Low

and Fire Goal: Improve quality and efficiency of methods used to visualize smoke and fire Glenn

EXAR A NEW DIRECTION Mixed Signal and Data Management Solutions for a Connected World Forwar

Local compression and Word Equations Artur Je MPI, Germany 28 February 2013 Compression and

Baltic Marine Environment Protection Commission Task force on migratory fish species FISH-M

Regressive Spring Assembly Adjustable Bucket Sleeve (divider between the springs) Upper

Presenting Academic Work Engage, Talk, Visualize Jrg Cassens Academic Literacy Winter term

Product presentation Original Reloaded Product presentation contents LIGHTNESS 2 Original

ICLR PERC Fire webinar - PERC Introduction Michael Sznyi Flood Resilience Program Lead,

Guidelines for Preparation of Technical Paper Presentation at WATMAN International Conference 2020

Short Cambrex Corporation NYSE:CBM Stephen Saroki OBrien Greene & Co.

I N the past few years, network traffic characterization has - PDF document

Deterministic Finite Automaton for Scalable Traffic Identification: the Power of Compressing by Range Rafael Antonello, Stenio Fernandes, Djamel Sadok, Judith Gza Szabo Kelner Ericsson Traffic Lab Federal University of Pernambuco (UFPE)

Pre-boot RAM acquisition and compression Martijn Bogaard Student of Master in System and

Combinatorial Testing Rick Kuhn NIST Computer Security Division NIST Combinatorial Testing

Single Letter Formulas for Quantized Compressed Sensing with Gaussian Codebooks Alon Kipnis

gzip, tar Purpose file archiving -compressing multiple files into one smaller file

Inforce Data Compression Methods for Actuarial Modeling I f D t C i M th d f A t i l M d li

FULL YEAR RESULTS 2018 Disclaimer The information contained in this presentation document (the

SSL, GONE IN 30 SECONDS b r e a c h A BREACH beyond CRIME SSL, GONE IN 30 SECONDS AGENDA

Using JPEG to Compress Still Pictures Tyler Genter December 17, 2010 Tyler Genter Using JPEG to

Computing Sparse Representations in O(NlogN) time May 3, 2013 Tsung-Han Lin and H.T. Kung

POI360 Panoramic Mobile Video Telephony over LTE Cellular Networks Xiufeng Xie Xinyu Zhang

With Our Partner: A Program By: Letters from Soldiers Thank you so much for the birthday cake

Headquartered in Tulsa, Oklahoma, TESCORP distributes, fabricates, and services its line of

Linear Algebra in File Compression: SVD and DCT By: Andrew Fraser How Are Images Stored?

Linearly Compressed Pages: A Main Memory Compression Framework with Low Complexity and Low

and Fire Goal: Improve quality and efficiency of methods used to visualize smoke and fire Glenn

EXAR A NEW DIRECTION Mixed Signal and Data Management Solutions for a Connected World Forwar

Local compression and Word Equations Artur Je MPI, Germany 28 February 2013 Compression and

Baltic Marine Environment Protection Commission Task force on migratory fish species FISH-M

Regressive Spring Assembly Adjustable Bucket Sleeve (divider between the springs) Upper

Presenting Academic Work Engage, Talk, Visualize Jrg Cassens Academic Literacy Winter term

Product presentation Original Reloaded Product presentation contents LIGHTNESS 2 Original

ICLR PERC Fire webinar - PERC Introduction Michael Sznyi Flood Resilience Program Lead,

Guidelines for Preparation of Technical Paper Presentation at WATMAN International Conference 2020

Short Cambrex Corporation NYSE:CBM Stephen Saroki OBrien Greene &amp; Co.

Short Cambrex Corporation NYSE:CBM Stephen Saroki OBrien Greene & Co.