SMARTxAC SMARTxAC Collaboration agreement started on July 2003 h - - PowerPoint PPT Presentation

smartxac smartxac
SMART_READER_LITE
LIVE PREVIEW

SMARTxAC SMARTxAC Collaboration agreement started on July 2003 h - - PowerPoint PPT Presentation

SMARTxAC: Traffic Monitoring and Analysis System for high-speed links Pere Barlet Josep Sol-Pareta Jordi Domingo-Pascual Advanced Broadband Communications Center (CCABA) {pbarlet, pareta, jordid}@ac.upc.es Technical University


slide-1
SLIDE 1

Advanced Broadband Communications Center (CCABA) Technical University

  • f Catalonia (UPC)

SMARTxAC: Traffic Monitoring and

Analysis System for high-speed links

Pere Barlet Josep Solé-Pareta Jordi Domingo-Pascual {pbarlet, pareta, jordid}@ac.upc.es

http://www.ccaba.upc.es

Acknowledgment: This work has been partially supported by CESCA (SMARTxAC agreement) and MCyT (ref. TIC2002-04531-C04-02)

slide-2
SLIDE 2

SMARTxAC SMARTxAC

Collaboration agreement started on July 2003

h UPC (Technical University of Catalonia) h CESCA (Supercomputing Center of Catalonia)

Development and deployment of a low-cost monitoring

system for the Catalan R&D network

h Anella Cientifica (Scientific Ring) h Connects ~50 universities and research centers in Catalonia

Objectives

h Continuous monitoring and analysis of the Anella Cientifica h Gather knowledge about per institution usage h Detection of anomalies, irregular usage and attacks

Measurement of a full-duplex GE link

h Connection to RedIRIS (Spanish R&D) and to global Internet h Current traffic load ~700 Mbps / ~150 Kpps

slide-3
SLIDE 3

Anella Anella Cientifica Cientifica

Measurement point GigE full-duplex

slide-4
SLIDE 4

Main characteristics Main characteristics

Passive measurement

h Full capture (no sampling) h Equipment: DAG 4.3GE + optical splitters h Precise timestamping using GPS (Trimble Acutime 2000) h CAIDA CoralReef: Packet capture + flow aggregation

– CoralReef is going to be replaced shortly by SMART

Traffic analysis

h Analysis of all traffic at full-line rate h Header-only analysis due to:

– Performance reasons – Encryption techniques – Legal restrictions

h Permanent storage of all analysis results

Web-based graphical interface

h On-demand visualisation of all computed statistics

slide-5
SLIDE 5

Measurement scenario Measurement scenario

1

REDIRIS

Other regional nodes

ESPANIX GÉANT

Dedicated Ethernet segment (NFS) Dedicated Ethernet segment (NFS) Outgoing traffic Incoming traffic CISCO 6513 (Anella Científica) Juniper M-20 (RedIRIS) RedIRIS (Madrid) Internet connection 1 Gbps

ANELLA CIENTÍFICA (GbE)

RedIRIS

USA

Visualization web server (APACHE+PHP) Traffic Analysis System (Linux) Traffic Capture Platform (DAG 4.3GE+CoralReef)

slide-6
SLIDE 6

Flow classification approach Flow classification approach

Traditional 5-tuple flows are translated into 3-tuple

“classified flows” (<inst, dst, app>)

h IP addresses Institution and destination network

– Longest prefix match algorithm using a BGP dump

h Ports + protocol Application h Bidirectional aggregation

Unknown traffic and relevant traffic to detect anomalies

is logged with more details

Aggregation into accounting periods

h Daily, weekly and monthly data-aggregation

Higher aggregation than traditional flow classification

h E.g. disk occupation in “Anella Cientifica” reduced by >99%

– 5-tuple flows: ~25 GB/day – Classified flows: ~20 MB/day

slide-7
SLIDE 7

Traffic analysis statistics Traffic analysis statistics

Traffic statistics per institution

h Application time-series plot (bytes/sec, pkts/sec, flows/sec) h Institution breakdown h Destination network breakdown h Application breakdown h Protocol breakdown h Destination per applications breakdown h Destination per institution breakdown h Unknown ports log h Unknown IP addresses log h Unknown protocols log h Top-N ports and applications h Top-N IP addresses h Top-N protocols h Threshold-based anomaly detection graphs h Self-similarity estimation h Packet size distribution

slide-8
SLIDE 8

SMART SMART

SMART is currently under development

h Specifically designed to perform at gigabit speeds h Integration of capture engine and analysis system

– Only one measurement box runs capture and analysis processes – CoralReef is no longer needed and will not be used

h New statistics (e.g. ASxAS matrices) h Data anonymization h Anomaly detection capabilities h IPv6 support

Collaboration between UPC and NLANR/PMA

h Tested in one of NLANR/PMA OC192MON's located on

SDSC's TeraGrid Cluster

– http://pma.nlanr.net/Special/tera2.html

h IP trace from “Anella Cientifica” was collected for NLANR/PMA

– http://pma.nlanr.net/Special/cesc1.html

slide-9
SLIDE 9

Anomaly detection Anomaly detection

Detect anomalies based on changes in institution traffic

patterns has several difficulties

h Define an “ordinary” (expected) traffic profile per institution h Rule to decide which deviations are considered as an anomaly h Inherent variations of traffic by itself

– E.g. burstiness, day/night, workday/weekend, etc.

h Minimise number of false alarms

Anomaly detection (only header analysis) based on:

h Simple thresholds per institution (packets, bytes, flows, etc.)

– Already implemented and working

h Adaptive traffic prediction

– “Ordinary” traffic profile has not to be defined explicitly – First tests using “adaptive normalized least mean square error linear predictor” were very successful

h Combination of both methods to avoid limitations

– E.g. use of thresholds can mitigate prediction limitation when constant traffic changes occur

slide-10
SLIDE 10

Integration of Integration of SMARTxAC SMARTxAC and and CoMo CoMo

Collaboration Intel Research Cambridge - UPC Objective: Integration of SMARTxAC with CoMo

h CoMo has been designed as an open monitoring infrastructure h Migrate statistics computed by SMARTxAC as CoMo modules h Migrate SMARTxAC graphical interface into CoMo h Collaborate in the design and development of CoMo to:

– Identify limitations which can difficult such integration – Facilitate the development and integration of custom modules

Participation also in the design and development of

CoMo core

h Capture, export and query processes

slide-11
SLIDE 11

Online demo Online demo

http://smartxac.ccaba.upc.es

(access to this site is restricted due to data confidentiality) General information and sample graphs can be found at: http://www.ccaba.upc.es/smartxac

slide-12
SLIDE 12

Web Web-

  • based graphical interface

based graphical interface

slide-13
SLIDE 13

Monthly traffic per application Monthly traffic per application

Institution graphs are not shown in order to preserve institution confidentiality

slide-14
SLIDE 14

Weekly traffic per application Weekly traffic per application

slide-15
SLIDE 15

Application breakdown Application breakdown

slide-16
SLIDE 16

Traffic per AS group Traffic per AS group

slide-17
SLIDE 17

Traffic per AS group and application Traffic per AS group and application

slide-18
SLIDE 18

Top Top-

  • N known ports

N known ports

slide-19
SLIDE 19

Top Top-

  • N unknown ports

N unknown ports

slide-20
SLIDE 20

Top Top-

  • N protocols

N protocols

slide-21
SLIDE 21

Threshold Threshold-

  • based anomaly detection (

based anomaly detection (bps bps) )

slide-22
SLIDE 22

Threshold Threshold-

  • based anomaly detection (flows)

based anomaly detection (flows)

slide-23
SLIDE 23

Packet size CDF Packet size CDF

20 40 60 80 100 520 1020 1520 2020 2520 3020 3520 4020 4520 5020 Packet Size (bytes) Cumulative Percentage