1 NETAI 2018 JOHAN GARCIA 180824
JOHAN GARCIA TOPI KORHONEN DEPARTMENT OF COMPUTER SCIENCE KARLSTAD UNIVERSITY, SWEDEN
EFFICIENT DISTRIBUTION-DERIVED FEATURES FOR HIGH-SPEED ENCRYPTED - - PowerPoint PPT Presentation
EFFICIENT DISTRIBUTION-DERIVED FEATURES FOR HIGH-SPEED ENCRYPTED FLOW CLASSIFICATION JOHAN GARCIA TOPI KORHONEN DEPARTMENT OF COMPUTER SCIENCE KARLSTAD UNIVERSITY, SWEDEN 1 180824 NETAI 2018 JOHAN GARCIA PRESENTATION OUTLINE Problem
1 NETAI 2018 JOHAN GARCIA 180824
JOHAN GARCIA TOPI KORHONEN DEPARTMENT OF COMPUTER SCIENCE KARLSTAD UNIVERSITY, SWEDEN
2 NETAI 2018 JOHAN GARCIA 180824
Thanks to:
3 NETAI 2018 JOHAN GARCIA 180824
usage and support QoE
becomes unfeasible with encrypted flows
be used for classification of encrypted flows
4 NETAI 2018 JOHAN GARCIA 180824
Target use case
J Garcia, T Korhonen, R Andersson, F Västlund. Towards Video Flow Classification at One Million Encrypted Flows per Second. IEEE AINA 2018
5 NETAI 2018 JOHAN GARCIA 180824
6 NETAI 2018 JOHAN GARCIA 180824
7 NETAI 2018 JOHAN GARCIA 180824
A mixture of Gaussian distribution (gray), and a mixture of Beta distributions (blue)
8 NETAI 2018 JOHAN GARCIA 180824
A mixture of Gaussian distribution (gray), and a mixture of Beta distributions (blue) STATISTICAL MOMENTS MAY NOT ALWAYS CAPTURE THE FULL DISTRIBUTIONAL DIFFERENCE
9 NETAI 2018 JOHAN GARCIA 180824
10 NETAI 2018 JOHAN GARCIA 180824
Gaussian mixtures
11 NETAI 2018 JOHAN GARCIA 180824
formulas from LyX screeshot
12 NETAI 2018 JOHAN GARCIA 180824
13 NETAI 2018 JOHAN GARCIA 180824
14 NETAI 2018 JOHAN GARCIA 180824
Jensen-Shannon distance, Chi2, Kullback Leibler-divergence
1000 (200) Realizations of distribution mixtures 12 (5) instantiation of different nr of samples 12-5000 (10-100)
15 NETAI 2018 JOHAN GARCIA 180824
best (but have more bins)
and PROB in most cases for same bin nr
distribution (i.e Beta mixtures) gives larger difference
16 NETAI 2018 JOHAN GARCIA 180824
(packets) give better performance
consistently bad
distributions give worse performance
17 NETAI 2018 JOHAN GARCIA 180824
18 NETAI 2018 JOHAN GARCIA 180824
cellular network during Feb 2017
first 60 seconds of each flow
19 NETAI 2018 JOHAN GARCIA 180824
fa: Flow attributes – Non-distributional flow features ba: Basic statistics – Basic distribution-derived features mo: Statistical moments – Extended distribution-derived features bn: Histogram-based features – using a specific discretization method
22 NETAI 2018 JOHAN GARCIA 180824
23 NETAI 2018 JOHAN GARCIA 180824
24 NETAI 2018 JOHAN GARCIA 180824
Adap KSD best
26 NETAI 2018 JOHAN GARCIA 180824
Adap KSD best Early optimum Metric matters
27 NETAI 2018 JOHAN GARCIA 180824
Adap KSD best Early optimum Metric matters Fraction matters
28 NETAI 2018 JOHAN GARCIA 180824
moments by achieving:
(offline) computational complexity