Net2Text: Query-Guided Summarization of Network Forwarding Behaviors
Rüdiger Birkner, Dana Drachsler-Cohen, Martin Vechev, Laurent Vanbever
net2text.ethz.ch
NSDI ’18, April 11, 2018


slide-1
SLIDE 1

Net2Text: Query-Guided Summarization of Network Forwarding Behaviors

Rüdiger Birkner, Dana Drachsler-Cohen, Martin Vechev, Laurent Vanbever

NSDI ’18, April 11, 2018

net2text.ethz.ch

slide-2
SLIDE 2

[Figure: network map with nodes SEAT, NEWY, SUNV, LOSA, HOUS, ATLA, PHIL, DENV, KSCY, CHIC, INDI]

slide-3
SLIDE 3

[Figure: network map with nodes NEWY, SEAT, SUNV, LOSA, HOUS, ATLA, PHIL, DENV, KSCY, CHIC, INDI]

slide-4
SLIDE 4

Where is the traffic leaving in NEWY coming from?


slide-5
SLIDE 5

Where is the traffic leaving NEWY coming from?

Approach: Look at the entire forwarding state and all the traffic statistics to identify important destinations to reroute

Challenge: From a wealth of low-level data, extract the high-level insights

slide-6
SLIDE 6

Understanding how the network behaves can take hours

Fast reaction is required: Customer experience depends on it
Networks get more and more complex: New peerings, more routers, etc.

slide-7
SLIDE 7

What if you could simply ask the questions… and automatically get an answer?

slide-8
SLIDE 8

[Net2Text chat window]

Type a message…

slide-9
SLIDE 9

[Net2Text chat window]

Where is the traffic…

slide-10
SLIDE 10

[Net2Text chat window]

Question in natural language: Where is the traffic leaving in NEWY coming from?

slide-11
SLIDE 11

[Net2Text chat window]

Where is the traffic leaving in NEWY coming from?

slide-12
SLIDE 12

[Net2Text chat window]

Question in natural language: Where is the traffic leaving in NEWY coming from?

Summary in natural language: The traffic enters mostly in PHIL and goes to YouTube and Netflix.

slide-13
SLIDE 13

Net2Text has four stages: parsing, data retrieval, summarization, translation

Input: How is Google traffic to NEWY handled?

Workflow: NL Parser → Network database → Summarization → Translation

Output: The Google traffic to NEWY enters in BOST…

slide-14
SLIDE 14

Net2Text has four stages: parsing, data retrieval, summarization, translation

Input: How is Google traffic to NEWY handled?

Workflow: NL Parser → Network database → Summarization → Translation

Output: The Google traffic to NEWY enters in BOST…

slide-15
SLIDE 15

The parser maps the operator’s query to the internal query language

SELECT * FROM paths
 WHERE egress=NEWY
 AND dest=Google

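Input: How is Google traffic to NEWY handled?

Parse of the input: "How is … handled?" gives the Query Type; "Google" is an Organization and serves as the Destination; "NEWY" is a Router and serves as the Egress; together, "Google traffic to NEWY" forms the Traffic Identifier.

Output: the query above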
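The mapping from question to query can be illustrated with a toy parser. This is a minimal sketch and not Net2Text's parser: it assumes a small hand-written lexicon (`ROUTERS`, `ORGANIZATIONS`) and a single "to <router>" pattern for the egress, and the function name `parse_query` is hypothetical.

```python
import re

# Hypothetical lexicon; an assumption for this sketch only.
ROUTERS = {"NEWY", "BOST", "ATLA", "HOUS"}
ORGANIZATIONS = {"Google", "Yahoo", "Swisscom"}


def parse_query(question: str) -> str:
    """Map a question to the SQL-like query language shown on the slide (toy version)."""
    tokens = re.findall(r"[A-Za-z0-9]+", question)
    conditions = []
    # Look at each token together with the token that precedes it.
    for prev, tok in zip([""] + tokens, tokens):
        if tok in ROUTERS and prev.lower() == "to":
            conditions.append(f"egress={tok}")   # "to NEWY" -> egress
        elif tok in ORGANIZATIONS:
            conditions.append(f"dest={tok}")     # organization -> destination
    # Emit the egress condition first, as on the slide.
    conditions.sort(key=lambda c: 0 if c.startswith("egress") else 1)
    query = "SELECT * FROM paths"
    if conditions:
        query += " WHERE " + " AND ".join(conditions)
    return query


print(parse_query("How is Google traffic to NEWY handled?"))
```

A question without any recognized router or organization falls back to the unrestricted query, which corresponds to asking about the entire forwarding state.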

slide-16
SLIDE 16

Based on the query, Net2Text retrieves all relevant data

SELECT * FROM paths
 WHERE egress=NEWY
 AND dest=Google

Input: How is Google traffic to NEWY handled?

Workflow: NL Parser → Network database → Summarization → Translation

Output: The Google traffic to NEWY enters in BOST…

slide-17
SLIDE 17

The database maintains the forwarding state and traffic statistics


slide-18
SLIDE 18

The database maintains the forwarding state and traffic statistics

Table with one row per path (path 1, path 2, path 3, …, path n)
Columns: prefix, dest., ingress, egress, …, avg. bw
prefix: 8.8.8.0/24, 46.14.0.0/16, 81.63.0.0/17, 8.8.178.0/24, …
dest.: Google, Swisscom, Swisscom, Yahoo, …
ingress and egress: BOST, NEWY, ATLA, HOUS, …
avg. bw: 0.4 Mbps, 98.4 Mbps, 25.0 Mbps, 1.0 Mbps, …
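The retrieval step can be sketched with an in-memory SQLite table. This is an illustration under assumptions, not the Net2Text schema: the table layout, the column name `avg_bw_mbps`, and the helper `retrieve` are hypothetical, and although the individual values appear on the slide, the way they are paired into rows here is made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE paths ("
    "prefix TEXT, dest TEXT, ingress TEXT, egress TEXT, avg_bw_mbps REAL)"
)

# Illustrative rows: the pairing of values into rows is an assumption.
rows = [
    ("8.8.8.0/24", "Google", "BOST", "NEWY", 98.4),
    ("46.14.0.0/16", "Swisscom", "BOST", "ATLA", 0.4),
    ("81.63.0.0/17", "Swisscom", "NEWY", "ATLA", 25.0),
    ("8.8.178.0/24", "Yahoo", "NEWY", "HOUS", 1.0),
]
conn.executemany("INSERT INTO paths VALUES (?, ?, ?, ?, ?)", rows)


def retrieve(egress: str, dest: str) -> list:
    """Return all path entries for the given egress router and destination."""
    cur = conn.execute(
        "SELECT * FROM paths WHERE egress = ? AND dest = ?", (egress, dest)
    )
    return cur.fetchall()


print(retrieve("NEWY", "Google"))
```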

slide-19
SLIDE 19

All the data is summarized by identifying a few clusters

Retrieved paths (ingress, …, avg. bw):
path 1: BOST, …, 98.4 Mbps
path 2: BOST, …, 0.4 Mbps
…
path n: SFO, …, 16.1 Mbps

Input: How is Google traffic to NEWY handled?

Workflow: NL Parser → Network database → Summarization → Translation

Output: The Google traffic to NEWY enters in BOST…

slide-20
SLIDE 20

All the data is summarized by identifying a few clusters

Input: paths pertaining to Google traffic leaving in NEWY
Rows: path 1, path 2, path 3, path 4, …, path n
Columns: prefix, ingress, short. path, …, avg. bw
prefix: 8.8.8.0/24, 8.8.4.0/24, 66.102.0.0/20, 35.184.0.0/19, …
ingress: BOST, HOUS
short. path: T, F
avg. bw: 0.4 Mbps, 98.4 Mbps, 25.0 Mbps, 1.0 Mbps, …
path 4: 35.184.0.0/19, HOUS, F, …, 25.0 Mbps

slide-21
SLIDE 21

All the data is summarized by identifying a few clusters

Input: paths pertaining to Google traffic leaving in NEWY
Rows: path 1, path 2, path 3, path 4, …, path n
Columns: prefix, ingress, short. path, …, avg. bw
prefix: 8.8.8.0/24, 8.8.4.0/24, 66.102.0.0/20, 35.184.0.0/19, …
ingress: BOST, HOUS
short. path: T, F
avg. bw: 0.4 Mbps, 98.4 Mbps, 25.0 Mbps, 1.0 Mbps, …
path 4: 35.184.0.0/19, HOUS, F, …, 25.0 Mbps

Output: {BOST_i}, …

slide-22
SLIDE 22

Each cluster represents a path specification
A summary consists of multiple path specifications

Input: paths pertaining to Google traffic leaving in NEWY
Rows: path 1, path 2, path 3, path 4, …, path n
Columns: prefix, ingress, short. path, …, avg. bw
prefix: 8.8.8.0/24, 8.8.4.0/24, 66.102.0.0/20, 35.184.0.0/19, …
ingress: BOST, HOUS
short. path: T, F
avg. bw: 0.4 Mbps, 98.4 Mbps, 25.0 Mbps, 1.0 Mbps, …
path 4: 35.184.0.0/19, HOUS, F, …, 25.0 Mbps

Output: {BOST_i}, …
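One way to read "each cluster represents a path specification" is that a specification is a set of feature values, and its cluster is the set of paths that agree with all of them. The sketch below assumes that reading; the dictionary representation, the helper names `matches` and `covered_traffic`, and the pairing of values into rows are all assumptions made for illustration.

```python
# Illustrative rows; the individual values appear on the slide, their pairing is assumed.
paths = [
    {"prefix": "8.8.8.0/24", "ingress": "BOST", "shortest_path": True, "bw": 98.4},
    {"prefix": "8.8.4.0/24", "ingress": "BOST", "shortest_path": True, "bw": 0.4},
    {"prefix": "66.102.0.0/20", "ingress": "BOST", "shortest_path": False, "bw": 1.0},
    {"prefix": "35.184.0.0/19", "ingress": "HOUS", "shortest_path": False, "bw": 25.0},
]


def matches(spec: dict, path: dict) -> bool:
    """A path belongs to a cluster if it agrees with every feature value in the spec."""
    return all(path.get(feature) == value for feature, value in spec.items())


def covered_traffic(spec: dict, paths: list) -> float:
    """Total bandwidth (Mbps) of the paths described by the spec."""
    return sum(p["bw"] for p in paths if matches(spec, p))


# A summary is a list of path specifications, from coarse to detailed.
summary = [
    {"ingress": "BOST"},
    {"ingress": "BOST", "shortest_path": True},
]
for spec in summary:
    print(spec, round(covered_traffic(spec, paths), 1))
```

Adding a feature value to a specification can only shrink its cluster, which is why more detailed specifications cover less traffic.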

slide-23
SLIDE 23

All the data is summarized by identifying a few clusters

Input: paths pertaining to Google traffic leaving in NEWY
Rows: path 1, path 2, path 3, path 4, …, path n
Columns: prefix, ingress, short. path, …, avg. bw
prefix: 8.8.8.0/24, 8.8.4.0/24, 66.102.0.0/20, 35.184.0.0/19, …
ingress: BOST, HOUS
short. path: T, F
avg. bw: 0.4 Mbps, 98.4 Mbps, 25.0 Mbps, 1.0 Mbps, …
path 4: 35.184.0.0/19, HOUS, F, …, 25.0 Mbps

Output: {BOST_i}, {BOST_i, T_sp}, {BOST_i, T_sp, ATL_w}

slide-24
SLIDE 24

Path specifications are translated back to natural language

{BOST_i}, {BOST_i, T_sp}, {BOST_i, T_sp, ATL_w}

Input: How is Google traffic to NEWY handled?

Workflow: NL Parser → Network database → Summarization → Translation

Output: The Google traffic to NEWY enters in BOST…

slide-25
SLIDE 25

The translation uses templates to obtain natural language from path specifications

Input: {BOST_i}, {BOST_i, T_sp}, {BOST_i, T_sp, ATL_w}

Template: "The" + Traffic Identifier ("Google", "to NEWY") + Description ("enters in BOST")

Output: The Google traffic to NEWY enters in BOST…
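Template-based translation can be sketched as string formatting over a path specification. This is a minimal sketch, not the Net2Text templates: the function names `describe` and `translate` are hypothetical, and the wording "follows the shortest path" is an assumed phrase for the shortest-path feature. The fallback clause reuses the sentence "Traffic is being forwarded." that appears later in the deck.

```python
def describe(spec: dict) -> str:
    """Turn one path specification into a description clause."""
    clauses = []
    if "ingress" in spec:
        clauses.append(f"enters in {spec['ingress']}")
    if spec.get("shortest_path") is True:
        # Assumed wording for the shortest-path feature.
        clauses.append("follows the shortest path")
    # With no features set, nothing specific can be said about the traffic.
    return " and ".join(clauses) if clauses else "is being forwarded"


def translate(dest: str, egress: str, spec: dict) -> str:
    """Fill the template: 'The' + traffic identifier + description."""
    return f"The {dest} traffic to {egress} {describe(spec)}."


print(translate("Google", "NEWY", {"ingress": "BOST"}))
print(translate("Google", "NEWY", {"ingress": "BOST", "shortest_path": True}))
```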

slide-26
SLIDE 26

Net2Text has four stages: parsing, data retrieval, summarization, translation

Input: How is Google traffic to NEWY handled?

Workflow: NL Parser → Network database → Summarization → Translation

Output: The Google traffic to NEWY enters in BOST…

slide-27
SLIDE 27

Net2Text

1 Summarization: from question to succinct answer
2 Scaling: summarizing fast
3 Performance & operator interviews: summaries within a few seconds

slide-28
SLIDE 28

Net2Text

1 Summarization: from question to succinct answer
2 Scaling: summarizing fast
3 Performance & operator interviews: summaries within a few seconds

slide-29
SLIDE 29

Finding a summary of the network-wide forwarding state is simple

"Traffic is being forwarded."

slide-30
SLIDE 30

Finding a summary of the network-wide forwarding state is simple

"Traffic from LOSA to 35.184.0.0/19, which is owned by Google, is leaving the network in CHIC and takes the path SUNV, DENV, KSCY, INDI to CHIC."

slide-31
SLIDE 31

Explainability: amount of detail provided by the summary
Coverage: amount of data described by the summary

slide-32
SLIDE 32

[Plot: Explainability versus Coverage, with the summary "Traffic is being forwarded." marked]

slide-33
SLIDE 33

[Plot: Explainability versus Coverage, with the summary "Traffic from LOSA to 35.184.0.0/19, which is owned by Google, …" marked]

slide-34
SLIDE 34

[Plot: Explainability versus Coverage, annotated "better"]

slide-35
SLIDE 35

Summarization is an optimization problem guided by the summary score

Score: Weighted sum of the amount of traffic covered by each path specification in the summary.

slide-36
SLIDE 36

Summarization is an optimization problem guided by the summary score

Score: Weighted sum of the amount of traffic covered by each path specification in the summary.

Coverage: the amount of traffic covered by each path specification

slide-37
SLIDE 37

Summarization is an optimization problem guided by the summary score

Score: Weighted sum of the amount of traffic covered by each path specification in the summary.

Explainability: weights based on the level of detail of the path specification

slide-38
SLIDE 38

Summarization is an optimization problem guided by the summary score

Score: Weighted sum of the amount of traffic covered by each path specification in the summary.

Goal: Find path specifications that maximize the score.

slide-39
SLIDE 39

[Plot: Explainability versus Coverage, with the point "all the data in all details" marked]

slide-40
SLIDE 40

Summarization is an optimization problem guided by the summary score and a size restriction

Score: Weighted sum of the amount of traffic covered by each path specification in the summary.

Goal: Find k path specifications, each of size at most t, that maximize the score.
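The score and the size restriction can be written down in a few lines. This is a sketch under stated assumptions, not the scoring function used by Net2Text: the deck only says the weights depend on the level of detail, so the default weight here (the number of feature values in a specification) is an assumption, and the names `score` and `within_size_limits` are hypothetical.

```python
def covered(spec: dict, paths: list) -> float:
    """Traffic of all paths that agree with every feature value in the spec."""
    return sum(p["bw"] for p in paths
               if all(p.get(f) == v for f, v in spec.items()))


def score(summary: list, paths: list, weight=len) -> float:
    """Weighted sum of the traffic covered by each path specification.

    `weight` maps a spec to its explainability weight; using the number of
    feature values (len) is an assumption made for this sketch.
    """
    return sum(weight(spec) * covered(spec, paths) for spec in summary)


def within_size_limits(summary: list, k: int, t: int) -> bool:
    """Size restriction: at most k specifications, each with at most t features."""
    return len(summary) <= k and all(len(spec) <= t for spec in summary)


# Made-up example data.
paths = [
    {"ingress": "BOST", "egress": "NEWY", "bw": 10.0},
    {"ingress": "BOST", "egress": "ATLA", "bw": 5.0},
    {"ingress": "HOUS", "egress": "NEWY", "bw": 1.0},
]
summary = [{"ingress": "BOST"}, {"ingress": "BOST", "egress": "NEWY"}]
print(score(summary, paths))                 # 1 * 15.0 + 2 * 10.0 = 35.0
print(within_size_limits(summary, k=3, t=3))
```

Under this weighting an empty specification scores zero, which matches the intuition that "Traffic is being forwarded." covers everything but explains nothing.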

slide-41
SLIDE 41

k = 3, t = 3

Ø, Ø, Ø

slide-42
SLIDE 42

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø

slide-43
SLIDE 43

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
…

slide-44
SLIDE 44

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

slide-45
SLIDE 45

The search space is exponential in the number of path specifications and feature values

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

slide-46
SLIDE 46


Due to the size of the search space, exhaustive exploration is not feasible
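A rough count shows why exhaustive exploration is out of reach. The sketch below assumes that a path specification picks at most one value per feature and that a summary is an unordered choice of k distinct specifications; both are modelling assumptions. The node and egress counts come from the evaluation setup later in the deck, while the figure of 1000 destination organizations is invented for illustration.

```python
from itertools import combinations
from math import comb, prod


def count_path_specs(values_per_feature: dict, t: int) -> int:
    """Number of non-empty specs with at most t features, one value per feature."""
    counts = list(values_per_feature.values())
    return sum(
        prod(chosen)                         # value choices for this feature subset
        for size in range(1, t + 1)
        for chosen in combinations(counts, size)
    )


def count_summaries(values_per_feature: dict, k: int, t: int) -> int:
    """Number of ways to pick k distinct specifications, ignoring order."""
    return comb(count_path_specs(values_per_feature, t), k)


# 25 nodes and 10 egresses as in the evaluation setup; 1000 destinations is made up.
features = {"ingress": 25, "egress": 10, "destination": 1000, "shortest_path": 2}
print(count_path_specs(features, t=3))
print(count_summaries(features, k=3, t=3))
```

Even with these modest numbers there are several hundred thousand candidate specifications, and the number of three-specification summaries grows roughly with the cube of that.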

slide-47
SLIDE 47

Net2Text

1 Summarization: from question to succinct answer
2 Scaling: summarizing quickly
3 Performance & operator interviews: summaries within a few seconds

slide-48
SLIDE 48

Net2Text relies on two optimizations

Optimization 1: Approximation. Reduce the search space
Optimization 2: Sampling. Reduce the input data

slide-49
SLIDE 49

Optimization 1: Approximation. Reduce the search space

slide-50
SLIDE 50

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

slide-51
SLIDE 51

The search space contains two types of edges: blue edges that increase coverage

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

Node label: Maximal coverage

slide-52
SLIDE 52

The search space contains two types of edges: red edges that increase explainability

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

Node labels: Maximal coverage, Maximal explainability

slide-53
SLIDE 53

Net2Text reduces the search space to solutions that balance coverage and explainability

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

slide-54
SLIDE 54

Net2Text reduces the search space to solutions that balance coverage and explainability

Ø, Ø, Ø
{LOSA_i}, Ø, Ø
{SUNV_e}, Ø, Ø
{LOSA_i}, {NEWY_e}, Ø
{SUNV_e, LOSA_i}, Ø, Ø
{SUNV_e}, {SUNV_e, LOSA_i}, Ø
{LOSA_i}, {NEWY_e}, {Yahoo_d}
{SUNV_e, LOSA_i, Google_d}, {SUNV_e, NEWY_i, Yahoo_d}, {HOUS_e, NEWY_i, Yahoo_d}
…

Balanced coverage and explainability: {SUNV_e}, {SUNV_e, LOSA_i}, {SUNV_e, LOSA_i, Yahoo_d}

slide-55
SLIDE 55

Net2Text reduces the search space to solutions that balance coverage and explainability

Graph has a monotonicity property: The child’s score is always higher
Net2Text greedily explores the graph: Always follow most promising path
Guaranteed lower bound on the score: Solution is not far off from best solution
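Greedy exploration of the search graph can be sketched as follows. This is an illustration under assumptions and not the Net2Text algorithm: the score uses the number of feature values as the explainability weight, children are generated either by adding a one-feature specification or by adding one feature value to an existing specification, and the search stops as soon as no child improves the score. With this toy score a child is not always better than its parent, so the explicit stopping check stands in for the monotonicity property named on the slide.

```python
def covered(spec, paths):
    """Traffic of all paths that agree with every feature value in the spec."""
    return sum(p["bw"] for p in paths
               if all(p.get(f) == v for f, v in spec.items()))


def score(summary, paths):
    """Assumed score: number of features in a spec times the traffic it covers."""
    return sum(len(spec) * covered(spec, paths) for spec in summary)


def children(summary, feature_values, k, t):
    """Neighbours in the search graph of a given summary."""
    result = []
    if len(summary) < k:                      # add a specification (more coverage)
        result += [summary + [{f: v}] for f, v in feature_values]
    for i, spec in enumerate(summary):        # add a feature (more explainability)
        if len(spec) < t:
            for f, v in feature_values:
                if f not in spec:
                    refined = dict(spec, **{f: v})
                    result.append(summary[:i] + [refined] + summary[i + 1:])
    # Discard summaries that contain the same specification twice.
    return [c for c in result
            if len({frozenset(s.items()) for s in c}) == len(c)]


def greedy_summary(paths, features, k, t):
    """Always follow the child with the highest score until nothing improves."""
    feature_values = sorted({(f, p[f]) for p in paths for f in features})
    summary = []
    while True:
        candidates = children(summary, feature_values, k, t)
        if not candidates:
            return summary
        best = max(candidates, key=lambda c: score(c, paths))
        if score(best, paths) <= score(summary, paths):
            return summary
        summary = best


# Made-up example data.
paths = [
    {"ingress": "BOST", "egress": "NEWY", "bw": 10.0},
    {"ingress": "BOST", "egress": "ATLA", "bw": 5.0},
    {"ingress": "HOUS", "egress": "NEWY", "bw": 1.0},
]
result = greedy_summary(paths, ["ingress", "egress"], k=2, t=2)
print(result, score(result, paths))
```

On this data the search ends with one coarse specification (ingress BOST) and one detailed specification (ingress BOST, egress NEWY), the same coarse-to-detailed shape as the example summary earlier in the deck.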

slide-56
SLIDE 56

Optimization 2: Sampling. Reduce the input data

slide-57
SLIDE 57

Network traffic is highly skewed across multiple levels

Level 1: Traffic distribution. Few destinations carry most of the traffic
Level 2: Routing and network topology. Repetitive forwarding patterns
Insight: Network traffic is repetitive and redundant

slide-58
SLIDE 58

Net2Text uses redundancy in the data to speed up summarization by sampling

Problem: Net2Text iterates over all entries at least once
Insight: Summary is resilient to loss of redundant information
Solution: Reduce input data by sampling
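The effect of sampling on skewed data can be shown with a small experiment. This sketch assumes uniform random sampling of entries, which the deck does not specify; the helper names `sample_paths` and `traffic_share` and the synthetic data set are made up for illustration.

```python
import random


def sample_paths(paths, rate, seed=0):
    """Keep each entry independently with probability `rate` (uniform sampling)."""
    rng = random.Random(seed)
    return [p for p in paths if rng.random() < rate]


def traffic_share(paths, feature, value):
    """Fraction of the total traffic whose `feature` equals `value`."""
    total = sum(p["bw"] for p in paths)
    if total == 0:
        return 0.0
    return sum(p["bw"] for p in paths if p[feature] == value) / total


# Synthetic, skewed data: nine out of ten entries enter in BOST.
paths = [{"ingress": "BOST" if i % 10 else "HOUS", "bw": 1.0}
         for i in range(100_000)]

sampled = sample_paths(paths, rate=1 / 100)
print(len(paths), traffic_share(paths, "ingress", "BOST"))
print(len(sampled), traffic_share(sampled, "ingress", "BOST"))
```

Because the dominant pattern is repeated so often, the share estimated from about one percent of the entries stays close to the share computed over all of them, while the summarization step has far fewer entries to iterate over.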

slide-59
SLIDE 59

Net2Text

1 Summarization: from question to succinct answer
2 Scaling: summarizing fast
3 Performance & operator interviews: summaries within a few seconds

slide-60
SLIDE 60

Net2Text needs to be quick and applicable

Aspect 1: Performance. End-to-end timing
Aspect 2: Applicability. Operator interviews

slide-61
SLIDE 61

Aspect 1: Performance. End-to-end timing

slide-62
SLIDE 62

Pushing Net2Text to its limits by summarizing the entire forwarding state

Setup:
ATT North America from Topology Zoo (25 nodes, 10 of them egresses)
Full routing tables (~650k prefixes)
Four features: ingress, egress, destination, shortest path

Question: How is traffic being forwarded?

slide-63
SLIDE 63

[Plot: score w.r.t. no sampling (0.0 to 1.0) versus time (s); series: Net2Text with no sampling]

slide-64
SLIDE 64

Net2Text finds good summaries within seconds thanks to sampling

[Plot: score w.r.t. no sampling (0.0 to 1.0) versus time (s); series: no sampling, 1/10, 1/1000]

slide-65
SLIDE 65

Baseline is slightly faster than Net2Text, but not as resilient to sampling

Baseline (Greedy Heuristic): Pick largest path aggregate

[Plot: score w.r.t. no sampling (0.0 to 1.0) versus time (s)]

slide-66
SLIDE 66

Only sampling higher than 1/5k has a significant effect on the score

[Plot: score w.r.t. no sampling (0.0 to 1.0) versus sampling rate (1/1 to 1/1M)]

slide-67
SLIDE 67

Net2Text needs to be quick and applicable

Aspect 2: Applicability. Operator interviews

slide-68
SLIDE 68

We asked various operators about Net2Text; they found it useful

Assistants: Operators see value of assistants in their daily tasks. Support in all time-consuming tasks
Questions: Supported questions are relevant. Especially “Where is the traffic coming from?”
NL I/O: NL is useful, especially for less technical people. Operators don’t mind using query languages

slide-69
SLIDE 69

Net2Text assists network operators by summarizing the forwarding state

Net2Text answers questions in natural language, and the supported queries are relevant
Net2Text presents a summary that balances coverage and explainability
Net2Text responds in a timely manner with a succinct summary in natural language

slide-70
SLIDE 70

net2text.ethz.ch

Rüdiger Birkner, Dana Drachsler-Cohen, Martin Vechev, Laurent Vanbever