Apoorv Shukla| NetAI’19
Runtime Verification of P4 Switches with Reinforcement Learning
Apoorv Shukla (TU Berlin)
with Kevin Nico Hudemann (TU Berlin), Artur Hecker (Huawei), Stefan Schmid (Vienna Uni.)
Runtime Verification of P4 Switches with Reinforcement Learning - - PowerPoint PPT Presentation
Runtime Verification of P4 Switches with Reinforcement Learning Apoorv Shukla (TU Berlin) with Kevin Nico Hudemann (TU Berlin), Artur Hecker (Huawei), Stefan Schmid (Vienna Uni.) Apoorv Shukla| NetAI19 P4 [1] : Data plane Programming Language
Apoorv Shukla| NetAI’19
with Kevin Nico Hudemann (TU Berlin), Artur Hecker (Huawei), Stefan Schmid (Vienna Uni.)
Apoorv Shukla| NetAI’19
[1] P. Bosshart, D. Daly, G. Gibby, M. Izzardy, N. McKeown, J. Rexford, C. Schlesinger, D. Talaycoy, A. Vahdat, G. Varghese, D. Walker. P4: Programming Protocol-Independent Packet Processors. SIGCOMM’ 14.
2
Apoorv Shukla| NetAI’19
3
Ingress Match- Action
Packet Replication Engine (PRE)
Packet
Egress Parser Egress Match- Action Egress Deparser Ingress Deparser Buffer Queuing Engine (BQE) Ingress Parser
Apoorv Shukla| NetAI’19
4
Apoorv Shukla| NetAI’19
5
Apoorv Shukla| NetAI’19
6
Apoorv Shukla| NetAI’19
7
Apoorv Shukla| NetAI’19
8
Apoorv Shukla| NetAI’19
9
Apoorv Shukla| NetAI’19
10
Apoorv Shukla| NetAI’19
11
Apoorv Shukla| NetAI’19
12
Credit: https://www.kdnuggets.com/2018/03/5-things-reinforcement-learning.html
Apoorv Shukla| NetAI’19
13
Apoorv Shukla| NetAI’19
14
Apoorv Shukla| NetAI’19
15
Apoorv Shukla| NetAI’19
16
Apoorv Shukla| NetAI’19
17
Apoorv Shukla| NetAI’19
P4 Network
P4 Switch
P4Runtime
Control Plane User written queries Agent Reward System P4RL
fuzz action
Reward
monitor behaviour
18
Apoorv Shukla| NetAI’19
19
Apoorv Shukla| NetAI’19
20
Apoorv Shukla| NetAI’19
21
➔ P4RL generates ~3× rewards
Apoorv Shukla| NetAI’19
➔ P4RL up to 4.42× faster
22
Apoorv Shukla| NetAI’19
23
Credit: https://www.coopertoons.com/education/haltingproblem/haltingproblem.html
Apoorv Shukla| NetAI’19
24
Apoorv Shukla| NetAI’19
25
1 . G e t c
t r
p l a n e c
f i g P4 Switch P4Runtime Control Plane User written queries Agent Reward System
P4RL
fuzz action
Reward
monitor behavior
P4 Network
Contact: apoorv@inet.tu-berlin.de Code: gitlab.inet.tu-berlin.de/apoorv/P4ML