Runtime Verification of P4 Switches with Reinforcement Learning - PowerPoint PPT Presentation

Runtime Verification of P4 Switches with Reinforcement Learning Apoorv Shukla (TU Berlin) with Kevin Nico Hudemann (TU Berlin), Artur Hecker (Huawei), Stefan Schmid (Vienna Uni.) Apoorv Shukla| NetAI’19

P4 [1] : Data plane Programming Language Domain-specific high-level language for data plane programming • Support for user-defined custom protocols, target independence, • etc. [1] P. Bosshart, D. Daly, G. Gibby, M. Izzardy, N. McKeown, J. Rexford, C. Schlesinger, D. Talaycoy, A. Vahdat, G. Varghese, D. Walker. P4: Programming Protocol-Independent Packet Processors. SIGCOMM’ 14. Apoorv Shukla| NetAI’19 2

P4 Pipeline: Complex Buffer Egress Egress Ingress Packet Ingress Egress Match- Ingress Parser Queuing Deparser Parser Match- Replication Deparser Action Engine Engine Action (BQE) (PRE) Packet PSA Architecture with programmable (yellow) and non- programmable blocks (grey) Apoorv Shukla| NetAI’19 3

P4: Multiple versions and platforms Versions: P4 14 & P4 16 • Platforms: bmv2, Tofino, eBPF, XDP • Platform-specific implementations • Interplay between programmable and non-programmable blocks gets complex! Apoorv Shukla| NetAI’19 4

Bugs happen Bugs related to memory safety: buffer overflow, invalid memory • accesses (detectable by static analysis) Runtime bugs related to checksum, ECMP/hash-calculation, • platform-dependent, etc. Apoorv Shukla| NetAI’19 5

Runtime bug detection is hard P4 is half a program; forwarding rules populated at runtime • Static Analysis prone to false positives: insufficient • Switch does not throw any runtime exceptions: hard to catch • This talk: P4 Runtime bug Detection! Apoorv Shukla| NetAI’19 6

Example: Platform-Independent Bug L3 switch parser of P4 language tutorials does not validate IPv4 • ihl Packets with IP options are forwarded with wrong checksum • Apoorv Shukla| NetAI’19 7

Motivating Example: Platform-Dependent Bug Conflicting forwarding decisions can lead to unexpected behavior • Dependent on implementation of packet replication engine (PRE) • More bug examples in the paper! Apoorv Shukla| NetAI’19 8

Problem Statement Is it possible to automatically detect runtime bugs in P4 switches? Apoorv Shukla| NetAI’19 9

Goal Design a system which automatically detects runtime bugs • Detects both: platform-dependent and –independent bugs • Is non-intrusive: no changes to the P4 program or switch • Apoorv Shukla| NetAI’19 10

Approach in a nutshell Use fuzzing, and guide it through reinforcement learning agent • Generate +ve rewards if an anomaly is detected in the feedback • Feedback also guides the agent further • Apoorv Shukla| NetAI’19 11

P4RL P4RL Agent – Guides Fuzzing • p4q – Query Language for expressivity, reducing input search • space Agent S t R t A t R t+1 Environment S t+1 Credit: https://www.kdnuggets.com/2018/03/5-things-reinforcement-learning.html Apoorv Shukla| NetAI’19 12

P4RL Reinforcement Learning States: Sequence of bytes forming the packet header • Actions: Add/modify/delete bytes at position X • 1, if the packet triggered a bug Rewards: • 0, otherwise Apoorv Shukla| NetAI’19 13

Reducing Input Search Space for Fuzzing Pre-generated dictionary created using control plane • configuration, compiled P4 program and p4q queries Compiled P4 program in JSON format aids in knowing accepted • header layouts Check boundary values first for header fields by queries • Apoorv Shukla| NetAI’19 14

Query Language: p4q Goal: Specify expected P4 switch behavior • If-then-else conditional statements • Common boolean expressions & relational operators • (ing.hdr.ipv4 & ing.hdr.ipv4.version !=4, egr.egress_port == False, ) Apoorv Shukla| NetAI’19 15

P4RL Agent-guided Fuzzing Apoorv Shukla| NetAI’19 16

P4RL DDQN Combination of double Q-learning and deep Q networks with a • simple form of prioritized experience replay Select next action based upon the result of feeding current • environment state to neural network Two separate neural networks for action selection and evaluation • Apoorv Shukla| NetAI’19 17

P4RL Workflow P4RL Agent P4 Network 2. Select 4. Get fuzz Reward action 3. Send packets & User written Reward P4Runtime Control monitor behaviour P4 Switch queries System Plane 1. Get control plane config Apoorv Shukla| NetAI’19 18

Evaluation Strategy Target: Publicly available L3 (basic.p4) switch • (simple_switch_grpc) implementation Baseline: Simple Agent relying on random action selection • Metrics: • Mean Cumulative Reward (MCR) over 10 runs • Bug Detection Time • Apoorv Shukla| NetAI’19 19

Bugs found by P4RL in publicly available programs PI – Platform-independent PD – Platform-dependent Apoorv Shukla| NetAI’19 20

Learning Performance: P4RL Agent vs. Baseline ➔ P4RL generates ~3 × rewards Apoorv Shukla| NetAI’19 21

Detection Time Speedup: P4RL Agent vs. Baseline ➔ P4RL up to 4.42 × faster Apoorv Shukla| NetAI’19 22

Limitations: Undecidability Yes <Input> P4RL engine No Credit: https://www.coopertoons.com/education/haltingproblem/haltingproblem.html Apoorv Shukla| NetAI’19 23

Conclusion P4RL’s machine learning-guided fuzzing enables detection of • complex runtime bugs (non-intrusively) Identifies platform-dependent and -independent bugs • Ensure correctness in P4 deployments • Apoorv Shukla| NetAI’19 24

Summary Agent P4RL P4 Network 2. Select 4. Get fuzz action Reward User Control Reward P4 written P4Runtime Plane System Switch queries 3. Send packets & monitor behavior 1 . G e t c o n t r o l p l a n e c o n f i g Contact: apoorv@inet.tu-berlin.de Code: gitlab.inet.tu-berlin.de/apoorv/P4ML Apoorv Shukla| NetAI’19 25

Runtime Verification of P4 Switches with Reinforcement Learning - PowerPoint PPT Presentation

Runtime Verification of P4 Switches with Reinforcement Learning Apoorv Shukla (TU Berlin) with Kevin Nico Hudemann (TU Berlin), Artur Hecker (Huawei), Stefan Schmid (Vienna Uni.) Apoorv Shukla| NetAI19 P4 [1] : Data plane Programming Language

SERIES 500 SWITCHES SLIDE SWITCHES SWITCHES TACT SPECIFICATIONS Contact Rating: Gold, 0.4 VA

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

TenantGuard: Scalable Runtime Verification of Cloud-Wide VM-Level Network Isolation Han Song

DIVS DL/ID Verification Systems Verification of Legal Status DIVS Passport Verification

1 AGENDA Limit Switches Production Overview Standards Overview Limit switches

Microprocessors & Interfacing Input switches Basics of switches Keypads

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

Testing Concurrency Runtime via a Testing Concurrency Runtime via a Stochastic Stress Framework

Integration of Runtime Verification into Metamodeling F. Macias T. Scheffel M. Schmitz R. Wang

Leveraging DTrace for runtime verification Carl Martin Rosenberg June 7th, 2016 Department of

A Scala API for Runtime Verification Klaus Havelund Jet Propulsion Laboratory Pasadena,

ECE 3574: Applied Software Design Introduction to Qt The goal of todays meeting it to learn

T2: A Second Generation OS For Introduction Embedded Sensor Networks TinyOS-2.x Core

Cloudy With a Chance of Misconceptions Dominik Wermke* Nicolas Huaman Christian Stransky

Making a Language Cross Platform Libraries and Tooling Gwen Mittertreiner Facebook Gwen

SWARM Extreme Jitendra Bothra Baturalp Torun jitendrabothra@gmail.com baturalp@gmail.com

V8 engine of Node.js on IA: JavaScript-JITTED x86 machine code mapping profiling support and X87

TUHH Institute of Telematics TUHH Hamburg University of Technology Hamburg University of

Outline Introduction Contribution: Novel Vectorization and Mapping Workflow.

Runtime Verification of P4 Switches with Reinforcement Learning - PowerPoint PPT Presentation

Runtime Verification of P4 Switches with Reinforcement Learning Apoorv Shukla (TU Berlin) with Kevin Nico Hudemann (TU Berlin), Artur Hecker (Huawei), Stefan Schmid (Vienna Uni.) Apoorv Shukla| NetAI19 P4 [1] : Data plane Programming Language

SERIES 500 SWITCHES SLIDE SWITCHES SWITCHES TACT SPECIFICATIONS Contact Rating: Gold, 0.4 VA

Reinforcement Learning AIMA Chapters: 21.1, 21.2, 21.3. Sutton and Barto, Reinforcement Learning:

TenantGuard: Scalable Runtime Verification of Cloud-Wide VM-Level Network Isolation Han Song

DIVS DL/ID Verification Systems Verification of Legal Status DIVS Passport Verification

1 AGENDA Limit Switches Production Overview Standards Overview Limit switches

Microprocessors &amp; Interfacing Input switches Basics of switches Keypads

RL Overview of topics About Reinforcement Learning The Reinforcement Learning Problem

Reinforcement Learning UMaine COS 470/570 Introduction to AI Why reinforcement learning?

Reinforcement Learning and Simulation-Based Search David Silver Reinforcement Learning and

Reinforcement Learning Reinforcement Learning Reinforcement Learning in a nutshell g Imagine

Safe Reinforcement Learning Philip S. Thomas Stanford CS234: Reinforcement Learning, Guest

Reinforcement Learning Timothy Chou Charlie Tong Vincent Zhuang April 19, 2016 Reinforcement

Testing Concurrency Runtime via a Testing Concurrency Runtime via a Stochastic Stress Framework

Integration of Runtime Verification into Metamodeling F. Macias T. Scheffel M. Schmitz R. Wang

Leveraging DTrace for runtime verification Carl Martin Rosenberg June 7th, 2016 Department of

A Scala API for Runtime Verification Klaus Havelund Jet Propulsion Laboratory Pasadena,

ECE 3574: Applied Software Design Introduction to Qt The goal of todays meeting it to learn

T2: A Second Generation OS For Introduction Embedded Sensor Networks TinyOS-2.x Core

Cloudy With a Chance of Misconceptions Dominik Wermke* Nicolas Huaman Christian Stransky

Making a Language Cross Platform Libraries and Tooling Gwen Mittertreiner Facebook Gwen

SWARM Extreme Jitendra Bothra Baturalp Torun jitendrabothra@gmail.com baturalp@gmail.com

V8 engine of Node.js on IA: JavaScript-JITTED x86 machine code mapping profiling support and X87

TUHH Institute of Telematics TUHH Hamburg University of Technology Hamburg University of

Outline Introduction Contribution: Novel Vectorization and Mapping Workflow.

Microprocessors & Interfacing Input switches Basics of switches Keypads