Unifying network filtering rules for the Linux kernel with eBPF - PowerPoint PPT Presentation

FOSDEM’19 • Brussels, 2019-02-02 Unifying network filtering rules for the Linux kernel with eBPF Quentin Monnet <quentin.monnet@netronome.com> @qeole

Outline Several network filtering mechanisms in the Linux kernel What are they, and what do they do? How are they used? Latest addition: eBPF What does it bring to filter networking? Increasing number of convergence leads between the different models What are the objectives? How can they be unified? Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 2/22

Some network filtering mechanisms in the Linux kernel Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 3/22

Netfilter (iptables/nf_tables) Framework for packet filtering (firewall), NAT Often the default choice for dropping flows Several front-end components (ebtables, arptables, iptables, ip6tables, nf_tables, conntrack) Back-end: Netfilter nf_tables successor to iptables: more flexible, more efficient Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 4/22

Traffic Control filters (tc, iproute2) TC framework for Traffic Control in the kernel: traffic shaping, scheduling, policing, dropping “Queueing disciplines” (qdisc), possibly applied to “classes” Filters are used to dispatch packets into the different classes (Traffic control mostly applies to egress traffic, but filters also usable for ingress) Framework actually using a variety of filters: Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 5/22 • basic (ematch, “extended match”) • flow • flower • u32 • [bpf] • Specific filters: fw, route, rsvp, tcindex

Hardware filters (ethtool) “Receive network flow classification”: Hardware filters Main objective: flow steering, but able to drop flows Needs hardware support, not all NICs have it Rules set with ethtool -U (ioctl) Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 6/22

pcap-filters, cBPF (e.g. for tcpdump) Facility from the libpcap library Takes an expression and turns it into a filter Output is legacy BPF (cBPF), attached to sockets in the kernel (or run in user space if not on Linux) Used by tcpdump (see tcpdump -i eth0 -d <expr> ) Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 7/22

Filtering hooks Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 8/22 Userspace BPF on socket Net fi lter ingress Net fi lter egress (PREROUTING, INPUT) (OUTPUT, POSTROUTING) Kernel stack TC ingress TC egress Kernel Driver Hardware fi lters (set up with ethtool) Hardware (NIC)

Many rule syntaxes Example rule: Drop incoming IP(v4) HTTP packets # iptables -A INPUT -i eth0 \ -p tcp --dport 80 -j drop # nft add rule ip filter input iif eth0 \ tcp dport 80 drop # tcpdump -i eth0 \ ip and tcp dst port 80 # tc filter add dev eth0 ingress flower \ ip_proto tcp dst_port 80 action drop # ethtool -U eth0 \ flow-type tcp4 dst-port 80 action -1 Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 9/22

Many other ways to filter packets The list is not exhaustive Other frameworks are available (many of them out of kernel space) Software switches: Open vSwitch, etc. User space processing: DPDK (rte-flows), firewall apps, etc. P4 as another way to implement switches/filters, compile to target … Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 10/22

Enter eBPF Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 11/22

Introduction to eBPF Generic, efficient, secure in-kernel (Linux) virtual machine Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF In the rest of the presentation: “BPF” means “eBPF” Specific features: Maps, tail calls, helper functions 12/22 Event-based programs injected, verified and attached in the kernel Tracing/Monitoring Sockets TC Kprobe/Uprobe (traf fi c control) Networking XDP Tracepoint (network driver) eBPF Lightweight Tunnel Perf Event Encapsulation Flow Dissector Others to come? Infrared Cgroups Remote Control

BPF hooks for network packet processing Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 13/22 Userspace BPF on socket Net fi lter ingress Net fi lter egress (PREROUTING, INPUT) (OUTPUT, POSTROUTING) Kernel stack TC ingress TC egress BPF as TC fi lter BPF XDP (“generic”) Kernel BPF XDP (driver support) Driver BPF Hardware fi lters (TC/XDP o ffl oad) (set up with ethtool) Hardware (NIC) Agilio SmartNIC

What BPF brings to network filtering BPF is POWER! Programmability (change network processing at runtime) In-kernel verifier: safety, security of the programs JIT (Just-in-time) compiler available for main architectures: speed! Low-level (driver hooks): speed!! Hardware offload: speed!!! Also: Headaches, long nights spent rewriting the filters Additional pain to pass the verifier But keep in mind: BPF is self-contained, well defined, flexible Maybe a good intermediate representation to represent filters? Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 14/22

Convergence of the models Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 15/22

Why unifying? User side: Transparently reuse existing set of rules Benefit from the best of each world: flexibility, ease of use, performance Developer side: Easier to work on a common intermediate representation rather than on a variety of distinct back-ends Better uncoupling of the front- and back-ends Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 16/22

flow_rule infrastructure Work in progress from Pablo Neira Ayuso—No BPF in this one Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF Stop exposing TC front-end details to drivers (easier to add features to TC) Unified IR passed to the driver: avoid having one parser for each ACL front-end Motivation: 17/22 Intermediate representation for ACL hardware offloads Based on Linux flow dissector infrastructure and TC actions Can be used by different front-ends such as HW filters, TC, Netfilter Hardware fi lters TC Net fi lter (via ioctl) (via Netlink) (via Netlink) Userspace Translates native interface representation to fl ow_rule IR fl ow_rule IR Kernel front end Parses fl ow_rule IR Kernel to populate HW IR Hardware IR Driver O ffl oads fi lter as HW IR Hardware (NIC)

bpfilter: BPF-based firewall bpfilter : new back-end for iptables in Linux, based on BPF Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF Improve performance (JIT, offloads) Reuse rules from iptables Motivation: 18/22 Also: proposal for nf_tables to BPF translation on top of bpfilter Rules are translated into a BPF program The iptables binary is left untouched user space, for rule translation Uses a special kernel module launching an ELF executable in a special thread in bp fi lter UMH iptables special thread Userspace translates inject rules rules to eBPF Net fi lter bp fi lter.ko subsystem module Kernel bp fi lter BPF hook

libkefir: a library to convert ACLs to BPF programs Comes as a library, for inclusion in other projects Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF Sorry, not published yet! libkefir : KE rnel FI ltering R ules: Work in progress @ Netronome 19/22 And give BPF-compatible C source code to users, so they can hack it But do not try to handle all cases Turn simple ACL rules into hackable BPF programs Motivation similar to bpfilter: reuse rules, with improved performance ethtool rules Userspace TC fl ower rules libke fi r C source code iptables rules pcap-lib BPF bytecode expressions BPF program attached Kernel

Wrapping up Various frameworks for packet filtering in Linux BPF is one of them, brings new perspectives in terms of programmability, performance, speed, speed and speed Convergence between different models is beginning to emerge: Easier handling of rules for driver developers (flow_rule IR proposal) Reuse of existing rules for users (bpfilter, libkefir) Better performance for those existing set of rules Also, consider: P4 as another approach for convergence—BPF is one target BPF used in other places: Open vSwitch datapath, DPDK eBPF as a heterogeneous processing ABI (LPC 2018) Usage of a DSL for producing BPF programs, but targeted at tracing the Linux kernel: bpftrace Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 20/22

Thank you! Questions Q. Monnet | Unifying network filtering rules for the Linux kernel with eBPF 21/22

Unifying network filtering rules for the Linux kernel with eBPF - PowerPoint PPT Presentation

FOSDEM19 Brussels, 2019-02-02 Unifying network filtering rules for the Linux kernel with eBPF Quentin Monnet <quentin.monnet@netronome.com> @qeole Outline Several network filtering mechanisms in the Linux kernel What are they,

Introduction to Linux Kernel Modules Luca Abeni luca.abeni@santannapisa.it Linux Kernel Modules

Debugging the Linux Kernel with GDB Kieran Bingham Debugging the Linux Kernel with GDB Many

Introduction to Linux Aline Abler Aline Abler Linux, whats that? The pieces of a Linux

Making C Less Dangerous in the Linux Kernel Kees Cook | @keescook LINUX.CONF.AU 21-25 January

Worksheet 9 Worksheet 9 Linux as a router, packet filtering, traffic Linux as a router, packet

Linux Overview Amir Hossein Payberah payberah@gmail.com 1 Agenda Linux Overview Linux

Linux from Sensors to Servers ! When is Linux Not Linux? ! 1 1 Linux runs across a huge range

Filtering Cubemaps Filtering Cubemaps Angular Extent Filtering and Edge Seam Fixup Methods

Traffic Control Mechanisms Filtering Source address filtering Other forms of filtering

Lesson 7 Rate Conversion Filtering and Downsampling interchange Filtering and Upsampling

Linux Kernel Debugging Your kernel just oopsed - What do you do, hotshot? Muli Ben-Yehuda

Intro to Linux Kernel Programming Don Porter Lab 4 You will write a Linux kernel module

Linux Kung Fu Introduction What is Linux? Why Linux? What is the difference between a client

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Linux Kernel Synchronization System Calls Synchronization in Kernel the kernel RCU File

1 Theres a kernel security researcher named Dan Rosenberg whose done a lot of linux kernel

Voice Coding with Opus Koen Vos, Karsten Vandborg Srensen, Sren Skak Jensen, Jean-Marc Valin

Communication Systems RTP-QoS University of Freiburg Computer Science Computer Networks and

End-to-end Methods for Traffic Shaping Detection, Performance Problem Diagnosis, Home Wireless

Using TCP/IP Traffic shaping to achieve iSCSI service predictability Paper presentation Jarle

Congestion Control Jean Walrand www.eecs.berkeley.edu/~ wlr Outline The Problem What is fair?

D&R IP-SoC Days Shanghai Nikos Zervas CEO, CAST Inc. 1 Automotive IP Cores: Challenges

Wireless Networks and Protocols MAP-Tele Manuel P. Ricardo Faculdade de Engenharia da

An Architecture of a Network Controller for QoS Management in Home Networks with Lots of IoT

Unifying network filtering rules for the Linux kernel with eBPF - PowerPoint PPT Presentation

FOSDEM19 Brussels, 2019-02-02 Unifying network filtering rules for the Linux kernel with eBPF Quentin Monnet <quentin.monnet@netronome.com> @qeole Outline Several network filtering mechanisms in the Linux kernel What are they,

Introduction to Linux Kernel Modules Luca Abeni luca.abeni@santannapisa.it Linux Kernel Modules

Debugging the Linux Kernel with GDB Kieran Bingham Debugging the Linux Kernel with GDB Many

Introduction to Linux Aline Abler Aline Abler Linux, whats that? The pieces of a Linux

Making C Less Dangerous in the Linux Kernel Kees Cook | @keescook LINUX.CONF.AU 21-25 January

Worksheet 9 Worksheet 9 Linux as a router, packet filtering, traffic Linux as a router, packet

Linux Overview Amir Hossein Payberah payberah@gmail.com 1 Agenda Linux Overview Linux

Linux from Sensors to Servers ! When is Linux Not Linux? ! 1 1 Linux runs across a huge range

Filtering Cubemaps Filtering Cubemaps Angular Extent Filtering and Edge Seam Fixup Methods

Traffic Control Mechanisms Filtering Source address filtering Other forms of filtering

Lesson 7 Rate Conversion Filtering and Downsampling interchange Filtering and Upsampling

Linux Kernel Debugging Your kernel just oopsed - What do you do, hotshot? Muli Ben-Yehuda

Intro to Linux Kernel Programming Don Porter Lab 4 You will write a Linux kernel module

Linux Kung Fu Introduction What is Linux? Why Linux? What is the difference between a client

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel -means Clustering Manuel

Linux Kernel Synchronization System Calls Synchronization in Kernel the kernel RCU File

1 Theres a kernel security researcher named Dan Rosenberg whose done a lot of linux kernel

Voice Coding with Opus Koen Vos, Karsten Vandborg Srensen, Sren Skak Jensen, Jean-Marc Valin

Communication Systems RTP-QoS University of Freiburg Computer Science Computer Networks and

End-to-end Methods for Traffic Shaping Detection, Performance Problem Diagnosis, Home Wireless

Using TCP/IP Traffic shaping to achieve iSCSI service predictability Paper presentation Jarle

Congestion Control Jean Walrand www.eecs.berkeley.edu/~ wlr Outline The Problem What is fair?

D&amp;R IP-SoC Days Shanghai Nikos Zervas CEO, CAST Inc. 1 Automotive IP Cores: Challenges

Wireless Networks and Protocols MAP-Tele Manuel P. Ricardo Faculdade de Engenharia da

An Architecture of a Network Controller for QoS Management in Home Networks with Lots of IoT

D&R IP-SoC Days Shanghai Nikos Zervas CEO, CAST Inc. 1 Automotive IP Cores: Challenges