F AASM : Lightweight Isolation for Efficient Stateful Serverless - PowerPoint PPT Presentation

F AASM : Lightweight Isolation for Efficient Stateful Serverless Computing Simon Shillaker and Peter Pietzuch Large-scale Data and Systems Group, Imperial College London

Serverless Big Data Vision Serverless functions Application Big data 10101011 000010001 10101011 00100010 + 000010001 00100010 😄 💼 Cheap, highly scalable big data processing 2

Serverless Under the Hood Problem 2: Inefficient state sharing Function Host Problem 1: Isolation overhead 101010110 101010110 101010110 00010001 00010001 00010001 00100010 00100010 00100010 Container State in external storage 10101011 000010001 00100010 Local copy of data 101010110 101010110 101010110 00010001 00010001 00010001 00100010 00100010 00100010 3 Images: AWS , Azure , GCP , OpenWhisk

Problem 1: Isolation Overhead Per tenant isolation, i.e. sharing containers E.g. PyWren , Jonas et al., SoCC ‘17; Crucial , Barcelona et al., Middleware ‘19 ✅ Spreads isolation overhead ❌ Loses fine-grained scaling Snapshots and restore E.g. SOCK , Oakes et al., ATC ‘18; SEUSS , Cadden et al., Eurosys ‘20; Catalyzer , Du et al., ASPLOS ‘20 ✅ Low initialisation time ❌ Same memory footprint Software-based Isolation E.g. “Micro” services , Boucher et. al, ATC ‘18; Cloudflare Workers ; Fastly Terrarium ✅ Low overheads ❌ No resource isolation 4

Problem 2: Inefficient State Sharing Add extra services to containers E.g. Cloudburst , Sreekanti et al., arXiv ‘20; SAND , Akkus et al., ATC ‘18 ✅ Reduces network overhead ❌ Still duplicates locally, increases isolation overhead Execute functions on external storage E.g. Shredder , Zhang et al., SoCC ‘19 ✅ Moves code to data ❌ Does not replicate across hosts Make external storage faster E.g. Pocket , Klimovic et al., OSDI ‘18 ✅ Reduces latency ❌ Still not sharing 5

How Do We Efficiently Share State But Maintain Isolation? 👺 101010110 101010110 101010110 00010001 00010001 00010001 00100010 00100010 00100010 We need an isolation mechanism that gives us fine-grained control over memory 6

Software-Fault Isolation with WebAssembly WebAssembly - Lightweight memory safety - Used by Fastly, Cloudflare, Krustlet Challenges: - Relax isolation to share memory at runtime - Virtualisation between functions and host resources 7

Two-Tier State - Distribution and Locally-Shared State Local tier Shared memory 101010110 00010001 00100010 Two-tier state Global tier 10101011 Cross-host synchronisation 000010001 00100010 101010110 00010001 00100010 Challenges: - Hide complexity from the user - Minimise synchronisation - Schedule to optimise co-location 8

Faasm: Lightweight Isolation for Efficient Stateful Serverless Computing ฀฀ Faaslet isolation Shared memory 101010110 00010001 regions 00100010 Global synchronisation 10101011 000010001 Proto-Faaslet 00100010 snapshots https://github.com/lsds/Faasm 101010110 00010001 00100010 9

Faasm: Lightweight Isolation for Efficient Stateful Serverless Computing Problem 1: Isolation overheads Faaslets - lightweight isolation based on WebAssembly Host interface - minimal serverless-specific virtualisation Proto-Faaslets - 500μs initialisation, 90kB memory Problem 2: Inefficient state sharing Faaslet shared regions - shared memory without breaking isolation Two-tier state - global synchronisation 10

WebAssembly - memory safety with fine-grained control Offset: +0 +stack_base +heap_base +heap_top +heap_top Stack Data Heap <=4GB std::vector<uint8_t> wasmMemory; WebAssembly memory model 12

Memory safety and resource isolation Faaslet Thread + cgroup Network namespace Memory safety (WebAssembly) Virtual net interface WASI capabilities Host interface Filesystem Faaslet multi-tenant isolation 13

Minimal Virtualisation for Serverless and POSIX applications Category Sub-category API Chaining chain_call() , await_call() , ... Serverless State get_state() , set_state() , ... Dynamic Linking dlopen() , dlsym() , ... Memory mmap() , brk() , ... POSIX Network socket() , connect() , bind() , ... File I/O open() , close() , read() , ... The Faaslet Host Interface 14

Proto-Faaslets - Host-Independence, μs Restore, kBs Memory Footprint Faasm host A Stack Data Heap Function table Proto-Faaslet cache (copy-on-write memory) .wasm .o Proto-Faaslet store Capture complete execution state Support arbitrarily initialisation code E.g. pre-initialised language runtime CPython in <1ms Faasm host B Proto-Faaslet snapshot and restore 15

Two-Tier State Architecture Top-Down View 1. F AASM programming model 2. Faaslet shared memory regions Local tier 101010110 00010001 00100010 3. Two-tier push-pull 4. Serialisation-free data transfer Global tier 10101011 000010001 00100010 101010110 00010001 00100010 17

F AASM Programming Model - Distributed Machine Learning (SGD) High-level Object-Oriented abstractions t_a = SparseMatrixReadOnly("training_a") Read-only matrices t_b = MatrixReadOnly("training_b") Asynchronous vector weights = VectorAsync("weights") Flexible consistency @serverless_func def weight_update(idx_a , idx_b): for col_idx , col_a in t_a.columns[idx_a:idx_b] : Standard Programming constructs col_b = t_b.columns[col_idx] adj = calc_adjustment(col_a , col_b) Transparent optimisations Direct access to shared memory for val_idx , val in col_a.non_nulls (): weights[val_idx] += val * adj if iter_count % threshold == 0: Intuitive mark-up weights.push() Function annotation @serverless_func Fork-join parallelism def sgd_main(n_workers , n_epochs): for e in n_epochs: args = divide_problem(n_workers) c = chain(weight_update, n_workers, args) await_all(c) 18

Shared Memory Without Breaking Safety Guarantees +A 0 +A+S Offset: Faaslet A Proc. memory A B S Faaslet B 0 +B +B+S Faaslet Shared Memory Regions 19

Push-pull - Global Synchronisation with Variable Consistency Host A F1: F2: Local tier “state_x”: 011100100 PUSH(“state_x”) Global tier “state_x”: 011100100 Host B PULL(“state_x”) “state_x”: 011100100 F3: Two-Tier Push-Pull 20

Serialisation-Free Transfer of Arbitrarily Complex Data Structures Host A Host B F1 F2 F3 F4 A B B C1 C2 k B : k A : k C : A B C Sub-arrays Byte arrays Distributed KVS Faasm’s serialisation-free state 21

Evaluation Questions: 1. How do Faaslets compare to containers? 2. Can F AASM improve efficiency and performance of ML training? 3. Can F AASM improve throughput of ML inference? 4. Does Faaslet isolation affect performance of dynamic languages? Comparison: - Knative running identical code - Code compiled natively for Knative - Code compiled to WebAssembly for F AASM 22 Image: Knative

How do Faaslet Overheads Compare to Containers? Lower overheads mean lower latency and lower costs Docker (alpine) Faaslets Proto-Faaslets vs. Docker Initialisation 2.8s 5.2ms 0.5ms 5.6K x CPU cycles 251M 1.4K 650 385K x Memory Footprint 1.3MB 200KB 90KB 15 x Density ~8K ~70K >100K 12 x 23

How do Faaslets “Churn” Compared to Containers? Higher churn means higher utilisation of shared infrastructure High Churn 1000x increase in max throughput 5000x reduction in latency 24

Can Faasm Improve Efficiency and Performance of Parallel ML Training? Parallel processing on co-located data reduces training time Faster training with increasing parallelism 80% reduction in training time Knative hosts restricted by memory pressure 25

Can Faasm Improve Efficiency and Performance of Parallel ML Training? Reduced data shipping reduces costs Reduced network transfers 60% reduction in network transfers Reduction increases with higher parallelism 26

Can Faasm Improve Throughput and Reduce Latency Serving ML Inference? Proto-Faaslets increase max throughput and reduce latency Increased Throughput Decreased tail latency Negligible cold starts with Proto-Faaslets 90% reduction in tail latency 120% increase in max throughput with 5% cold starts 27

Does Faaslet Isolation Affect Performance of Dynamic Languages? Faaslet isolation has negligible impact on a distributed Python application Comparable performance Faaslet isolation shows no significant overhead Effect persists with increasing matrix size 28

F AASM : Lightweight Isolation for Efficient Stateful Serverless - PowerPoint PPT Presentation

F AASM : Lightweight Isolation for Efficient Stateful Serverless Computing Simon Shillaker and Peter Pietzuch Large-scale Data and Systems Group, Imperial College London Serverless Big Data Vision Serverless functions Application Big data

Mesos Go Stateful An Abstraction for frameworks running stateful workload Dhilip & Amit -

GCC Highlighted Products GSure Gel Extraction kit GSure Soil DNA Isolation kit GSure Sputum DNA

Serializable Snapshot Isolation Making ISOLATION LEVEL SERIALIZABLE Provide Serializable

Stateful access control using LSM CS547 Thomas Uphill Stateful access cont rol using LSM 11

Scalable Verification of Stateful Networks Aurojit Panda, Ori Lahav, Katerina Argyraki, Mooly

2 nd AASM (25-28 Feb 2017 Isfahan, Iran) Saturday 25 th Feb 2017 ( 8:30-18:30) 8.30 -9:30

Stateful Firewalls Hank and Foo Types of firewalls Packet filter (stateless) Proxy

The lightweight beam for Heavyweight applications The impact of this lightweight beam concept

The lightweight beam for Heavyweight applications The impact of this lightweight steel beam will

Its time to Think Lightweight! www.thinklightweight.com TO D A Y S TO P IC S 1.

Lightweight Cryptography and and RFID Security Svetla Nikova COSIC KUL COSIC, KULeuven and

Efficient Software-Based Fault Isolation Robert Wahbe Steven Lucco Thomas E. Anderson Susan L.

Harmonizing Performance and Isolation in Microkernels with Efficient Intra-kernel Isolation and

Introduction to pixel track isolation The purpose of track isolation algorithm is an additional

ADAPTED SPAULDING PYRAMID Making Isolation: How does it work? Patient Isolation- Creating

Isolation trees Alastair Rushworth Data Scientist DataCamp Anomaly Detection in R Isolation

EECS 591 D ISTRIBUTED S YSTEMS Manos Kapritsos Fall 2020 Slides by: Lorenzo Alvisi PBFT: A B

Collecting & Analyzing Twitter data an Introduction Viktoria Spaiser UAF in Political

Routing Trillions of Events Per Day @Twitter #ApacheBigData 2017 Lohit VijayaRenu & Gary

Crawling Twitter Data Konstantinos Semertzidis ksemer@cs.uoi.gr What types of information can we

Equal Representation in Two-tier Voting Systems Nicola Maaser and Stefan Napel Economics

Domain-Driven Design SWEN-261 Introduction to Software Engineering Department of Software

Reece Peterson, Professor Emeritus University of Nebraska-Lincoln Todays To explain the

A 5G Paradigm Based on Two-Tier Physical Network Architecture Elvino S. Sousa Jeffrey Skoll

F AASM : Lightweight Isolation for Efficient Stateful Serverless - PowerPoint PPT Presentation

F AASM : Lightweight Isolation for Efficient Stateful Serverless Computing Simon Shillaker and Peter Pietzuch Large-scale Data and Systems Group, Imperial College London Serverless Big Data Vision Serverless functions Application Big data

Mesos Go Stateful An Abstraction for frameworks running stateful workload Dhilip &amp; Amit -

GCC Highlighted Products GSure Gel Extraction kit GSure Soil DNA Isolation kit GSure Sputum DNA

Serializable Snapshot Isolation Making ISOLATION LEVEL SERIALIZABLE Provide Serializable

Stateful access control using LSM CS547 Thomas Uphill Stateful access cont rol using LSM 11

Scalable Verification of Stateful Networks Aurojit Panda, Ori Lahav, Katerina Argyraki, Mooly

2 nd AASM (25-28 Feb 2017 Isfahan, Iran) Saturday 25 th Feb 2017 ( 8:30-18:30) 8.30 -9:30

Stateful Firewalls Hank and Foo Types of firewalls Packet filter (stateless) Proxy

The lightweight beam for Heavyweight applications The impact of this lightweight beam concept

The lightweight beam for Heavyweight applications The impact of this lightweight steel beam will

Its time to Think Lightweight! www.thinklightweight.com TO D A Y S TO P IC S 1.

Lightweight Cryptography and and RFID Security Svetla Nikova COSIC KUL COSIC, KULeuven and

Efficient Software-Based Fault Isolation Robert Wahbe Steven Lucco Thomas E. Anderson Susan L.

Harmonizing Performance and Isolation in Microkernels with Efficient Intra-kernel Isolation and

Introduction to pixel track isolation The purpose of track isolation algorithm is an additional

ADAPTED SPAULDING PYRAMID Making Isolation: How does it work? Patient Isolation- Creating

Isolation trees Alastair Rushworth Data Scientist DataCamp Anomaly Detection in R Isolation

EECS 591 D ISTRIBUTED S YSTEMS Manos Kapritsos Fall 2020 Slides by: Lorenzo Alvisi PBFT: A B

Collecting &amp; Analyzing Twitter data an Introduction Viktoria Spaiser UAF in Political

Routing Trillions of Events Per Day @Twitter #ApacheBigData 2017 Lohit VijayaRenu &amp; Gary

Crawling Twitter Data Konstantinos Semertzidis ksemer@cs.uoi.gr What types of information can we

Equal Representation in Two-tier Voting Systems Nicola Maaser and Stefan Napel Economics

Domain-Driven Design SWEN-261 Introduction to Software Engineering Department of Software

Reece Peterson, Professor Emeritus University of Nebraska-Lincoln Todays To explain the

A 5G Paradigm Based on Two-Tier Physical Network Architecture Elvino S. Sousa Jeffrey Skoll

Mesos Go Stateful An Abstraction for frameworks running stateful workload Dhilip & Amit -

Collecting & Analyzing Twitter data an Introduction Viktoria Spaiser UAF in Political

Routing Trillions of Events Per Day @Twitter #ApacheBigData 2017 Lohit VijayaRenu & Gary