Serverless Performance on a Budget Erwin van Eyk The central - PowerPoint PPT Presentation

Serverless Performance on a Budget Erwin van Eyk

The central trade-off in serverless computing High Performance “Infinite” scaling High availability Low latency � 2

The central trade-off in serverless computing High Performance Low Cost No costs when idle “Infinite” scaling High availability No operational cost Low latency Granular billing � 2

The central trade-off in serverless computing High Performance Low Cost No costs when idle “Infinite” scaling High availability No operational cost Low latency Granular billing How can we optimize the performance-cost trade-off? � 2

Anatomy of a Functions-as-a-Service (FaaS) platform - Function Configuration - Environment variables - Arguments - Version - Source pointer - ... pods and other resources � 3

Anatomy of a FaaS platform � 4

Anatomy of a FaaS platform: Fission (without optimizations) � 5

Anatomy of a FaaS platform: cold start � 6

Anatomy of a FaaS platform: cold start 0 � 6

Anatomy of a FaaS platform: cold start 1 Trigger function deployment 0 � 6

Anatomy of a FaaS platform: cold start Fetch function metadata 2 1 Trigger function deployment 0 � 6

Anatomy of a FaaS platform: cold start Fetch function metadata 2 3 kubectl create 1 Trigger function deployment 0 � 6

Anatomy of a FaaS platform: cold start Fetch function metadata 2 Wait for K8S to deploy function 3 kubectl create 1 4 Trigger function deployment 0 � 6

Anatomy of a FaaS platform: cold start Fetch function metadata 2 Wait for K8S to deploy function 3 kubectl create 1 4 Trigger function deployment 0 5 Send request � 6

Anatomy of a FaaS platform: cold start Fetch function metadata 2 Wait for K8S to deploy function 3 kubectl create 1 4 Trigger function deployment 6 0 Response 5 Send request � 6

Anatomy of a FaaS platform: cold start Fetch function metadata 2 Wait for K8S to deploy function 3 kubectl create 1 4 Trigger function deployment 6 0 Response 5 7 Send request � 6

Anatomy of a FaaS platform: warm execution � 7

Anatomy of a FaaS platform: warm execution 0 � 7

Anatomy of a FaaS platform: warm execution 0 5 Send request � 7

Anatomy of a FaaS platform: warm execution 6 0 Response 5 Send request � 7

Anatomy of a FaaS platform: warm execution 6 0 Response 5 7 Send request � 7

Cold Start Warm Execution � 8

Cold Start Trigger deployer Warm Execution � 8

Cold Start Trigger Fetch function deployer metadata Warm Execution � 8

Cold Start Trigger Fetch function Deploy Pod deployer metadata Warm Execution � 8

Cold Start Trigger Fetch function Fetch Deploy Pod deployer metadata function Warm Execution � 8

Cold Start Trigger Fetch function Fetch Deploy Pod Deploy function deployer metadata function Warm Execution � 8

Cold Start Trigger Fetch function Fetch Route Deploy Pod Deploy function deployer metadata function request Warm Execution � 8

Cold Start Trigger Fetch function Fetch Route Function Deploy Pod Deploy function deployer metadata function request Execution Warm Execution � 8

Cold Start Trigger Fetch function Fetch Route Function Deploy Pod Deploy function deployer metadata function request Execution Warm Execution Route request � 8

Cold Start Trigger Fetch function Fetch Route Function Deploy Pod Deploy function deployer metadata function request Execution Warm Execution Route Function request Execution � 8

Cold starts matter! Coldstart latency (in ms) over 168 hours 180 ms 500 ms 3600 ms � 9 Wang, Liang, et al. "Peeking Behind the Curtains of Serverless Platforms." 2018 USENIX ATC, 2018.

How do FaaS platforms improve their performance? And, at what cost? 1. Function resource reusing 2. Function runtime pooling 3. Function prefetching 4. Function prewarming � 10

Optimization 1 Function Resource Reusing Trigger Fetch function Fetch Route Function Deploy pod Deploy function deployer metadata function request Execution � 11

Function Isolation vs. Function Reuse Request Response Function Instance Requests Responses Request Response Function Instance Function Instance Request Response Function Instance Full Isolation Full resource reuse � 12

Function resource reusing in practice - Why performance isolation: - Performance variability - In practice: all FaaS platforms reuse resources - Per-user binpacking - Functions are isolated - Function executions share resources � 13

FaaS platform with function reusing � 14

Trade-off: how long to keep functions alive? - To reuse functions we have to keep them alive. - Keep-alive in practice: - AWS: ~6 hours - Google: ~6 hours - Azure: 1-4 days Long keep-alive short keep-alives More warm executions Less idle resources � 15

Optimization 2 Function Runtime Pooling Trigger Fetch function Fetch Route Function Deploy pod Deploy function deployer metadata function request Execution � 16

Function Instance = Runtime + Function - Insight: function instances consist out of two parts - Function-specific code : user-provided business logic. - Runtime: operational logic, monitoring, health checks... - Divide the deployment process into 2 stages: Deploy the runtime → unspecialized runtime or stem cell - Deploy the function to the runtime → specialized function - Function Instance Function Runtime Function Runtime Resources Resources Resources Runtime deployment Function deployment � 17

Resource Pooling - Common in many domains (e.g. thread pools) Fn Instance Fn Instance Fn Instance Fn Instance Fn Runtime Fn Runtime Fn Runtime Fn Runtime Fn Runtime Fn Runtime Fn Runtime Pool of 3 function runtimes 2 function runtimes → function instances Pool rebalancing � 18

FaaS platform with function runtime pooling � 19

Trade-off: how big should the pool? Large pool Minimal pool Handle high concurrency Fast pool exhaustion Minimize pool; less idle resources Increases resource overhead Performance Minimize cost � 20

Optimization 3 Function Prefetching Trigger Fetch function Fetch Route Function Deploy pod Deploy function deployer metadata function request Execution � 21

Function prefetching Fetch function sources proactively and place them near resources to reduce function transfer latency - Software flow has a big impact on cold start durations - Function sources (10s of MBs) have to be retrieved and transferred to the resources - Especially important for geo-distributed and edge use cases - AWS Lambda@edge - Cloudflare Abad, Cristina L. et al. "Package-Aware Scheduling of FaaS Functions." Companion of the 2018 ACM/SPEC International � 22 Conference on Performance Engineering. ACM, 2018.

Prefetching Remote Storage Cluster-level Rack/Machine-level Function-level

Prefetching Higher latency Less storage costs Remote Storage Cluster-level Rack/Machine-level Function-level Lower latency More storage costs

FaaS platform with prefetching � 25

Optimization 4 Function Prewarming Trigger Fetch function Fetch Route Function Deploy pod Deploy function deployer metadata function request Execution � 26

Function prewarming Anticipate function executions by deploying functions predictively. - Prewarming or predictive scheduling in other domains: - CPU branch predictor - Proactive autoscalers - Predictive caches van Eyk, Erwin, et al. "A SPEC RG CLOUD Group's Vision on the Performance Challenges of FaaS Cloud Architectures." � 27 Companion of the 2018 ACM/SPEC International Conference on Performance Engineering. ACM, 2018.

� 28

Predicting function executions is hard... Active field of research (autoscaling, predictive caches…) Common approaches 1. Runtime analysis - Rule-based - Pattern recognition and machine learning - Artificial intelligence 2. Exploit additional information of functions - Dependency knowledge in function compositions - Interval triggers � 29

... and involves a trade-off. Optimistic prewarming Pessimistic prewarming Low threshold High threshold Misprediction: no prewarm Misprediction: resources wasted Ping hack More performance due to prewarming Less costs due to less mispredicted prewarming � 30

⼽戉弗 Function composition... - Connect existing functions into complex function compositions - Workflow engine takes care of the plumbing and provides fully monitorable, fault-tolerant function compositions with low overhead. Sequential execution image-recognizer translate-text validate-image image-resizer combine-image-text Parallel execution � 31

...with prewarming Fission Workflows supports horizon-based prewarming Finished Started Prewarmed Not started � 32

Serverless Performance on a Budget Erwin van Eyk The central - PowerPoint PPT Presentation

Serverless Performance on a Budget Erwin van Eyk The central trade-off in serverless computing High Performance Infinite scaling High availability Low latency 2 The central trade-off in serverless computing High Performance Low

Serverless On Your Own Terms Using Knative Context Serverless more than Function Serverless

How Serverless Changes the IT Department Paul Johnston Opinionated Serverless Person

Serverless Gardens IoT + Serverless johncmckim.me twitter.com/@johncmckim

Stateful Serverless Sean Walsh @SeanWalshEsq We predict that Serverless Computing will grow

Kotlin Serverless Framework Vladislav Tankov What is serverless? cloud-computing execution model

Databases Gone Serverless Alkin Tezuysal (@ask_dba) Sr. Technical Manager, Percona Who am I?

Lunch and Learn John McKim @johncmckim Software Engineer A Cloud Guru Serverless Framework

Serverless Boom or Bust? An Analysis of Economic Incentives Xiayue Charles Lin, Joseph E.

Serverless Python Serverless Python Michael Bright , Trainer @mjbright Consulting , Trainer

Catalyst Ubers Serverless Platform Shawn Burke - Staff Engineer Uber Seattle Why Serverless?

Unikernels and Event-driven Serverless Platforms Madhuri Yechuri Agenda Bio Application

FaaS You Like It! @ewanslater Serverless CNCF Definition Serverless computing refers to

The Serverless PHP Application Rob Allen LaravelConf Taiwan 2020 Serverless? Rob Allen ~

cloudstate.io serverless 2.0 with cloudstate Sean Walsh | Field CTO and Cloud Evangelist @

Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud

Kotless Kotlin Serverless Framework Vladislav Tankov @vdtankov October 15, 2020 Introduction

Automatic Trigger Generation for Rule-based Smart Homes ACM SIGPLAN PLAS, Vienna, Austria

Interactive data visualization and reporting Dr. etinkaya-Rundel 2018-04-18 Project tips

VHDL 3 Sequential Logic Circuits Reference: Roth/John Text: Chapter 2 VHDL Process

Trigger and DAQ at LHC Trigger and DAQ at LHC C.Schwick Contents Contents INTRODUCTION The

Reducing Power with Activity Trigger Analysis k + , Julien Legriel , Erwan Piriou # ,

An Analysis of 200,000 IFTTT Recipes Blase Ur, Melwyn Pak Yong Ho, Stephen Brawner, Jiyun Lee,

Event-Triggered Control Design with Performance Barrier Pio Ong and Jorge Cort es Mechanical

Exploring Algorithmic Solutions in Software and Firmware for the CMS L1 Trigger In Preparation