Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider
Mohammad Shahrad, Rodrigo Fonseca, Íñigo Goiri, Gohar Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, Mark Russinovich, Ricardo Bianchini
What is Serverless?
- Very attractive abstraction:
- Pay for Use
- Infinite elasticity from 0 (and back)
- No worry about servers
- Provisioning, reserving, configuring, patching, managing
- Most popular offering: Function-as-a-Service (FaaS)
- Bounded-time functions with no persistent state among invocations
- Upload code, get an endpoint, and go
For the rest of this talk, Serverless = Serverless FaaS
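As an illustration of how little a FaaS function involves, here is a minimal sketch of a handler in the AWS Lambda style for Python; the handler name and payload fields are made up for this example:

```python
# Minimal FaaS handler sketch (AWS Lambda-style Python). Stateless across
# invocations, bounded in time, and exposed through a provider-managed endpoint.
import json

def handler(event, context):
    # 'event' carries the request payload; 'context' carries runtime metadata.
    name = event.get("name", "world")
    return {
        "statusCode": 200,
        "body": json.dumps({"message": f"hello, {name}"}),
    }
```

The provider decides when and where to instantiate this code; the developer never touches a server.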
What is Serverless?
| | Bare Metal | VMs (IaaS) | Containers | Functions (FaaS) |
|---|---|---|---|---|
| Unit of Scale | Server | VM | Application/Pod | Function |
| Provisioning | Ops | DevOps | DevOps | Cloud Provider |
| Init Time | Days | ~1 min | Few seconds | Few seconds |
| Scaling | Buy new hardware | Allocate new VMs | 1 to many, auto | 0 to many, auto |
| Typical Lifetime | Years | Hours | Minutes | O(100 ms) |
| Payment | Per allocation | Per allocation | Per allocation | Per use |
| State | Anywhere | Anywhere | Anywhere | Elsewhere |
Serverless
“…more than 20 percent of global enterprises will have deployed serverless computing technologies by 2020.” Gartner, Dec 2018
Serverless
Source: CNCF Cloud Native Interactive Landscape https://landscape.cncf.io/format=serverless
Serverless
“… we predict that (…) serverless computing will grow to dominate the future of cloud computing.” December 2019
So what are people doing with FaaS?
- Many simple things
- ETL workloads
- IoT data collection / processing
- Stateless processing
- Image / Video transcoding
- Translation
- Check processing
- Serving APIs, Mobile/Web Backends
- Interesting Explorations
- MapReduce (pywren)
- Linear Algebra (numpywren)
- ExCamera
- gg: “burst-parallel” applications
- ML training
- Limitations
- Communication
- Latency
- Lack of locality
- State management
What is Serverless?
- Very attractive abstraction:
- Pay for Use
- Infinite elasticity from 0 (and back)
- No worry about servers
- Provisioning, reserving, configuring, patching, managing
If you are a cloud provider…
- A big challenge
- You do worry about servers!
- Provisioning, scaling, allocating, securing, isolating
- Illusion of infinite scalability
- Optimize resource use
- Fierce competition
- A bigger opportunity
- Fine grained resource packing
- Great space for innovation and for capturing new applications and new markets
Cold Starts
- Typically range from 0.2 s to a few seconds1,2
[Chart: cold-start latencies for OpenWhisk, Azure Functions, and AWS Lambda]
1https://levelup.gitconnected.com/1946d32a0244 2https://mikhail.io/serverless/coldstarts/big3/
Cold Starts and Resource Wastage
Removing a function instance from memory after each invocation causes cold starts; keeping function instances in memory indefinitely wastes memory. How do we balance the two?
Stepping Back: Characterizing the Workload
- How are functions accessed?
- What resources do they use?
- How long do functions take?
Data: two weeks of all invocations to Azure Functions in July 2019. This is the first characterization of the workload of a large serverless provider. A subset of the traces is available for research: https://github.com/Azure/AzurePublicDataset
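A small sketch of working with the released trace, assuming the per-minute invocation layout of the AzurePublicDataset (one row per function, hashed identifiers such as HashApp, and 1440 per-minute count columns); the file name is illustrative and the column names should be checked against the dataset's documentation:

```python
# Sketch: per-app average invocation rates from one day of the invocation trace.
import pandas as pd

df = pd.read_csv("invocations_per_function_md.anon.d01.csv")   # illustrative file name
minute_cols = [c for c in df.columns if c.isdigit()]           # the 1440 per-minute counts

# Sum functions into apps, then average over the day's minutes.
per_app_rate = df.groupby("HashApp")[minute_cols].sum().sum(axis=1) / len(minute_cols)

rare = per_app_rate < 1.0        # apps invoked less than once per minute on average
print(f"{rare.mean():.1%} of apps, "
      f"{per_app_rate[rare].sum() / per_app_rate.sum():.2%} of invocations")
```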
Invocations per Application
- 18% of apps are invoked more than once per minute, accounting for 99.6% of invocations!
- 82% of apps are invoked less than once per minute, accounting for only 0.4% of invocations
This graph is from a representative subset of the workload. See paper for details.
Apps are highly heterogeneous
What about memory?
If we wanted to keep all apps warm…
[CDF: cumulative fraction of total memory (allocated and physical) vs. fraction of least-invoked apps]
- 82% of apps (0.4% of invocations) account for 40% of all physical memory and 60% of virtual memory
- 90% of apps (1.05% of invocations) account for 50% of all physical memory
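A sketch of how such a curve is computed: sort apps from least to most invoked and accumulate their memory. The data below is synthetic; with the real trace, `per_app_rate` would come from the invocation files and `per_app_mem` from the memory files:

```python
# Sketch: cumulative memory fraction vs. fraction of least-invoked apps.
import numpy as np
import pandas as pd

# Synthetic stand-ins for per-app invocation rate (inv/min) and memory (MB).
rng = np.random.default_rng(0)
n_apps = 10_000
per_app_rate = pd.Series(rng.lognormal(mean=-2.0, sigma=2.5, size=n_apps))
per_app_mem = pd.Series(rng.lognormal(mean=5.0, sigma=0.5, size=n_apps))

order = per_app_rate.sort_values().index                  # least-invoked apps first
mem_cdf = per_app_mem.loc[order].cumsum().values / per_app_mem.sum()

idx = int(0.82 * n_apps) - 1                              # memory share of the 82% least-invoked apps
print(f"82% least-invoked apps -> {mem_cdf[idx]:.0%} of total memory")
```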
Function Execution Duration
[CDF of per-app execution times (minimum, average, maximum) with a log-normal fit; times range from 1 ms to 1 h]
- Executions are short
- 50% of apps run for at most 0.67 s on average
- 75% of apps run for at most 10 s at their maximum
- These times are at the same scale as cold-start times1,2
1https://levelup.gitconnected.com/1946d32a0244 2https://mikhail.io/serverless/coldstarts/big3/
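A small sketch of fitting a log-normal to execution durations, as the fitted curve on this slide suggests; the durations below are synthetic stand-ins and scipy is used purely for illustration:

```python
# Sketch: fit a log-normal distribution to (synthetic) per-app execution times.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
durations_s = rng.lognormal(mean=-0.4, sigma=1.5, size=5_000)   # synthetic stand-in

shape, loc, scale = stats.lognorm.fit(durations_s, floc=0)      # fix location at zero
print(f"fitted median: {stats.lognorm.median(shape, loc=loc, scale=scale):.2f}s")
```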
Key Takeaways
- Highly concentrated accesses
- 82% of the apps are accessed <1/min on average
- Correspond to 0.4% of all accesses
- But in aggregate would take 40% of the service memory if kept warm
- Arrival processes are highly variable
- Execution times are short
- Same order of magnitude as cold-start times
Cold Starts and Resource Wastage
Removing a function instance from memory after each invocation causes cold starts; keeping function instances in memory indefinitely wastes memory.
[Recap: the memory CDF and the execution-duration CDF from the characterization]
What do serverless providers do?
- AWS Lambda: fixed 10-minute keep-alive
- Azure Functions: fixed 20-minute keep-alive
[Charts: cold-start probability vs. time since last invocation (minutes)]
Source: Mikhail Shilkov, Cold Starts in Serverless Functions, https://mikhail.io/serverless/coldstarts/
Fixed Keep-Alive Policy
Results from simulating the entire workload for a week.
[Trade-off curve: a longer keep-alive yields fewer cold starts but more wasted memory]
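A minimal simulation sketch of this trade-off, assuming execution time is negligible and ignoring concurrency; it only illustrates the mechanism and is not the paper's simulator:

```python
# Sketch: cold starts vs. warm-but-idle memory time under a fixed keep-alive.
import numpy as np

def fixed_keep_alive(times, keep_alive):
    """times: sorted invocation times (s). Returns (cold-start fraction,
    seconds an instance sat warm in memory without being used)."""
    cold, idle = 1, 0.0                    # the first invocation is always cold
    for prev, cur in zip(times, times[1:]):
        gap = cur - prev
        if gap <= keep_alive:
            idle += gap                    # warm start: resident but idle for `gap`
        else:
            cold += 1                      # evicted after the window, then cold start
            idle += keep_alive             # the whole keep-alive window was wasted
    return cold / len(times), idle

# Example: an app invoked on average every 11 minutes, under different windows.
rng = np.random.default_rng(0)
times = np.cumsum(rng.exponential(11 * 60, size=1000))
for ka in (5 * 60, 10 * 60, 20 * 60):
    frac, idle = fixed_keep_alive(times, ka)
    print(f"keep-alive {ka // 60:>2} min: cold starts {frac:.1%}, idle-resident {idle / 3600:.1f} h")
```

Sweeping the keep-alive length traces out the curve on this slide: fewer cold starts at the cost of more idle-resident memory time.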
Fixed Keep-Alive Won't Fit All
[Timelines under a 10-minute fixed keep-alive]
- An app invoked every 8 minutes always finds a warm instance (warm starts).
- An app invoked every 11 minutes always finds its instance evicted (cold starts).
Fixed Keep-Alive Is Wasteful
[Timeline: 10-minute fixed keep-alive, app invoked every 8 minutes]
Even when every start is warm, the function image is kept in memory but not used for nearly the entire interval; with second-long executions every 8 minutes, the image sits idle well over 99% of the time.
Hybrid Histogram Policy
- Adapts to each application
- Pre-warms in addition to keeping alive
- Lightweight implementation
A Histogram Policy To Learn Idle Times
- Idle Time (IT): the time between the end of one invocation and the start of the next.
- Track each app's ITs in a histogram with minute-long bins and a limited number of bins (e.g., 240 bins for 4 hours).
- Pre-warm: load the app shortly before the 5th percentile of its IT distribution.
- Keep-alive: keep the app in memory until the 99th percentile of its IT distribution.
[Histogram: frequency vs. Idle Time (IT), with the pre-warm and keep-alive windows marked at the 5th and 99th percentiles]
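A sketch of the per-app bookkeeping under these assumptions (minute-long bins, 5th and 99th percentile windows); the class name, bin count, and percentile handling are illustrative rather than the production implementation:

```python
# Sketch: per-app idle-time histogram and the windows derived from it.
import numpy as np

class ITHistogram:
    def __init__(self, n_bins=240):                    # e.g., 240 one-minute bins = 4 hours
        self.bins = np.zeros(n_bins, dtype=np.int64)
        self.oob = 0                                   # idle times beyond the last bin

    def observe(self, idle_minutes):
        b = int(idle_minutes)
        if b < len(self.bins):
            self.bins[b] += 1
        else:
            self.oob += 1

    def windows(self, head=0.05, tail=0.99):
        """Return (pre-warm, keep-alive) in minutes from the 5th/99th percentiles."""
        total = self.bins.sum()
        if total == 0:
            return 0, len(self.bins)                   # no data yet: keep alive for the full range
        cdf = np.cumsum(self.bins) / total
        pre_warm = int(np.searchsorted(cdf, head))     # stay unloaded until shortly before ITs start
        keep_alive = int(np.searchsorted(cdf, tail)) + 1
        return pre_warm, keep_alive

hist = ITHistogram()
for it in (8, 9, 7, 8, 8, 9):                          # observed idle times, in minutes
    hist.observe(it)
print(hist.windows())                                  # (7, 10) for this toy history
```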
The Hybrid Histogram Policy
On each new invocation, update the app's IT distribution, then pick one of three modes:
- Too many out-of-bound (OOB) ITs? Use a time-series forecast (ARIMA); we can afford to run complex predictors given the low arrival rate, and a histogram covering such long idle times would be too wasteful.
- Otherwise, if the IT pattern is significant, use the IT distribution (histogram) to set the pre-warm and keep-alive windows.
- Otherwise, be conservative (standard keep-alive).
ARIMA: Autoregressive Integrated Moving Average
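A sketch of that decision, reusing the illustrative ITHistogram from the previous sketch and statsmodels' ARIMA; the thresholds and the window placed around the forecast are assumptions for illustration, not the paper's exact values:

```python
# Sketch: pick pre-warm/keep-alive windows per the hybrid policy's three cases.
from statsmodels.tsa.arima.model import ARIMA

def choose_windows(hist, recent_its, oob_threshold=0.5, min_samples=10,
                   default_keep_alive=10):
    """hist: an ITHistogram; recent_its: recent idle times in minutes."""
    total = hist.bins.sum() + hist.oob
    if total and hist.oob / total > oob_threshold:
        # Histogram cannot represent these long idle times: forecast the next one.
        fit = ARIMA(recent_its, order=(1, 0, 0)).fit()
        next_it = float(fit.forecast(1)[0])
        return max(0, int(next_it) - 1), int(next_it) + 1   # narrow window around the forecast
    if total >= min_samples:
        return hist.windows()              # the pattern is significant: trust the histogram
    return 0, default_keep_alive           # too little data: conservative fixed keep-alive
```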
A Better Pareto Frontier
The hybrid histogram policy achieves a better cold-start vs. wasted-memory trade-off than fixed keep-alive policies.
Implemented in OpenWhisk
[OpenWhisk architecture: REST interface, controller with load balancer, distributed messaging, invokers managing containers, and a distributed database]
- Open-source, industry-grade (the basis of IBM Cloud Functions)
- Functions run in Docker containers
- Uses a 10-minute fixed keep-alive by default
- We built a distributed setup with 19 VMs
[CDF of per-app cold-start percentage: hybrid vs. fixed 10-minute keep-alive, in simulation and on the experimental deployment]
4-Hour Hybrid Histogram
- Latency overhead: < 1 ms (835.7 µs)
- Container memory reduction: 15.6%
- Average execution time reduction: 32.5%
- 99th-percentile execution time reduction: 82.4%
Closing the loop
- First serverless characterization from a provider’s point of view