Architectures that Scale Deep: Regaining Control in Deep Systems - PowerPoint PPT Presentation

Architectures that Scale Deep: Regaining Control in Deep Systems Ben Sigelman (@el_bhs, bhs@lightstep.com) Co-founder & CEO: LightStep Co-creator: OpenTracing, OpenTelemetry, Google Dapper, Google Monarch QCon SF, November 2019

Part I Scaling, and Deep Systems

What is scale, anyway?

Scaling wide

Scaling deep

How does this look for software?

Software: Scaling wide

Software: Scaling deep

How do real-world systems look?

Microservices at scale aren’t just wide systems , they’re deep systems

Deep Systems Deep Systems Architectures with ≥ 4 layers of Architectures with ≥ 4 layers of independently operated services independently operated services (including external/cloud dependencies) (including external/cloud dependencies)

What do deep systems sound like?

What do deep systems sound like? “Don’t deploy on Fridays”

What do deep systems sound like? “Where’s Chris?! I’m dealing with a P0 and they’re the only one who knows how to debug this.”

What do deep systems sound like? “It can’t be our fault, our dashboard says we’re healthy”

What do deep systems sound like? “Kafka is on fire”

What do deep systems sound like? “I need 100% availability from your team. One hundred percent .”

What do deep systems sound like? “I didn’t know I depended on that region”

What do deep systems sound like? “That was on a dashboard but I can’t find it”

What do deep systems sound like? Lots of challenges: - People-management - Security - Multi-tenancy - “Big-customer” success - Performance - Observability

Part II Control Theory: TL;DR Edition

Why do we care so much about observability , anyway?

Inputs Outputs A System … and its state vector,

Observability Inputs Outputs A System … and its state vector, How well can you infer internal state using only the outputs ?

Controllability Inputs Outputs A System … and its state vector, How well can you control internal state using only the inputs ?

Controllability is the dual of Observability

Part III What Deep Systems Mean for Observability

Pure Monoliths developers per service Deep Systems Architectural evolution # of services

Stress (n): responsibility without control Stress what you can control what you are responsible for

Observability: Shrink This Gap

Mental models A System

Managing Deep Systems Services must have SLOs (“Service Level Objectives”: latency, errors, etc) For effective service management, only three things matter: 0. Releasing service functionality 1. Gradually improving SLOs 2. Rapidly restoring SLOs In a deep system, we must control the entire “triangle” to maintain our SLOs

There’s that word again… Controllability == Observability Controllability == Observability

Observability: “The Conventional Wisdom” Observing microservices is hard Google and Facebook solved this (right???) They used Metrics, Logging, and Distributed Tracing… … So we should, too.

3 Pillars, 3 Experiences Metrics Logs Traces

Three Pillars? Three Pillars? Two giant pipes… Metrics Without Traces: Cognitive Load ≈ O( depth 2 ) Logs

Three Pillars? Three Pillars? Two giant pipes… Metrics Logs

Two giant pipes… Metrics Without Traces: Cognitive Load ≈ O( depth 2 ) Logs

Traces

Traces provide Context

Traces provide Context And context rules out invalid hypotheses

Two giant pipes and a filter Metrics Context (from traces) Logs

Context reduces cognitive load Relevant Metrics Context (from traces) Relevant Logs With Traces: Cognitive Load ≈ O( depth )

Observability: Shrink This Gap

Let’s Review

Microservices don’t just scale wide, they scale deep Recognize deep systems

Stress (n): responsibility without control Stress what you can control what you are responsible for

“Controllability” (of SLOs) depends on observability

“The Three Pillars of Observability” is a lousy metaphor … and traces are not sprinkles

Tracing can reduce cognitive load from O( depth 2 ) to O( depth )

Tracing is the backbone of simple observability in deep systems

Thank You Play with LightStep, Feedback always for free, anytime: welcome: (no email address required!) twitter → @el_bhs lightstep.com/play the emails → bhs@lightstep.com

Architectures that Scale Deep: Regaining Control in Deep Systems - PowerPoint PPT Presentation

Architectures that Scale Deep: Regaining Control in Deep Systems Ben Sigelman (@el_bhs, bhs@lightstep.com) Co-founder & CEO: LightStep Co-creator: OpenTracing, OpenTelemetry, Google Dapper, Google Monarch QCon SF, November 2019 Part I

Architectures Architectural styles Software architectures Architectures versus middleware

8. Other Deep Architectures CS 519 Deep Learning, Winter 2018 Fuxin Li With materials from Zsolt

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

MongoDB large scale data-centric architectures QConSF 2012 Kenny Gorman Founder, ObjectRocket

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Distributed DeepLearning at Scale Soumith Chintala Facebook AI Research Overview Deep

Learning Deep Architectures Yoshua Bengio, U. Montreal CIFAR NCAP Summer School 2009 August 6th,

False Alarm Reduction for Active Sonars using Deep Learning Architectures Matthias Bu

CompSci 356: Computer Network Architectures Lecture 2: Network Architectures Xiaowei Yang

Architectures, Architectures, Microkernels, IPC, Microkernels, IPC, Capabilities Capabilities

Overview Agent Architectures Definition of agent architecture Classical Architectures for

CompSci 356: Computer Network Architectures Lecture 2: Network Architectures Xiaowei Yang

HPC Architectures Types of resource currently in use Outline Shared memory architectures

HPC Architectures Types of resource currently in use Outline Shared memory architectures

Deep Yellow Limited Indaba Presentation February 2013 Greg Cochran Managing Director ASX:

Modeling Interestingness with Deep Neural Networks Jianfeng Gao, Patrick Pantel, Michael Gamon,

Bird Identification using Deep Learning Techniques Presentation by Elias Sprengel University:

A Pure-Play Zinc Producer June 2018 w w w . a s c e n d a n t r e s o u r c e s . c o m T S X :

Deep Learning Feature for Handwritten Keyword Spotting Baptiste Wicht Andreas Fischer Jean

Everybody Else Has Signed It. Whats Your Problem? Why Deep South Engineers Need to

Tac acom oma P a Power an and Tran ansp sport rtation on E Electrification on Cam

Linearly Augmented Deep Neural Network Pegah Ghahremani, Johns Hopkins University, Jasha Droppo,