getting comfortable in prod to improve your life in dev @cyen - PowerPoint PPT Presentation

getting comfortable in prod to improve your life in dev @cyen @honeycombio

first, some background…

Christine DEV

DEV WRITE → TEST → COMMIT → WRITE → TEST → COMMIT → WRITE → TEST → COMMIT → WRITE → TEST → COMMIT → WRITE → TEST → COMMIT → WRITE → TEST → COMMIT → WRITE → TEST → COMMIT → WRITE → TEST → COMMIT

DEV OPS WRITE → TEST → COMMIT → RELEASE 💦 → DEBUG → FIX

💦 DEV OPS "The only good "Works on my diff is a red machine" diff"

"Observation 1: Change is the most common trigger" —Subbu Allamaraju, Expedia, Feb 2019   https://m.subbu.org/incidents-trends-from-the-trenches-e2f8497d52ed

API USER BILLING GATEWAY MGMT REST REST API API APP PARTNER PAYMENT WEB UI S MGMT REST REST API API INTERNAL TXN NOTIFICATION SYSTEM WEB UI MGMT REST REST API API THEN NOW

DEV OPS "The only good "Works on my diff is a red machine" diff"

DEV OPS THE FIRST WAVE: getting ops folks to code teaching devs to own THE SECOND WAVE: code in production

observability DEV OPS it’s all about sharing SOFTWARE OWNERSHIP

observability a.k.a. understanding the behavior of a system based on knowledge of its external outputs. a.k.a. "what is my software doing, and why is it behaving that way?"

    monitoring observability The system as black box The system as a living, magic. Thresholds, alerts, adaptable thing. A culture of system signals like CPU and instrumentation and metadata memory.   rather than strictly-defined counters.   Checking and rechecking for known bad behaviors. Being able to tease out previously-unknown bad behaviors and outliers.

DEV OPS WRITE → TEST → COMMIT → RELEASE 💦 → DEBUG → FIX

DEV OPS WRITE → TEST → COMMIT → RELEASE → OBSERVE TEST OBSERVE

DEV OPS MAKE HAUNTED GRAVEYARDS LESS SCARY

… why devs, again?

The   ▸ Design documents Software ▸ Architecture review DEV ▸ Test-driven development TEST ▸ Integration tests ▸ Code review Process ▸ Continuous integration ▸ Continuous deployment ▸ 🎊🥃🍿🎋 ▸ Observe our code in production

EXPECTED --- FAIL: TestUnitTest (0.00s) talk_test.go:10: — expected: 4 (type int) ACTUAL actual: 5 (type int)

💦 DEV OPS "The only good "Works on my diff is a red machine" diff"

DEV PROD still   observability

prod, part of the dev process?

when deciding… The   ▸ Design documents Software WHAT   ▸ Architecture review DEV to build ▸ Test-driven development ▸ Integration tests HOW TO   ▸ Code review Process build it ▸ Continuous integration ▸ Continuous deployment WHETHER   ▸ 🎊🥃🍿🎋 it works ("test in prod") ▸ (Wait for exception   tracker to complain)

▸ Locally: log lines, printfs, debuggers attached to our IDEs when ▸ What’s causing our code to deviate from deciding expectations? … WHAT ▸ Stop "pulling straws"—quantify pain, and start prioritizing.

▸ Know what "normal" really is ▸ Events (instrumentation) can be when like DEBUG statements in prod deciding ▸ What and how we build should be … HOW TO informed by reality

▸ Complex systems have an infinitely long list of black swan failure scenarios when ▸ "Test in Production" to experiment and check deciding hypotheses WHETHE … ▸ Feature flags + observability = 💜 R

but this is hard.

make prod feel more like dev

TOOLS SHOULD SPEAK MY LANGUAGE ▸ As a dev, traditional monitoring tools don't tie back to the concepts I deal with in my code $YOUR_BIZ-relevant ID AWS availability zone time to render API endpoint CPU utilization payload size kafka partition build ID client OS Cassandra hostname

TOOLS SHOULD SPEAK MY LANGUAGE ▸ As a dev, traditional monitoring tools don't tie back to the concepts I deal with in my code AWS availability zone customer ID 8bd3acf2 394817e6 7e7ea1d0 1528afb3 a87fcfcd 7e7ea1d0 7e7ea1d0 394817e6 7e7ea1d0 us-east-1 fb2ff7ca 2f67a581 394817e6 8bd3acf2 eu-west-1 70efe4da 2f67a581 7e7ea1d0 2f67a581 7e7ea1d0 fb2ff7ca 7e7ea1d0 1528afb3 4e4e1207 4e4e1207 1528afb3 1528afb3 1528afb3 98f1d93f 1528afb3 394817e6 us-west-2 144afb2f 2f67a581 2f67a581 98f1d93f 7e7ea1d0 7e7ea1d0 eu-central-1 7e7ea1d0 a87fcfcd 7e7ea1d0 8bd3acf2 7e7ea1d0 1528afb3 394817e6 us-west-1 2f67a581

TOOLS SHOULD SPEAK MY LANGUAGE ▸ As a dev, traditional monitoring tools don't tie back to the concepts I deal with in my code AND LET ME ITERATE

SHARE PATTERNS WHERE POSSIBLE ▸ Tracing helps production feel even more familiar: can map a trace directly to my code structure

PROD SHOULD FEEL LIKE DEVELOPMENT?

CHANGE CAN BE INCREMENTAL 2019-01-25T01:30:23.743Z Enqueued task 2019-01-25T01:30:24.120Z Task processed, returning 42 entries 2019-01-25T01:30:24.212Z Task complete (email sent to foobar@example.com) 2019-01-25T01:30:23.743Z Enqueued task task_id=72 type=enqueue target=email 2019-01-25T01:30:29.953Z Task timed out after 6.01 seconds task_id=72 type=process Timestamp=2019-01-25T01:30:29.953Z target=email message=Task timed out after 6.01 seconds queue_dur_ms=200 task_id=72 timeout_dur_ms=6010

CHANGE CAN BE INCREMENTAL 2019-01-25T01:30:23.743Z Enqueued task task=72 2019-01-25T01:30:24.120Z Enqueued task task=74 2019-01-25T01:30:24.212Z Task processed, returning 42 entries task=74 2019-01-25T01:30:26.014Z Task complete (email sent to foobar@example.com) task=74 2019-01-25T01:30:26.214Z Enqueued task task=77 2019-01-25T01:30:24.120Z Task errored: unknown constant ::Fixnum task=77 2019-01-25T01:30:29.953Z Task timed out after 6.01 seconds task=72 2019-01-25T01:30:32.762Z Enqueued task task=78 2019-01-25T01:30:34.243Z Task processed, returning 0 entries task=78 2019-01-25T01:30:34.243Z Task complete, (email sent to bazqux@example.com) task=78

at the end of all of this…

💦 DEV OPS

💜 OPS DEV

DEV OPS WRITE → TEST → COMMIT → RELEASE → OBSERVE TEST OBSERVE

share the great responsibility   OPS: (and great power!) DEVS: embrace observability, bring production closer to development.

ASK NEW QUESTIONS thanks! SHIP BETTER SOFTWARE @cyen @honeycombio CURIOUS? TRY play.honeycomb.io

getting comfortable in prod to improve your life in dev @cyen - PowerPoint PPT Presentation

getting comfortable in prod to improve your life in dev @cyen @honeycombio first, some background Christine DEV DEV WRITE TEST COMMIT WRITE TEST COMMIT WRITE TEST COMMIT WRITE TEST COMMIT

Presentation 2016 SC ORGANISATION CONSULTING PROD SRL Share Capital: 270 000 euros

ThorneConsulting.com W E L C O M E From Getting Noticed From Getting Noticed to Getting

Common constraint mistakes SU P P LY C H AIN AN ALYTIC S IN P YTH ON Aaren St u bber eld S

Going D/S/K Prod Like A Pro BRET FISHER Docker Captain, DevOps Dude, Creator of Docker Mastery

SALT Vagrant and Virtualbox Ben Hosmer @bhosmer Monday, April 22, 13 Local Dev Prod

WELCOME Getting Comfortable with our Technology Andrew Barnes Community Engagement Coordinator

Inject ctable m mod odified r release se prod oducts ts Dr Sotiris Michaleas, National

Voic oice Diso e Disord rder ers ASHA / Ho How do do w we pr prod oduce ce voi oice

implicatio implica tion n of of bio biofue fuel pr prod oduc uctio tion: n: A A case

Che hemical ical Prod oduction ction Presented to Side event for Conference of States

BI BIODI ODIVERSITY VERSITY CO CONS NSERVATI ERVATION ON IN IN CO COCO COA A PR PROD

Michigan 6-2018 Aerials Shipping Products Acres Prod Prop Shipping Whsle Ctr Admin Total

RICE CE HUSK K AS AN ALTERN TERNATIV TIVE E ENER ERGY GY FOR OR CEME MENT NT PROD ODUC

Em Emer ergi ging ng Ma Major or Co Copp pper er Pr Prod oduc ucer er ORA PRODUC

Cold Cathode N N L U A A X I CARALUX CARALUX Internal illuminations Caralux Prod.

FOO OOD an and AG AGRICULTURAL AL PROD ODUCTS IN INDUSTRY an and TRAD ADE A. A.S.

Panel Discussion Panel

Frank-Wolfe Splitting via Augmented Lagrangian Method Fabian Pedregosa 2 Simon Lacoste-Julien 1

AGENDA A Business Perspective What is a Systems Mindset? Elements of Business

Inference attacks on location data S ebastien Gambs Universit e du Qu ebec ` a Montr

On Selective Unboundedness of VASS St ephane Demri Laboratoire Sp ecification et V

Nuclear fusion Light nuclei can react together and form heavier nuclei, for example: R. Nguyen

Long-Term Care Insurance (Kaigo Hoken) Introduction to the System and how to use its services

GAP INC. FISCAL 2019 FOURTH QUARTER EARNINGS RESULTS Robert J. Fisher Teri List-Stoll INTERIM