#QConSF @ana_m_medina
Chaos Engineering Chaos Engineering with Containers
1
Ana Medina Chaos Engineer at
Chaos Engineering Chaos Engineering with Containers Ana Medina - - PowerPoint PPT Presentation
@ana_m_medina #QConSF Chaos Engineering Chaos Engineering with Containers Ana Medina Chaos Engineer at 1 @ana_m_medina #QConSF Ana Medina @ana_m_medina Chaos Engineer @ Gremlin Previously Software Engineer / SRE @ Uber , Also
#QConSF @ana_m_medina
1
Ana Medina Chaos Engineer at
#QConSF @ana_m_medina
2
Ana Medina
@ana_m_medina
Chaos Engineer @ Gremlin Previously Software Engineer / SRE @ Uber, Also worked/ interned @ SFEFCU, Google, Quicken Loans, Stanford University and Miami Dade College. College dropout. Self taught engineer.
#QConSF @ana_m_medina
3
#QConSF @ana_m_medina
4
#QConSF @ana_m_medina
5
#QConSF @ana_m_medina
6
Gremlin Founder and CEO
#QConSF @ana_m_medina
7
#QConSF @ana_m_medina
8
Charity Majors CEO of honeycomb
#QConSF @ana_m_medina
9
#QConSF @ana_m_medina
10
#QConSF @ana_m_medina
11
#QConSF @ana_m_medina
Minimize the Blast radius
12
#QConSF @ana_m_medina
13
#QConSF @ana_m_medina
14
#QConSF @ana_m_medina
15
#QConSF @ana_m_medina
16
Real World Scenario: company / user is evaluating cloud provider managed kubernetes. which one is more reliable? The Hypothesis: shutting down a container (1/1) should only give a small delay before app is reachable again The Experiment: shut down kubernetes dashboard container Abort Conditions: app is unreachable after 60 seconds
#QConSF @ana_m_medina
17
#QConSF @ana_m_medina
#QConSF @ana_m_medina
#QConSF @ana_m_medina
#QConSF @ana_m_medina
21
Real World Scenario: company / user is evaluating
The Hypothesis: yes, they will come back up The Experiment: shutdown container and wait a few seconds and check if it’s up Abort Conditions: app is unreachable after 60 seconds
#QConSF @ana_m_medina
22
#QConSF @ana_m_medina
23
Real World Scenario: company / user is working with their UI team to provide a good user experience when there API/DB issues The Hypothesis: images will not load, but product listing will The Experiment: blackhole all traffic from the front end to REST API and DB ports Abort Conditions: app is unreachable after 60 seconds
#QConSF @ana_m_medina
24
#QConSF @ana_m_medina
25
#QConSF @ana_m_medina
26
#QConSF @ana_m_medina
27
#QConSF @ana_m_medina
28
#QConSF @ana_m_medina
@ana_m_medina ana@gremlin.com