@coreoslinux About Me @brandonphilips CTO/CO-FOUNDER - PowerPoint PPT Presentation

@coreoslinux

About Me @brandonphilips CTO/CO-FOUNDER github.com/philips systems engineer

/etc distributed

open source software, failure tolerant, durable, watchable, exposed via HTTP, runtime reconfigurable

Data Store API: -X GET (Get, Wait), -X PUT (Put, Create, CAS), -X DELETE (Delete, CAD)
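The seven operations on this slide map onto three HTTP verbs plus query parameters. The sketch below only builds the requests and does not send them. The endpoint `http://127.0.0.1:2379/v2/keys`, the key `/pet`, and the helper name `build` are assumptions for illustration; the parameter names (`wait`, `waitIndex`, `prevExist`, `prevValue`) follow the etcd v2 keys API as I understand it.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE = "http://127.0.0.1:2379/v2/keys"  # assumed local endpoint, not from the slides


def build(method, key, value=None, **query):
    """Build (but do not send) a request for one data store operation."""
    url = f"{BASE}/{key.lstrip('/')}"
    if query:
        url += "?" + urlencode(query)
    # Values travel as a form-encoded body; reads and deletes carry no body.
    body = urlencode({"value": value}).encode() if value is not None else None
    return Request(url, data=body, method=method)


# -X GET
get = build("GET", "/pet")
wait = build("GET", "/pet", wait="true", waitIndex=4)
# -X PUT
put = build("PUT", "/pet", "cat")
create = build("PUT", "/pet", "cat", prevExist="false")  # fail if key exists
cas = build("PUT", "/pet", "dog", prevValue="cat")       # compare-and-swap
# -X DELETE
delete = build("DELETE", "/pet")
cad = build("DELETE", "/pet", prevValue="dog")           # compare-and-delete

for req in (get, wait, put, create, cas, delete, cad):
    print(req.get_method(), req.full_url)
```

Passing any of these objects to `urllib.request.urlopen` would send it to a running server.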

etcd Cluster Leader Follower

Applications locksmith

Cluster Wide Reboot Lock
1. Need to reboot? Decrement the semaphore key atomically with etcd.
2. manager.Reboot() and wait...
3. After rebooting, increment the semaphore key in etcd atomically.
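A minimal sketch of the three steps above, run against an in-memory stand-in rather than a real etcd cluster. `ToyStore`, `add_to_semaphore`, and `reboot_with_lock` are hypothetical names; this is not locksmith's actual code or key layout. The atomic decrement is modeled as a read followed by a compare-and-swap on the key's index, retried until it wins.

```python
class ToyStore:
    """In-memory stand-in for etcd: each key holds (value, index)."""

    def __init__(self):
        self.index = 0
        self.data = {}

    def get(self, key):
        return self.data[key]

    def put(self, key, value):
        self.index += 1
        self.data[key] = (value, self.index)
        return self.index

    def cas(self, key, prev_index, value):
        """Write only if the key is unchanged since prev_index."""
        if self.data[key][1] != prev_index:
            return False
        self.put(key, value)
        return True


def add_to_semaphore(store, key, delta):
    """Retry a read-modify-write until the CAS wins; refuse to go below zero."""
    while True:
        value, index = store.get(key)
        if value + delta < 0:
            return False  # no slots free: other machines hold the lock
        if store.cas(key, index, value + delta):
            return True


def reboot_with_lock(store, key, reboot):
    if not add_to_semaphore(store, key, -1):  # step 1: take a slot
        return False
    reboot()                                  # step 2: reboot and wait
    add_to_semaphore(store, key, +1)          # step 3: give the slot back
    return True
```

Seeding the semaphore with `store.put("sem", 1)` allows one machine to reboot at a time; a larger initial value allows that many concurrent reboots.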

Applications kubernetes and fleet

You Scheduler API Scheduler Machine(s)

Cluster Work Scheduling
1. Cluster API writes desired work into etcd keyspace.
2. Agents running on individual machines pick up work assigned to them.
3. Agents report where work is running and current status.

Applications vulcan, confd, dns and distributed git

Example Leader Election using TTL and atomic operations

PUT /6eadeac2d/f1d2d2f924e98 ‘http://10.1.2.3:7001’
Entry: Index 1, Key /6eadeac2d/f1d2df, Value http://10.1.2.3:7001

Idx Key Value Expiration Time
18 sched m3 Sept 18 2:11:30

schedlr m3: cas(sched, 18, m3)

Idx Key Value Expiration Time
45 sched m3 Sept 18 2:13:30
sync(2:13:00)

Idx Key Value Expiration Time
(no entries)
sync(2:13:30)

schedlr m5: create(sched, m5)

Idx Key Value Expiration Time
50 sched m5 Sept 18 2:13:35
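The election sequence in these slides can be sketched with a toy store that supports a TTL, `create`, and `cas`. `TTLStore` is a hypothetical in-memory model, not etcd's implementation. Time is passed in explicitly as plain seconds so the run is deterministic, and the indexes start at 1 rather than matching the slide's 18, 45, and 50.

```python
class TTLStore:
    """Toy store with indexes and expiry; the caller supplies the clock."""

    def __init__(self):
        self.index = 0
        self.data = {}  # key -> (value, index, expires_at)

    def sync(self, now):
        """Drop every entry whose expiration time has been reached."""
        self.data = {k: v for k, v in self.data.items() if v[2] > now}

    def create(self, key, value, now, ttl):
        """Succeed only if the key does not exist; return the new index."""
        self.sync(now)
        if key in self.data:
            return None
        return self._write(key, value, now + ttl)

    def cas(self, key, prev_index, value, now, ttl):
        """Succeed only if the key still sits at prev_index."""
        self.sync(now)
        entry = self.data.get(key)
        if entry is None or entry[1] != prev_index:
            return None
        return self._write(key, value, now + ttl)

    def _write(self, key, value, expires_at):
        self.index += 1
        self.data[key] = (value, self.index, expires_at)
        return self.index


store = TTLStore()
won = store.create("sched", "m3", now=0, ttl=60)             # m3 becomes leader
lost = store.create("sched", "m5", now=10, ttl=60)           # m5 is too late
renewed = store.cas("sched", won, "m3", now=30, ttl=60)      # m3 extends its lease
takeover = store.create("sched", "m5", now=90, ttl=60)       # m3 went quiet, lease ran out
print(won, lost, renewed, takeover)
```

The leader must renew with `cas` against the index it last wrote, so a machine that lost leadership without noticing cannot silently overwrite its successor.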

etcd basics clusters and bootstrapping

etcd Cluster Leader Follower

bootstrapping Candidate

GET discovery.etcd.io/new

discovery.etcd.io/6eadeac2 6eadeac2d

6eadeac2d/state CREATE

6eadeac2d/state
Key Value Index
state started 5890
n0 10.0.2.1 5891
n1 10.0.2.4 5898
...

bootstrapped Leader Follower

6eadeac2d/state CREATE

[Diagram: a log of four entries, with indexes 1 2 3 4]

Sequential Consistency Operations* are atomically executed in the same sequential order on all machines.

PUT Pet = cat (index 1), PUT Pet = dog (index 2)
[Diagram, step 1: machine logs 1 2 (Pet=dog), 1 (Pet=cat), 1 (Pet=cat)]
[Diagram, step 2: machine logs 1 2 (Pet=dog), 1 2 (Pet=dog), 1 (Pet=cat)]
[Diagram, step 3: machine logs 1 2 (Pet=dog), 1 2 (Pet=dog), 1 2 (Pet=dog)]

Sequential Consistency Real-time

GET Pet @ 10:00.0 -> 2[dog]
GET Pet @ 10:00.0 -> 1[cat]!?
[Diagram: machine logs 1 2, 1, 1 2]

GET Pet @ 10:00.1 -> 2[dog]
[Diagram: machine logs 1 2, 1 2, 1 2]

Sequential Consistency Index Time

GET Pet @ 2 -> 2[dog]
GET Pet @ 2 -> blocking
[Diagram: machine logs 1 2, 1, 1 2]

GET Pet @ 2 -> 2[dog]
[Diagram: machine logs 1 2, 1 2, 1 2]

etcd guarantees that a get at index X will always return the same result. Avoid thinking in terms of real time because with network latency the result is always out-of-date.
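One way to picture the index-time guarantee is a toy replica that answers a read "as of index X" only after it has applied the log that far. `Replica` and `NotYetApplied` are hypothetical names for illustration. A real server would block where this sketch raises.

```python
class NotYetApplied(Exception):
    """The replica has not applied the log up to the requested index."""


class Replica:
    """Toy replica: applies a shared log in order, possibly lagging behind."""

    def __init__(self):
        self.applied = []  # position i holds the entry with index i + 1

    def apply(self, log, up_to):
        self.applied = list(log[:up_to])

    def get_at(self, key, index):
        """Return the value of key as of `index`."""
        if index > len(self.applied):
            raise NotYetApplied(index)  # a real server would block here
        value = None
        for k, v in self.applied[:index]:
            if k == key:
                value = v  # later entries overwrite earlier ones
        return value


log = [("Pet", "cat"), ("Pet", "dog")]  # indexes 1 and 2
fast, slow = Replica(), Replica()
fast.apply(log, 2)
slow.apply(log, 1)

print(fast.get_at("Pet", 2))  # the up-to-date replica answers from index 2
print(slow.get_at("Pet", 1))  # the lagging replica can still answer for index 1
```

Because every replica applies the same entries in the same order, any two replicas that can answer for index X give the same answer, whatever the wall clock says.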

Quorum GETs GET via Raft

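[Diagram: machine logs 1 2, 1, 1 2]

QGET A
[Diagram: machine logs 1 2, 1, 1 2]

QGET A -> 2[dog]
[Diagram: machine logs 1 2, 1 2, 1 2]

QGET A -> 2[dog]
[Diagram: machine logs 1 2 3, 1 2, 1 2 3]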

Watchable Changes HTTP Long-poll

Log: 1 2 3
> GET asdf?waitIndex=4&wait=true HTTP/1.1
> Accept: */*
>
< HTTP/1.1 200 OK
< Content-Type: application/json
< X-Etcd-Index: 3
< X-Raft-Index: 97
< X-Raft-Term: 0
<
BLOCK

Log: 1 2 3 4
> GET asdf?waitIndex=4&wait=true HTTP/1.1
> Accept: */*
>
< HTTP/1.1 200 OK
< Content-Type: application/json
< X-Etcd-Index: 3
< X-Raft-Index: 97
< X-Raft-Term: 0
<
{"action":"set","node":{"key":"/asdf","value":"foobar","modifiedIndex":4,"createdIndex":4}}

Log: 1 2 3 4
> GET asdf?waitIndex=4&wait=true HTTP/1.1
> Accept: */*
>
< HTTP/1.1 200 OK
< Content-Type: application/json
< X-Etcd-Index: 4
< X-Raft-Index: 516
< X-Raft-Term: 0
<
{"action":"set","node":{"key":"/asdf","value":"foobar","modifiedIndex":4,"createdIndex":4}}

Event History History isn’t forever, prepare!
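Why a watcher has to prepare for lost history can be shown with a bounded event buffer. This is a toy model under stated assumptions: `EventHistory` is a hypothetical class, and the capacity of 2 in the usage lines is arbitrary and chosen only to force an eviction.

```python
from collections import deque


class EventHistory:
    """Keep only the most recent `capacity` events, as a sliding window."""

    def __init__(self, capacity):
        self.events = deque(maxlen=capacity)  # (index, key, value)

    def record(self, index, key, value):
        self.events.append((index, key, value))  # oldest event falls off

    def wait(self, key, wait_index):
        """Return the first event for key at or after wait_index.

        Returns None when the watcher must keep blocking, and raises
        LookupError when wait_index is older than anything remembered.
        """
        if self.events and wait_index < self.events[0][0]:
            raise LookupError("requested index was cleared from history")
        for index, k, value in self.events:
            if k == key and index >= wait_index:
                return index, value
        return None


history = EventHistory(capacity=2)
history.record(3, "/other", "x")
blocked = history.wait("/asdf", 4)   # nothing at index 4 yet
history.record(4, "/asdf", "foobar")
event = history.wait("/asdf", 4)     # the watch now returns
print(blocked, event)
```

When the lookup fails because the index has been evicted, a client would typically re-read the current state with a plain GET and resume watching from the index that read returns.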

Availability In a 2F+1 cluster tolerate F machine failures
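The 2F+1 rule is plain majority arithmetic, sketched below. The function names are mine, not etcd's.

```python
def quorum(n):
    """Smallest number of machines that forms a majority of n."""
    return n // 2 + 1


def tolerated_failures(n):
    """Machines that can fail while a majority of n is still reachable."""
    return n - quorum(n)


for n in (1, 3, 4, 5, 7):
    print(f"cluster of {n}: quorum {quorum(n)}, tolerates {tolerated_failures(n)}")
```

A cluster of 4 tolerates no more failures than a cluster of 3, which is why odd sizes are the usual choice.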

Available

Unavailable

Master Election Fast recovery (5-10*typical RTT) from temporarily unavailable

Available Leader Follower

Temporarily Unavailable Leader Follower

Available Leader Follower

Durable log files, snapshots and backups

Mistakes so far...

Log files
Filesystems truncate and corrupt data.
Solutions:
● Must use checksumming in the file to ensure sanity
● Throwing out broken log files must be handled by the server
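A minimal sketch of per-record checksumming, assuming a simple framing of my own (4-byte length, 4-byte CRC32, then the payload). This is not etcd's on-disk log format. The reader stops at the first truncated or corrupt record and reports how many bytes were valid, so the caller can discard the broken tail.

```python
import struct
import zlib

HEADER = struct.Struct("<II")  # payload length, CRC32 of payload


def append_record(buf, payload):
    """Append one framed record to a bytearray."""
    buf.extend(HEADER.pack(len(payload), zlib.crc32(payload)))
    buf.extend(payload)


def read_records(data):
    """Return (records, valid_bytes); stop at the first bad record."""
    records, pos = [], 0
    while pos + HEADER.size <= len(data):
        length, crc = HEADER.unpack_from(data, pos)
        start = pos + HEADER.size
        payload = data[start:start + length]
        if len(payload) < length or zlib.crc32(payload) != crc:
            break  # truncated or corrupted: nothing after this is trusted
        records.append(payload)
        pos = start + length
    return records, pos


log = bytearray()
append_record(log, b"set pet=cat")
append_record(log, b"set pet=dog")

intact, _ = read_records(bytes(log))
torn, valid = read_records(bytes(log[:-3]))  # simulate a truncated write
print(len(intact), len(torn), valid)
```

Truncating the file to `valid` bytes is one way for a server to throw out the broken tail on startup without operator help.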

etcd machine naming
Trusted users to manage unique names across the cluster. This went poorly.
● Misconfiguration from bugs
● Misconfiguration by users
● Machine cloning on the cloud
Solution: etcd data-dir owns a unique uuid.
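One way to let the data directory own the identity is to generate a UUID the first time the directory is used and reuse it on every later start. The file name `member-id` and the function `member_id` are hypothetical; the slides do not say how etcd stores the value.

```python
import os
import uuid


def member_id(data_dir):
    """Return this data directory's identity, creating it on first use."""
    path = os.path.join(data_dir, "member-id")  # hypothetical file name
    try:
        # Mode "x" creates the file and fails if it already exists, so two
        # starts can never both believe they generated the identity.
        with open(path, "x") as f:
            new_id = str(uuid.uuid4())
            f.write(new_id)
            return new_id
    except FileExistsError:
        with open(path) as f:
            return f.read().strip()
```

With this approach the name no longer depends on what a user or a provisioning script typed into the configuration.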

sync() in the cloud
Slow, slow, slow:
● User #1 OpenStack on spinning disk: 6s
● User #2 AWS EBS backed: 1.5s
Solution:
● Tune etcd to expect this long latency.
● Write batching and handling of machines that fall behind.

Wednesday 10:40am LCA: CoreOS: An Introduction
Wednesday 6:00pm AKL Continuous Delivery Meetup: CoreOS: An Introduction
Thursday 6:00pm Go AKL Meetup: Something about Go
Friday 10:40am LCA: CoreOS Tutorial

Thanks we like pull requests github.com/coreos/etcd
