Microservices at Netflix Scale
First Principles, Tradeoffs, Lessons Learned
Ruslan Meshenberg @rusmeshenberg
Microservices at Netflix Scale First Principles, Tradeoffs, Lessons - - PowerPoint PPT Presentation
Microservices at Netflix Scale First Principles, Tradeoffs, Lessons Learned Ruslan Meshenberg @rusmeshenberg Microservices: all benefits, no costs? Netflix is the worlds leading Internet television network with over 81 million members in
First Principles, Tradeoffs, Lessons Learned
Ruslan Meshenberg @rusmeshenberg
Netflix is the world’s leading Internet television network with over 81 million members in over 190 countries enjoying more than 125 million hours of TV shows and movies per day, including original series, documentaries and feature films.
Ruslan Meshenberg Director, Platform Engineering
https://media.netflix.com/en/company-blog/completing-the-netflix-cloud-migration
RDBMS
*Except the Persistence / Caching layers
Service A Service B Service B Service B Service B Service B
Zone A Zone B Zone C Zone B Zone C Zone A Zone A Zone B Zone C Zone C
Client Client
Zone A Zone B
500ms
Bi-directional Nightly compare & repair Local Quorum
(Typical)
Region A Region B
Develop
Develop, Test, Deploy, Support
Develop, Test, Deploy, Support
Develop, Test, Deploy, Support
Architect
Design Develop Review Test Deploy Run Support
Design Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
ArchitectDesign Develop Review Test Deploy Run Support
UI
Feature A Feature B Feature C
Personalization
Feature D A/B Test E
Mid-tier
A/B Test F Feature H
Infrastructure
Availability Scalability Security Leverage
Org changes are hard!
Client Application Client Library EVCache Client Service Client S S S S
. . . DB DB DB DB . . . . . .
Request Cache
If you can’t see it, you can’t improve it
Edge
ELB Zuul Playback
API Middle Tier & Platform
EVCache Cassandra
99% availability 99% availability 99% availability
99%
500 = 0.0657%
FIT
Fault-Injection Test Framework
x x
from VM to process
experience
Titus UI Titus UI Docker Registry Docker Registry Rhea container container container docker Titus Agent metrics agent Titus executor logging agent zfs mesos agent docker Rhea Titus API Cassandra Titus Master Job Management & Scheduler S3 Zookeeper Docker Registry 50 EC2 Autocaling API Mesos Master Titus UI Fenzo VPC networking driver container container container AWS container metadata proxy
Integration
CI/CD Amazon VM’s
http://netflix.github.com
http://netflix.github.com
http://netflix.github.com
http://netflix.github.com
http://netflix.github.com
http://netflix.github.com
Microservices bring great value to development velocity, availability and other dimensions
Microservices at scale require organizational change and centralized infrastructure investment
Be aware of your situation and what works for you
Ruslan Meshenberg @rusmeshenberg