Scaling Nova with CellsV2: The Nova Developer and the CERN Operator Perspective
Dan Smith (Red Hat), Belmiro Moreira (CERN)
Your deployment probably looks like this:
[Diagram: API → DB/MQ → Computes]

Nova with Cells(v1)
[Diagram: a special nova-cells router between the API and the per-cell DB/MQ, doing replication in Python]
This special router needs separate code for almost every feature!
Native sharding of the contended resources
[Diagram: API → per-cell DB/MQ → Computes]

CellsV2 Services
[Diagram: global API, Scheduler, Placement, and a “super” conductor; each cell has its own conductor and compute services]
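As a rough illustration of the sharding above, here is a hedged sketch (not Nova's actual code; compare nova.context.target_cell and the nova_api DB's cell_mappings/instance_mappings tables) of how the API routes a request to the right cell's DB and MQ instead of replicating data in Python:

```python
# Hedged sketch: CellsV2 routing via global mappings.  The nova_api DB
# stores one cell mapping per cell plus an instance->cell mapping; the
# API targets that cell's DB/MQ directly.  Names are simplified.
from dataclasses import dataclass

@dataclass
class CellMapping:
    name: str
    database_connection: str  # this cell's nova DB
    transport_url: str        # this cell's RabbitMQ

# Global data (nova_api DB): which cell each instance lives in.
INSTANCE_MAPPINGS = {
    "8a1b6f60-0000-0000-0000-000000000000": CellMapping(
        name="cell0001",
        database_connection="mysql+pymysql://nova@cell1-db/nova",
        transport_url="rabbit://nova@cell1-mq:5672/"),
}

def target_cell(instance_uuid):
    """Return the DB/MQ endpoints for the cell owning this instance."""
    cell = INSTANCE_MAPPINGS[instance_uuid]
    # A real implementation would open a DB session and RPC transport
    # from these URLs; no Python-level replication is involved.
    return cell.database_connection, cell.transport_url
```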
Design and Development Tenets
Mainstream
- CellsV2 should not be opt-in or a different code path
- Full upstream testing in a reasonably cells-y configuration
- Cells should be invisible to regular API users
No Python Replication
- Data should either live at the global or cell level (not both)
- Aim for no “unsupported in cells” features
Performance
- Optimize cross-cell instance-based API operations
- Introduce caching and fault tolerance as needed
Development Challenges
- Unify two camps of Nova users
○ Those for whom CellsV1 will never be a desirable solution
○ Those for whom CellsV1 is a necessary evil
- Must be able to prescribe a transition for both camps
○ Regular operators have minimal tolerance for unnecessary steps and sometimes fewer resources
○ Typical CellsV1 operators often have more resources, but have large existing deployments
- Major re-architecting of a large amount of Nova internals
- All of this had to happen in parallel with other efforts
- The world kept changing while we worked on this
How’d that go then?
- Mostly good?
○ Obviously this introduced bugs and churn
- Some additional operational overhead for regular operators
- Existing CellsV1 users faced a big transition
○ Deployment assumptions
○ Some of the least-desirable attributes became “features”
- Resulted in some cleanups and stricter rules around existing Nova code
○ Laid the groundwork for future non-scale-related use cases
Status (Rocky)
- Fully developed and tested in mainstream Nova: there is no “non-cells” deployment arrangement
- Good multi-cell performance
○ Focus has been on instance operations
○ Some admin-type operations may still need optimizing
- Some remaining functions fail to work properly in a fully-isolated environment
○ Late affinity check
- Performance is rapidly improving
- Fault tolerance is naive but improving
What’s next?
- Cross-cell migrations
○ Further eliminating the restrictions of running with multiple cells
- Fault tolerance improvements
○ API availability when cells are down
○ Improving quota handling when cells are down
○ Still plenty of room to improve with caching and DB replication
- Affinity via placement
[Screenshot: CERN - Cloud resources status board - 06/11/2018 @ 11:26]
Cells at CERN
- CERN has been using cells since 2013
- Why do we use cells?
○ Single endpoint; scale transparently between different data centres
○ Availability and resilience
○ Isolate failure domains
○ Dedicate cells to projects
○ Hardware type per cell
○ Easy to introduce new configurations
CellsV1 and The Operational Nightmare
- Unmaintained upstream
- Only a few deployments using CellsV1
- Several pieces of functionality missing
○ Flavor propagation
○ No aggregates support
○ No server group support
○ No security groups with nova-network
- A lot of local patches to make other basic functionality work
○ Examples:
■ Boot more than one instance per request
■ Availability Zones support
- DBs can get out of sync
- Upgrades are hard!
Journey to CellsV2 at CERN
[Timeline: Grizzly … Newton, Ocata, Pike, Queens (https://www.youtube.com/watch?v=49CFXNIDM3c&t)]
- 2013: CellsV1 deployed in the CERN Cloud, 2 cells
- 2018: CellsV2 deployed in the CERN Cloud, 70 cells
Why are we excited about CellsV2?
- Upstream code
- All Nova deployments now use cells
○ We are not in the “blackhole” anymore
- Finally we can use Nova’s full feature set
- Promise of sane DBs
- Rolling upgrades for old CellsV1 users
- CERN moved fast to CellsV2
We identified a few interesting issues at scale; most are already fixed in Rocky
HOT Databases
[Diagram: Nova API servers and top cell controllers use the nova_api DB; each cell (CellA … CellZ) has its own controller with a dedicated nova DB and RabbitMQ, plus its compute nodes]
HOT Databases
- Cell database activity increased a lot with CellsV2
○ Simple API operations need to connect to all cell DBs, and most of these operations were sequential
■ nova list; nova boot
○ Most of the issues are already fixed in Queens/Rocky or in progress
■ For example:
- https://bugs.launchpad.net/nova/+bug/1771810
- https://bugs.launchpad.net/nova/+bug/1746558
- https://bugs.launchpad.net/nova/+bug/1746561
[Graph: Number of queries and connections in one cell DB after the Nova Queens upgrade with CellsV2 enabled; the API was only available to a few users]
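Why sequential per-cell queries hurt, and the fix: listing instances across N cells one DB at a time makes latency grow linearly with N (70 cells at CERN), whereas fanning out in parallel costs roughly the slowest cell. A minimal sketch of the scatter-gather idea (compare Nova's scatter_gather_cells helper in nova/context.py; the cell objects and query_instances() method here are hypothetical):

```python
# Minimal scatter-gather sketch for multi-cell listing; illustrative only.
from concurrent.futures import ThreadPoolExecutor, as_completed

def list_instances_sequential(cells):
    # One DB round-trip per cell, one after another: latency adds up
    # linearly with the number of cells.
    results = []
    for cell in cells:
        results.extend(cell.query_instances())
    return results

def list_instances_scatter_gather(cells, timeout=30.0):
    # Issue the same query to every cell DB concurrently and gather the
    # results; total latency is roughly that of the slowest cell.
    results = []
    with ThreadPoolExecutor(max_workers=max(1, len(cells))) as pool:
        futures = [pool.submit(cell.query_instances) for cell in cells]
        for fut in as_completed(futures, timeout=timeout):
            results.extend(fut.result())
    return results
```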
DB Down! Cloud Down!
- A fault-tolerant DB solution per cell is recommended by the Nova team
○ Very challenging for CERN considering the number of cells
○ One of the reasons we decided to use cells was failure domains
- An unavailable cell DB affects the entire cloud
○ Can’t create, list, delete instances…
- No perfect solution... a few compromises
○ https://review.openstack.org/#/q/topic:bp/handling-down-cell
○ For example:
■ Not all information is available when getting instances
- nova list; nova show
- Returns a minimalistic construct from the information available in the API DB
■ It is not possible to calculate quota if a project has instances in an unavailable cell
- Policy: os_compute_api:servers:create:cell_down
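A hedged sketch of the “minimalistic construct” mentioned above: when a cell DB is unreachable, the API can only populate the fields it holds globally in the nova_api DB (the instance mapping), so everything else is reported as unknown. Field names here are illustrative, not Nova’s exact schema:

```python
# Illustrative only: building a partial server record for an instance
# whose cell database is down, from nova_api-DB data alone.
UNKNOWN = "UNKNOWN"

def minimal_instance_view(instance_mapping):
    """instance_mapping: the global record linking instance -> cell."""
    return {
        "id": instance_mapping["instance_uuid"],
        "tenant_id": instance_mapping["project_id"],
        # The cell DB is unreachable, so the real state, flavor and
        # addresses cannot be fetched; list/show report UNKNOWN instead.
        "status": UNKNOWN,
    }
```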
Scheduling
- Central scheduling
○ Filters are not per cell
■ Ex: “PCIPassthroughFilter” runs in every schedule request because we deploy GPUs in one cell
○ “request-filter” for Placement
■ Allows Placement to be aware of project cell mappings and AVZs
■ Basic filtering in Placement; reduces the number of allocation candidates
■ Uses aggregates and placement aggregates
- Automatic sync in Rocky
■ https://docs.openstack.org/nova/latest/admin/configuration/schedulers.html#aggregates-in-placement
○ However, we can still get a large number of allocation candidates
■ scheduler/max_placement_results = 10
- Improves scheduling performance
- But… unveiled some issues… (https://bugs.launchpad.net/nova/+bug/1777591)
○ Live migration with a defined target
○ Rebuild with a new image
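To make the request-filter idea concrete, here is a hedged sketch (compare Nova's nova/scheduler/request_filter.py; the function name and dict-based request spec are simplified, not Nova internals): the filter adds an aggregate constraint before Placement is queried, so Placement itself returns fewer allocation candidates:

```python
# Illustrative Placement request filter: constrain the request to the
# placement aggregate mapped to its availability zone, so whole cells
# are excluded server-side before candidates are generated.
def map_az_to_placement_aggregate(request_spec, az_to_aggregate):
    az = request_spec.get("availability_zone")
    if not az:
        return request_spec  # nothing to constrain
    # Placement's GET /allocation_candidates honours member_of=<agg-uuid>
    request_spec.setdefault("member_of", []).append(az_to_aggregate[az])
    return request_spec
```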
Miscellaneous
- Delete "Orphan" request_specs and instance_mappings
○ https://bugs.launchpad.net/nova/+bug/1761198
- Slow AVZ list. Important for Horizon
○ https://bugs.launchpad.net/nova/+bug/1801897
- Scheduling time is higher than in CellsV1
- Don’t expect always a consistent state from 5 years old DBs
○ Delete aggregate_hosts fails if service not available
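A hedged sketch of the orphan-detection idea from the bug above: a request_spec or instance_mapping in the nova_api DB is orphaned when its instance UUID no longer exists in any cell. The inputs here are plain collections; a real cleanup would query the databases and delete with care:

```python
# Illustrative only: find API-DB records whose instance is gone.
def find_orphans(api_db_uuids, cell_db_uuids_by_cell):
    """api_db_uuids: instance UUIDs referenced by request_specs or
    instance_mappings; cell_db_uuids_by_cell: {cell name: set of
    instance UUIDs in that cell's nova DB}."""
    live = set()
    for uuids in cell_db_uuids_by_cell.values():
        live |= set(uuids)
    return sorted(set(api_db_uuids) - live)
```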
Rocky Upgrade - Nova
- Control plane
○ Upgraded in 1h (Nova API unavailable)
○ VMs (4 vCPUs / 8 GB RAM)
○ Top control plane
■ 16 nova-api
■ 10 nova-conductor; 10 nova-scheduler
■ 10 nova-placement-api
○ 73 cell controllers
■ nova-api; nova-conductor; nova-network
- DBs sync done the day before
- upgrade_levels/compute=auto (pins compute RPC to the oldest running service version, allowing mixed-version operation during the rolling upgrade)
[Graph: Number of Nova API requests]
Rocky Upgrade - Nova
- Compute nodes upgraded during the 24h after the control plane
- The number of placement requests increased with the compute node upgrade
- Needed to triple the number of placement nodes
- Impact on VM scheduling time
- nova-compute (ironic driver) rolled back to Queens!
- http://lists.openstack.org/pipermail/openstack-dev/2018-November/136251.html
- https://review.openstack.org/#/c/614886/
Summary
CERN Cloud is running Nova Rocky with CellsV2
- A few issues found during Queens; most of them are already fixed in Rocky
- CellsV2 works at scale
- No more hand-crafted code, as in CellsV1, to get basic functionality
- Performance is improving
- Much easier upgrades