The return of OpenStack Telemetry and the 10,000 Instances


SLIDE 1

The return of OpenStack Telemetry and the 10,000 Instances

Telemetry Project Update

Alex Krzos Julien Danjou 8 November 2017

SLIDE 2

The return of OpenStack Telemetry and the 10,000 Instances

Alex Krzos Julien Danjou 8 November 2017

20,000

SLIDE 3

  • Alex Krzos, Senior Performance Engineer @ Red Hat, akrzos@redhat.com, IRC: akrzos
  • Julien Danjou, Principal Software Engineer @ Red Hat, jdanjou@redhat.com, IRC: jd_

Introductions

SLIDE 4

  • Why scale test?
  • Telemetry Architecture
  • Gnocchi Architecture
  • The Road to 10,000 Instances
  • Scale and Performance Test Results
  • Conclusion

Let's talk about Telemetry and scaling...

SLIDE 5

  • Determine capacity and limits
  • Develop good defaults and recommendations
  • Characterize resource utilization

Telemetry must scale: the number of metrics collected will only increase.

Why Scale Test?

SLIDE 6

Telemetry Architecture

SLIDE 7

Gnocchi Architecture

SLIDE 8

Ocata struggled to reach 5,000 instances, even with many tuned parameters and a reduced workload.

Goal: achieve 10,000 instances with less tuning than Ocata and a more demanding workload.

Extra credit: go beyond 10,000 with the same hardware.

The Road to 10,000 Instances

SLIDE 9

Boot Persisting Instances with Network

  • 500 at a time, then quiesce

Boot Persisting Instances

  • 1000 at a time, then quiesce

Measure Gnocchi API Responsiveness

  • Metric Create/Delete
  • Resource Create/Delete
  • Get Measures

Workloads
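To make the API responsiveness checks concrete, here is a minimal sketch of the Gnocchi v1 REST calls they exercise. The endpoint URL, Keystone token, and archive policy name are placeholder assumptions, and this is illustrative rather than the actual test harness:

```python
# Minimal sketch of the Gnocchi v1 REST calls the responsiveness checks hit.
# GNOCCHI and the token are placeholders for your own deployment; this is
# illustrative, not the presenters' test harness.
import uuid
import requests

GNOCCHI = "http://controller:8041"          # assumed Gnocchi API endpoint
HEADERS = {"X-Auth-Token": "<keystone-token>"}

# Metric create, read measures, delete
metric = requests.post(f"{GNOCCHI}/v1/metric",
                       json={"archive_policy_name": "low"},
                       headers=HEADERS).json()
measures = requests.get(f"{GNOCCHI}/v1/metric/{metric['id']}/measures",
                        headers=HEADERS).json()
requests.delete(f"{GNOCCHI}/v1/metric/{metric['id']}", headers=HEADERS)

# Resource create / delete (generic resource type)
rid = str(uuid.uuid4())
requests.post(f"{GNOCCHI}/v1/resource/generic",
              json={"id": rid}, headers=HEADERS)
requests.delete(f"{GNOCCHI}/v1/resource/generic/{rid}", headers=HEADERS)
```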

SLIDE 10

3 Controllers

  • 2 x E5-2683 v3 - 28 Cores / 56 Threads
  • 128GiB Memory
  • 2 x 1TB 7.2K SATA in RAID 1

12 Ceph Storage Nodes

  • 2 x E5-2650 v3 - 20 Cores / 40 Threads
  • 128GiB Memory
  • 18 x 500GB 7.2K SAS (2 in RAID 1 for OS, 16 as OSDs), 1 NVMe journal

59 Compute Nodes

  • 2 x E5-2620 v2 - 12 Cores / 24 Threads
  • 128GiB / 64GiB Memory
  • 2 x 1TB 7.2K SATA in RAID 1

Hardware

SLIDE 11

Network Topology

SLIDE 12

Workload (20 iterations)

  • 500 instances with attached network booted every 30 minutes

Gnocchi Settings

  • metricd workers per Controller = 18
  • api workers per Controller = 24

Ceilometer Settings

  • notification_workers = 3
  • rabbit_qos_prefetch_count = 128
  • 300s polling interval

10,000 Instances with NICs Test
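The settings above map roughly onto the configuration sketch below. Section and option names are recalled from Pike-era defaults and may differ in your release; the Gnocchi API worker count is normally set on the WSGI server (httpd/mod_wsgi or uwsgi) rather than in gnocchi.conf:

```ini
# Illustrative mapping of the settings above to configuration files.
# Option/section names are from memory of Pike-era defaults and may differ.

# /etc/gnocchi/gnocchi.conf
[metricd]
workers = 18

# /etc/ceilometer/ceilometer.conf
[notification]
workers = 3                      # exposed as notification_workers in older releases

[oslo_messaging_rabbit]
rabbit_qos_prefetch_count = 128

# The 300s polling interval is set in the polling configuration
# (e.g. /etc/ceilometer/polling.yaml: sources -> interval: 300).
```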

SLIDE 13

Pike Results - 10k Test Gnocchi Backlog

SLIDE 14

Pike Results - 10k Test CPU on Controllers

SLIDE 15

Pike Results - 10k Test Memory on All Hosts

SLIDE 16

Pike Results - 10k Test Disks on Controllers

SLIDE 17

Pike Results - 10k Test Disks on CephStorage

SLIDE 18

Workload (20 iterations)

  • 1000 instances booted
  • 5000 get measures
  • 1000 metric and resource creates/deletes

Gnocchi

  • metricd workers per Controller = 36
  • api processes per Controller = 24

Ceilometer

  • notification_workers = 5
  • rabbit_qos_prefetch_count = 128
  • 300s polling interval

20,000 Instances Test
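A rough sketch of how per-request timings for the "get measures" checks could be collected is shown below; the endpoint, token, and metric ID are placeholders, and this is not the actual tooling behind the API timing results shown later:

```python
# Rough sketch of timing repeated "get measures" calls against the Gnocchi API.
# Endpoint, token, and metric ID are placeholders (assumptions).
import time
import statistics
import requests

GNOCCHI = "http://controller:8041"
HEADERS = {"X-Auth-Token": "<keystone-token>"}
METRIC_ID = "<existing-metric-uuid>"

timings = []
for _ in range(5000):                      # 5000 get-measures per iteration
    start = time.monotonic()
    requests.get(f"{GNOCCHI}/v1/metric/{METRIC_ID}/measures", headers=HEADERS)
    timings.append(time.monotonic() - start)

print(f"mean={statistics.mean(timings):.3f}s",
      f"p95={statistics.quantiles(timings, n=20)[18]:.3f}s")
```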

SLIDE 19

Ocata Results

SLIDE 20

Ocata Results

Not in Pike

SLIDE 21

Pike Results - 20k Test Gnocchi Backlog

SLIDE 22

Pike Results - 20k Test CPU on Controllers

SLIDE 23

Pike Results - 20k Test Memory on All Hosts

SLIDE 24

Pike Results - 20k Test Disks on Controllers

SLIDE 25

Pike Results - 20k Test Disks on CephStorage

SLIDE 26

Pike Results - 20k Test Network Controllers Em1

SLIDE 27

Pike Results - 20k Test Network Controllers Em2

SLIDE 28

API Get Measures - 20k Test

SLIDE 29

API Create/Delete Metrics - 20k Test

SLIDE 30

API Create/Delete Resources - 20k Test

SLIDE 31

Some differences between versions (Newton, Ocata, Pike)

Pike (Gnocchi v4)

  • metricd/api workers
  • Incoming storage driver (Redis is currently preferred)

Ocata / Newton (Gnocchi 3.1 / 3.0)

  • metricd/api workers
  • tasks_per_worker / metric_processing_delay
  • Check scheduler (Use latest version of Gnocchi)

Tuning - Gnocchi
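As an example of the Pike/Gnocchi v4 tuning above, switching the incoming storage driver to Redis is a small gnocchi.conf change. The snippet below is a hedged sketch; the Redis URL is a placeholder and the option names follow the Gnocchi 4 documentation as recalled:

```ini
# Hedged sketch: Redis incoming storage driver in Gnocchi 4 (Pike).
# The Redis URL is a placeholder for your own Redis endpoint.
# /etc/gnocchi/gnocchi.conf
[incoming]
driver = redis
redis_url = redis://controller-vip:6379
```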

SLIDE 32

Always

  • avoid overwhelming the Gnocchi backlog (collect only what you need/use)
  • check rabbit_qos_prefetch_count (monitor RabbitMQ too)

Pike

  • agent-notification workers

Ocata

  • publish directly to Gnocchi (disable collector)

Newton

  • collector workers

Tuning - Ceilometer
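For the Ocata item above, publishing directly to Gnocchi means pointing the pipeline sink's publishers at gnocchi:// so samples bypass ceilometer-collector. A hedged pipeline.yaml sketch (source and sink names are illustrative):

```yaml
# Hedged sketch of an Ocata-era /etc/ceilometer/pipeline.yaml that publishes
# samples straight to Gnocchi, bypassing ceilometer-collector.
sources:
    - name: meter_source
      meters:
          - "*"
      sinks:
          - meter_sink
sinks:
    - name: meter_sink
      transformers: []
      publishers:
          - gnocchi://
```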

SLIDE 33

OpenStack Telemetry is now proven to the 10,000-instance mark and beyond in Pike. There is minimal degradation in API response times as more and more metrics are collected. Of course, there is still room for improvement:

  • Reduce the load on the archival storage
  • Spikes in API timings (Frontend API vs Backend API)
  • Performance testing with other storage drivers (Swift, File)

Conclusion

SLIDE 34

THANK YOU

plus.google.com/+RedHat linkedin.com/company/red-hat youtube.com/user/RedHatVideos facebook.com/redhatinc twitter.com/RedHatNews