TUT1131 - Best Practices in Deploying SUSE CaaS Platform Martin - PowerPoint PPT Presentation

TUT1131 - Best Practices in Deploying SUSE CaaS Platform Martin Weiss Juan Utande Herrera Senior Architect Infrastructure Solutions Senior Architect Infrastructure Solutions Martin.Weiss@SUSE.com Juan.Herrera@suse.com

AGENDA AGEN What What is is SUSE SUSE CaaS CaaS 4 Deployment B loyment Best P t Practices ctices 1 Platform tform 2 Requirements uirements 5 Testing ting 3 Planning and Sizi Plan and Sizing 6 Operations rations

What is SUSE CaaS Plaform 3 3

SUSE: Underpinning Digital Transformation Business-critical Machine Business High Performance Traditional IT Internet of Applications Learning Analytics Computing & Applications Things Application Delivery Container Management Platform as a Service SUSE CaaS Platform SUSE Cloud Application Platform Software-defined Infrastructure Services Private Cloud / IaaS SUSE OpenStack Cloud SUSE Global Infrastructure Services & Lifecycle Compute Storage Consulting Management Networking Services Virtual Machine SUSE Enterprise Public Cloud SDN and NFV Select Services & Container Storage SUSE Manager Premium Support SUSE Cloud Services Multimodal Operating System Service Provider SUSE Linux Enterprise Server Program Physical Infrastructure: Multi-platform Servers, Switches, Storage Open, Secure, Proven 4

What is SUSE CaaS Platform 3? • Kubernetes • MicroOS with Transactional Updates • Simple deployment • SUSE supported • LDAP / Active Directory Integration • Caching Registry Integration • Air Gapped Implementation Support • Registry.suse.com • Helm • Docker or Cri-o (tech preview), Flannel • Multiple deployment methods

Requirements 6

General requirements Where to deploy What do I need Who can help me Support options • Deploy on physical • SUSE CaaS • Sales and Pre/Post • Included 24/7 Hardware or on your Platform Sales Consulting: priority support in Virtualization subscriptions case of issues - Help choosing the infrastructure • SLES for right Hardware • Consulting for • Ready to Run on infrastructure nodes maintenance and - Architect the Public and Private proactive support to solution Clouds scale, upgrade, - Initial review and fix implementation

Use Case Specific Requirements Application Requirements Security Requirements Availability Requirements (Sizing) • Number of Pods • Images (source and size) • Single or multi data-center • Memory, CPU • Isolation • Distance / Latency • Storage requirements (file, • Integration into existing block, object, single or multi- Identity Sources writer, capacity, static or dynamic provisioning) • specific Hardware / CPU / GPU requirements • Network Entry points / Services / Bandwidth $$$ BUDGET $$$ Politics, Religion, Philosophy, Processes ;-)

Planning and Sizing

Planning and Sizing SUSE C SE CaaS P S Platform – CLUS USTER 1 R 1 Kuber ernet etes es Master Master Master + Admin Based Workers as VM or physical on number of pods Worker Worker Worker + Fault tolerance Based on LDAP, Salt, number of pods ETCD cluster and resource Velum, SQL requirements Disk Space for each Worker: Second cluster: • 50 GB for OS (BTRFS minimum for OS) • Fault • 100 GB for /var/lib/docker (BTRFS for Images tolerance and Containers) • Disaster • Space really depends on image sizes, versions recovery and changes

Deployment Best Practices 1

Deployment - Processes and People Prepare the Team (DevOps?) – Server – Storage – Network – Application – Security – User Other

Deployment Stages 2 1 3 Base Infrastructur Infrastructur Software e e Installation Preparation Verification 4 5 SUSE Kubernetes CaaS Addons Platform Installation

Deployment Review the Design Preparation of Time Synchronization • Depending on the requirements adjust • Have a fault tolerant time provider group before implementation Name Resolution Hardware Installation • Ensure that all addresses of the servers • Ensure that hardware installation and have different names cabling is correct • Add all addresses to DNS with forward and • Update Firmware reverse lookup • Adjust Firmware / BIOS settings • Ensure DNS is fault tolerant Disable everything not required ( i.e. serial • /etc/HOSTNAME must be the name in the ports, network boot, power saving ) public network Configure HW date/time • Define and create DNS Entries for internal and external Velum and API targets (Cname, VM Preparation Load Balancer, no round robin) • Use paravirtual SCSI

Deployment Deploy On-Premise Registry (docker-distribution-registry) • Implement Portus to Secure the On-Premise Registry • Create DNS entry for Registry • Create Namespaces and Users on Registry • Optional: Integrate Portus into existing LDAP or Active-Directory Put all required images into registry into the right namespace • Dashboard, Prometheus, Grafana, etc. Optional: Setup caching registries

Deployment Prepare Load Balancer Endpoints for API and DEX • Port 6443 and 32000 Storage Network setup and connectivity Prepare on-premise helm chart repository Prepare docker host to pull from internet, scan images, push to on- premise registry Prepare GIT for storing all manifests / yaml files

Deployment Software Staging AutoYaST • Subscription Management Toolkit, SUSE • Ensure that all servers are installed Manager, RMT (limited) 100% identical • Ensure staging of patches to guarantee • Consulting solution available (see same patch level on existing servers and https://github.com/Martin-Weiss/cif) newly installed servers Configuration Management General • Templates • Use BTRFS for the OS • Salt • Disable Firewall / AppArmor / IPv6

Deployment ONLY USE STATIC IP Configs Verify Time Synchronization Verify Name Resolution Test all Network Connections • Bandwidth • Latency

Deployment • Install all Servers (Admin, Master, Worker) via AutoYaST • Ensure that all the patches available are installed at this point in time • AutoYaST configures Salt to ensure all Master/Worker connect to Salt-Master on the Admin host • Access Velum web-interface and create admin user • Specify Internal Dashboard FQDN (CNAME) • Enable Tiller (for later Helm usage) • Configure the overlay network • Add the SSL certificate of the CA signing the registry and external LDAP certificates • Accept Nodes, Assign Roles • Specify External API FQDN (load balancer for API and DEX) • Specify External Velum FQDN (CNAME) • Run Bootstrap (and now have a cup of coffee ;-))

Deployment Create required Namespaces Create required Users / Groups in LDAP or Connect to Active Directory Create Roles and Role-Assignments Deploy Basic Services • K8s Dashboard • Persistent Storage / Storage Classes • Ingress • Monitoring • Logging Deploy Application • Application based scripts • CI/CD • Helm

Testing 2

Testing - Preparation Create a test plan For every test describe • Starting point • Test details • Expected result When executing the test • Prepare and verify starting point • Execute test • Document the test execution • Document the test results • Compare test results with expectation • Repeat the test several times 2 2

Testing - Fault Tolerance Ensure all fault tolerance tests are done with load on the system Network failure • Single / Multiple NIC • Single / Multiple Switches • Cluster / Public Network Node failure • Admin • Master • Worker

Operations

Life Cycle • New Patches • Create new Stage on Staging System • Assign new Stage to Admin and Nodes • Wait until next day or “transactional-update dup reboot” • Access Velum - reboot admin • Ensure NO Single Pod application runs in the cluster* • Access Velum - reboot all

Monitoring and Logging • Old: cAdvisor, Heapster, InfluxDB, Grafana • New: cAdvisor with Prometheus and Grafana • Alertmanager • Logfile collection and cleanup • Disk space usage • Application Specific Monitoring?

Backup and Recovery (1) Don ´ t do backup and recovery • • Everything that is deployed to the cluster must be 100% reproducible • Use a second cluster for disaster recovery and deploy the application twice • Have proper staging for the application • For persistent data - the application MUST support consistent backup and restore and this can not be done on the k8s side of things • Recommendation: use a GIT or similar source code management system • Disaster Recovery: delete the whole cluster, de-deploy and re-configure the cluster, re-deploy the application and restore the applications data via application functionality

Backup and Recovery (2) • Backup ETCD • LDIF export of openLDAP • Snapshot of Admin VM • Power off everything and snapshot • Kubectl export • GIT / Helm / Yaml File backup and versioning • Backup of Persistent Volumes • Single object restore? • Create an alias for kubectl - -record 2

Questions? 2

Questions? Deployment B loyment Best t Requirements uirements Plan Planning and Sizi and Sizing Practices ctices Testing ting Operations rations

Backup slides

TUT1131 - Best Practices in Deploying SUSE CaaS Platform Martin - PowerPoint PPT Presentation

TUT1131 - Best Practices in Deploying SUSE CaaS Platform Martin Weiss Juan Utande Herrera Senior Architect Infrastructure Solutions Senior Architect Infrastructure Solutions Martin.Weiss@SUSE.com Juan.Herrera@suse.com AGENDA AGEN What

Enabling Business Agility with SUSE CaaS Platform BOV 1078 Rob Knight Jonas Forsberg

SUSE Manager 4 2019 Fall Expert Day North America Why Do I Need SUSE Manager? 11 118 118 8

Simplifying Pillar Management with Forms and Formulas in SUSE Manager TUT1053 Raine Curtis

Introd u cing SUSE Enterprise Storage 5 1 SUSE Enterprise Storage 5 SUSE Enterprise Storage 5 is

SUSE Nils Brauckmann President and General Manager SUSE at a Glance 0m 50s We adapt. You

The Future Of Linux Suspend Stefan Seyfried <seife@suse.de> SuSE Linux Products GmbH -

SUSE Enterprise Storage 6 Darren Soothill EMEA Storage Technical Strategist Agenda

SUSE Manager Under the Hood Silvio Moioli, Developer, moio@suse.com Ral Osuna

Rule breaker! Upgrading an OpenStack Cloud while skipping a release Rick Salevsky Nanuk Krinner

SMB debugging tools the art of hair pulling Aurlien Aptel <aaptel@suse.com> SUSE Who am

Agenda What Is SUSE Enterprise Storage 5.5 Requirements Planning and Sizing

SUSE Manager 174 174 SUSE Manager HEALTH MONITORING Best-in-class open source infrastructure

Deploying And Supporting Perl 6 Jonathan Worthington UKUUG Spring 2007 Conference Deploying And

Stateful workloads on kubernetes with ceph Agenda CaaS Kubernetes

Open source software for the keen file formats Ramn photographer: file formats Casero Caas

Patching at Large with SUSE Manager Marc Laguilharre, Premium Support Engineer,

Headquarters U.S. Air Force I n t e g r i t y - S e r v i c e - E x c e l l e n c e Air Force

Learning Covariant Feature Detectors Karel Lenc and Andrea Vedaldi University of Oxford GMDL

Backscatter Patch Test inter-comparison of systems using shared reference areas for testing,

Advances in Automated Program Repair and a Call to Arms Westley Weimer University of Virginia

SEGMENTATION OF MEDICAL VOLUMES USING CONVOLUTIONAL NEURAL NETWORKS MEDICAL IMAGE SEGMENTATION

Selective Patella Resurfacing and Total Knee Arthroplasty Jay R Lieberman MD Professor and

Total Knee Replacement DR. (PROF.) ANIL ARORA MS (Ortho) DNB (Ortho) Dip SIROT (USA) FAPOA

Patellofemoral Pain Syndrome and Hip Strengthening Candace Gorby, AT, USAW-L1 Kettering Sports

TUT1131 - Best Practices in Deploying SUSE CaaS Platform Martin - PowerPoint PPT Presentation

TUT1131 - Best Practices in Deploying SUSE CaaS Platform Martin Weiss Juan Utande Herrera Senior Architect Infrastructure Solutions Senior Architect Infrastructure Solutions Martin.Weiss@SUSE.com Juan.Herrera@suse.com AGENDA AGEN What

Enabling Business Agility with SUSE CaaS Platform BOV 1078 Rob Knight Jonas Forsberg

SUSE Manager 4 2019 Fall Expert Day North America Why Do I Need SUSE Manager? 11 118 118 8

Simplifying Pillar Management with Forms and Formulas in SUSE Manager TUT1053 Raine Curtis

Introd u cing SUSE Enterprise Storage 5 1 SUSE Enterprise Storage 5 SUSE Enterprise Storage 5 is

SUSE Nils Brauckmann President and General Manager SUSE at a Glance 0m 50s We adapt. You

The Future Of Linux Suspend Stefan Seyfried &lt;seife@suse.de&gt; SuSE Linux Products GmbH -

SUSE Enterprise Storage 6 Darren Soothill EMEA Storage Technical Strategist Agenda

SUSE Manager Under the Hood Silvio Moioli, Developer, moio@suse.com Ral Osuna

Rule breaker! Upgrading an OpenStack Cloud while skipping a release Rick Salevsky Nanuk Krinner

SMB debugging tools the art of hair pulling Aurlien Aptel &lt;aaptel@suse.com&gt; SUSE Who am

Agenda What Is SUSE Enterprise Storage 5.5 Requirements Planning and Sizing

SUSE Manager 174 174 SUSE Manager HEALTH MONITORING Best-in-class open source infrastructure

Deploying And Supporting Perl 6 Jonathan Worthington UKUUG Spring 2007 Conference Deploying And

Stateful workloads on kubernetes with ceph Agenda CaaS Kubernetes

Open source software for the keen file formats Ramn photographer: file formats Casero Caas

Patching at Large with SUSE Manager Marc Laguilharre, Premium Support Engineer,

Headquarters U.S. Air Force I n t e g r i t y - S e r v i c e - E x c e l l e n c e Air Force

Learning Covariant Feature Detectors Karel Lenc and Andrea Vedaldi University of Oxford GMDL

Backscatter Patch Test inter-comparison of systems using shared reference areas for testing,

Advances in Automated Program Repair and a Call to Arms Westley Weimer University of Virginia

SEGMENTATION OF MEDICAL VOLUMES USING CONVOLUTIONAL NEURAL NETWORKS MEDICAL IMAGE SEGMENTATION

Selective Patella Resurfacing and Total Knee Arthroplasty Jay R Lieberman MD Professor and

Total Knee Replacement DR. (PROF.) ANIL ARORA MS (Ortho) DNB (Ortho) Dip SIROT (USA) FAPOA

Patellofemoral Pain Syndrome and Hip Strengthening Candace Gorby, AT, USAW-L1 Kettering Sports

The Future Of Linux Suspend Stefan Seyfried <seife@suse.de> SuSE Linux Products GmbH -

SMB debugging tools the art of hair pulling Aurlien Aptel <aaptel@suse.com> SUSE Who am