dns belgium in the cloud
play

DNS Belgium in the cloud ICANN Tech Day 2017-03-13 - PowerPoint PPT Presentation

DNS Belgium in the cloud ICANN Tech Day 2017-03-13 maarten.bosteels@dnsbelgium.be What did we do ? Migrated to Amazon Web Services Re-built entire registration platform from code Took down the wall between Ops & Dev Main


  1. DNS Belgium in the cloud ICANN Tech Day – 2017-03-13 maarten.bosteels@dnsbelgium.be

  2. What did we do ? • Migrated to Amazon Web Services • Re-built entire registration platform from code • Took down the wall between Ops & Dev

  3. Main drivers for change • Configuration drift (Test vs. Prod) • Long lead times (eg. patching) • Difficult hand-overs Dev-Ops • Infrequent deployments • Lots of fire fighting, little time for fire prevention • Aging hardware

  4. Classic model ? CUSTOMER SUPPORT RAR RANT Modules Registration System Focus Dev Third party software Application servers Virtual Machines Security RDBMS Hypervisor Focus Ops Host OS Server Hardware Storage In-house Networking Out-sourced Power Physical space Connectivity

  5. Engineering = Dev + Ops + QA Strategy • Multi-functional agile teams • Focus on upper layers of the stack • Infrastructure-as-code => reproducible & testable • Continuous Delivery => small amounts of change & early feedback. • Dev & Ops both confronted with quality of their work • Design for failure: resilient, self-monitoring, self-healing

  6. Status early 2015 • Last hardware renewal : 2011 • Big bang migration • New hardware / network design / storage solution / colo • Lots of vendors to manage • Go for another big bang ? • Do we really need our own hardware ? • Why not use the cloud ?

  7. Extra layer RAR RANT CUSTOMER SUPPORT Modules Registration System Focus Security Third party software Application servers Engineering Orchestration & Config Mgmt In-house Virtual Machines Out-sourced RDBMS Hypervisor Security Host OS Storage Server Hardware Networking Power Physical space Connectivity

  8. Initial assessment of AWS • Initial tests: • Get to know AWS services • Proof-of-concept • Risk assessment • Technically feature complete ? • Confidentiality, Integrity, Availability ? • Legal risk assessment • Performance tests • Cost assessment • Man days • €

  9. Conclusion of assessment • Software-defined everything Avoid configuration drift o Infra predictable & documented => increased security o • Encryption all data in transit + data at rest • IaaS = enabler to focus on core business No need for home-grown HA solutions o Use well-designed services with built-in redundancy o Underlying services keep improving ‘for free’ o • Pay what you use Dev & Test environments : business hours only o Easily scale up / down o

  10. Infra-as-code: building blocks Configuration + Monitoring + … Puppet Pulp (rpm repo) In-house software Third-party software Access rules VMs Load-balancers Cloudformation Disk volumes Network layout RDBMS Git repos: • Puppet modules + config • In-house software • Cloudformation templates

  11. Overview environments

  12. High availability • All components • distributed over 2 availability zones within one AWS Region • active-active • behind Elastic Load Balancers • Intelligent health checks • Share content via RDBMS or via EFS (= NFS like) • All RDBMS instances in multi-AZ mode

  13. Oracle – multi-AZ RDS Datacenter - Diegem Datacenter - Nossegem HQ (Leuven) Oracle RAC DMS Disaster Recovery Stand-by node 2 node 1 Ireland applications applications Primary RDS Stand-by RDS • On-prem Both RAC nodes in same DC o Manual fail-over to stand-by instance o • AWS: multi-AZ RDS Synchronous Replication o Automatic & Transparent Fail-Over o Availability Zone 1 Availability Zone 2

  14. RDS & Database migration • Amazon RDS = enormous time saver ! • No OS level access on Amazon RDS => DataGuard etc not an option • Amazon Database Migration Service (DMS) too immature for the migration • Used complex Oracle Datapump export / import sequence instead • Temporarily up-scaled Oracle instance • Final export / transfer / import / verify : 2.5h

  15. Experience so far • RAR’s dealt well with change of IP addresses • Overall satisfied with quality of service & docs • No performance issues • Not impacted by S3 outage in US

  16. Next step– Full DR site in another region HQ (Leuven) • Keep DB’s + git in sync with main site • In case of region failure § Create resources from code DMS § Switch entry points via DNS Disaster Recovery DMS Ireland Frankfurt Primary RDS Primary RDS Stand-by RDS Stand-by RDS Availability Zone 1 Availability Zone 1 Availability Zone 2 Availability Zone 2

  17. Next steps • Disaster Recovery site in another region • Fully automate Continuous Delivery Pipeline • Blue / Green deployments • Nameservers in the cloud ? • Multi-cloud ? • Serverless architecture ?

  18. The team

  19. Questions ?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend