Lessons Learned: Building Scalable & Elastic Akka Clusters on - PowerPoint PPT Presentation

Lessons Learned: Building Scalable & Elastic Akka Clusters on Google Managed Kubernetes - Timo Mechler & Charles Adetiloye

About MavenCode MavenCode is a Data Analytics software company offering training, product development, and consulting services in the following areas: Provisioning Scalable Data Processing Pipelines and Cloud Infrastructure Deployment   Development & Deployment of Machine Learning and Artificial Intelligence Platforms Streaming and Big Data Analytics -IoT and Sensors

About The Presenters Timo Mechler (Architect & Product Manager) Decade of experience in the energy commodity markets with particular focus building out scalable research platforms for commodities trading (data collection, data analysis, data modeling). Charles Adetiloye (Lead Data Engineer) Over a decade worth of experience consulting and implementing large scale distributed data processing software platforms across different industry verticals. Previously worked/consulted with Lightbend, Twitter, Monsanto, Starbucks, and a few other startups and Fortune 500 companies.

Moving From “Proactive” to “Reactive” ! Late 1990’s 2000 - (?) 2009 (Akka) 2013 (Docker) 2014 (Kubernetes) SOA - XML, SOAP WSDL Application & Web Servers https://www.reactivemanifesto.org/ - Virtualized Commodity Hardware - More Distributed Spread Out Nodes - Beefed Up Servers - Improved Network IO - Difficult to Scale - Network Admin Functional DevOps Team - Slow Network IO - Few Concurrent Processes - Deployment Nightmare

Containerization & Cloud Orchestration Containerized Microservices Application Stack   Scala + Akka, Some Go & Python Alpine Image Dockerized   Akka, Clustering, Remoting, HTTP, Alpakka DockerSwarm Mesos Kubernetes Orchestration Layer Usability Stability DockerSwarm Mesos Kubernetes Kubernetes Kubernetes Feature Sets Community/ EcoSystem Cloud Infrastructure Layer Here To Stay ? We Work With All 3 Cloud Services And They’re All Great!!!   Amazon Azure Google But We think Google Cloud Platform (GCP) stands out: - Kubernetes was started at Google - If you are doing AI & ML stuff, GCP integration is the best - From a cost perspective with GCP you save a few $$$

Why Did We Go Reactive With Akka ? - High Performance, Resilience and Scalability - Loosely Coupled Messaging System - Active Open Source Developer Community - Battle Tested Framework, Proven Use Cases, Matured but Still Improving (since 2009)

Scalable DataPipeline 3 Schema Registry 2 1 4 *N *N STREAMING ANALYTICS DOMAIN EVENTS SCALABLE AGGREGATE ANALYSIS PUB-SUB MESSAGE QUEUE 8 *N INFERENCING 5 BATCH ROLLUP 1 PREDICTIVE ANALYSIS Events are ingested - Satellite, Telemetry, IoT, etc. 2 Events Processing Queue, Google Pub-Sub/Kafka 7 *N 6 MACHINE LEARNING/ 3 Schema Registry for Event Validation PREDICTIVE MODELING RAW DATA Near Real-time Continuously Streaming Events 4 TEXT/BINARY STORAGE 5 Batch Rollup JOB - Time or Size Rotation: TimeStamped 6 DataStore -> Parquet Compressed on Google Storage or Amazon S3 7 ML Models Generated and Versioned -> Tensorflow, MXNet, Spark MLib 8 Near Real-time Inferencing and Predictive Intelligence

How Do You Scale Your Akka Cluster Pipeline? - Time-Based (GeoSpatial) Scheduled Scaling > Event `always` happen at certain times of the day > We have a rough idea of traffic seasonality, and we can project the future needs > Happens across Timezones, we can always skew our Cluster Workload (Time, Location) - Surge-Based Scaling > Sudden spike in traffic, Due to some external factor or influencer > Delayed Delivery or Batched Delivery

Time-Based Scheduling with Akka Cluster + Kubernetes Config a Cluster Aware Group Router 1 Using Cluster-Aware Group Router akka.actor.deployment { router = round-robin-group routee.paths = [“/telematicsService/ComputeWorkerNode“] cluster { enabled = on allow-local-routees = off CWR use-roles = [“computeWorkRate”] } } CWR CWR CWR Role Out the StatefulSet with the right Akka Actor Role 2 StatefulSets Rollout -> 2.00pm 8.00am BasketBall Rotation Strategy!!! 2.00am StatefulSets Rollout ->

Surge/Spike-Based Scaling with Akka-Cluster & Kubernetes Using Cluster-Aware Pool Routers Startup the Pool Router + Configure it to Startup on Member Nodes in the Cluster 1 akka.actor.deployment { router = round-robin-pool routee.paths = [“/telematicsService/singleton/SignUpNode“] cluster { enabled = on allow-local-routees = off max-nr-instances-per-node = 3 AR use-roles = [“AppRegisteration”] } } AR AR AR Startup a Pod with the right role in AkkaConfig , Configure it for Horizontal Scalability with K8s HorizontalPod Scaling-> 2 metrics: minReplicas: 1 maxReplicas: 10 - type: Resources resource: CPU target: During Spike in Traffic, Pods will be automatically scaled out with the right role config 3 HorizontalPod Scaling->

Cluster Bootstrap with Akka Management & Service Discovery Central “Glue” point for all Akka Management extensions + Management endpoints 1 Kubernetes AWS Marathon Custom Management Endpoints show the status of the Cluster 2 Discovery Discovery Discovery Discovery Akka Service Discovery is like a “LEGO tool box” 3 Akka Discovery Akka Cluster Akka Management AkkaManagement Bootstrap Cluster HTTP

Cluster Bootstrap + Service Discovery with Kubernetes API NAMESPACE=demo_telematics //discovery-config akka.discovery.kubernetes-api { //Akka Management Host HTTP route AkkaManagement Service discovery needs to grab initial 1 pod-label-selector=“clusterName=%s” AkkaManagement(system).start seed nodes `/bootstrap/seed-nodes` pod-namespace=“demo_telematics” api-ca-path=“/app/opt/telematics/serviceaccount/ca.crt” //KickOff ClusterBootStrap In our case, Kubernetes is used for discovery by querying api-ca-token=“/app/opt/telematics/serviceaccount/token” 2 ClusterBootstrap(system).start for all pods with matching `pod-labels` in the config api-service-host-env-name=“KUBERNETES_SERVICE_HOST” api-service-port-env-name=“KUBERNETES_SERVICE_PORT” The Node Probes for existing Cluster, if YES it will Join, } 3 if NO it will create a new cluster //management-config Same Process is Repeated on Other Nodes and if all succeed, akka.management.cluster.bootstrap { 10.0.0.4 4 contact-point-discovery{ then we have a cluster ! service-name=“telematics” 10.0.0.5 10.0.0.6 10.0.0.3 discovery-method=akka.discovery.kubernetes-api 10.0.0.2 } } Looking good so far! Google Cloud Managed Kubernetes But How do I get started?

3-Step Deployment Process 2 1 Docker Registry 3 1 SBT build/package/dockerize your AKKA code 2 SBT Publish to Docker Registry. 3 Helm Deploy to Minikube(DevTest) or GKE (PROD) Google Kubernetes MiniKube

Deployments with Helm Charts We Use HELM for Managing: - Container Packing and Deployment on Kubernetes in Different Environments - Upgrading and Versioning Container Deployments Users go to: app1.rxdemo.com app2rxdemo.com Users e.g Google Cloud Layer 7 Load Balancer Ingress Controller Looks up routing rules to route to the correct services Kubernetes Service Service: App2 Service: App3 Service: App1 Deployments Kubernetes POD Deployments

Quick Demo - Telematics Event Processor on Google Cloud ML PIPELINE REACTIVE PIPELINE MODEL VERSIONS, A|B|C TELEMATIC EVENTS Tire Pressure Location Info ClusterSingletonProxy Fuel Consumption ClusterSingletonProxy Average Speed Prediction SCALABLE ClusterSingletonManager PREDICTIVE PUB-SUB ANALYSIS MESSAGE QUEUE ClusterSingletonProxy WEATHER INFO Google ClusterSingletonManager BIGQUERY Storage gs:/ /

GoogleCloud Kubernetes Setup for Stateful Akka Deployment 1. Create Multi-Zone Cluster gcloud container clusters create telematics-rx18—cluster —zone us-central1-a \ —node-locations us-central1a, us-central1b, us-central1c 2. Create NameSpace for Your Akka Clusters kubectl create namespace ns-telematics 3. Create Service Account kubectl create serviceaccouct sa-telematics -n ns-telematics kubectl get sa-telematics -o json —namespace ns-telematics | jq -r .secrets[].name 4. Grab Service Account Certificate & Token kubectl get secret sa-telematics-token-4478c -o son —namespace ns-telematics | jq -r ‘.data[“ca.crt”] | base64 —decode > ca.crt kubectl get secret sa-telematics-token-4478c -o son —namespace ns-telematics | jq -r ‘.data[“token”] | base64 —decode > token 5. Grant the Right Privilege for the `sa-telematics` Service Account to Query PODs in the namespace kubectl —namespace=kube-system create clusterrolebinding rolebind-telematics - clusterrole=cluster-admin —-serviceaccount=ns-telematics:sa-telematics

Lessons Learned - With the growing number of interconnected devices generating data, infrastructure that can handle elastic data loads is more important than ever - Kubernetes is a stable and continually growing container orchestration framework with an active development support community - Deployment of Akka on Kubernetes is straightforward and helps avoid pitfalls related to scalability latency, and reliance on an external system for orchestration - If you’re not heavily invested in other platforms yet and looking to build a scalable backend + AI & ML integration down the road, it’s worth checking out Google Cloud

Lessons Learned: Building Scalable & Elastic Akka Clusters on - PowerPoint PPT Presentation

Lessons Learned: Building Scalable & Elastic Akka Clusters on Google Managed Kubernetes - Timo Mechler & Charles Adetiloye About MavenCode MavenCode is a Data Analytics software company offering training, product development, and

Lessons Learned Lessons Learned From From Lessons Learned Lessons Learned From From

Internet Server Clusters Internet Server Clusters Jeff Chase Duke University, Department of

Akka Building Distributed Systems for Concurrent, Fault-tolerant and Scalable Java Applications

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software

Making most of Scala Akka, Scala, Spray, Specs2; all in 50 minutes! Jan Machacek Chief whip

I nternational research The evidence on clusters is clear Firms located in clusters are more

Using Kieker with Elastic APM: An Experience Report Valentin Seifermann Duan Okanovi SSP

Monitor your containers with the Elastic Stack Monica Sarbu Monica Sarbu Team lead, Beats team

Cache Coherence in Scalable Machines Scalable Cache Coherent Systems Scalable, distributed

1 [Saito] Key Points Key Points How Do Computers Fail? How Do Computers Fail?

Above the Clouds: Introducing Akka Jonas Bonr CTO Typesafe Twitter: @jboner torsdag 13

Lessons Learned From Sequenced, Integrated Strategies of Economic After Hours Seminar

Some lessons learned from Team Science Some lessons learned from Team Science Lewis Cantley Weill

Opportunities Opportunities Lessons Learned Using Lessons Learned Using Vegetative

OSHA Lessons Learned Adam Fries OSHA Compliance Officer February 13, 2018 OSHA Lessons Learned

Lessons Learned from A Three-Week Lessons Learned from A Three-Week Long User Study w ith

PlanetLab: Evolution vs Intelligent Design in Global Network Infrastructure Larry Peterson

Rumor Spreading and Conductance Flavio Chierichetti Silvio Lattanzi Alessandro Panconesi Sapienza

Centrality, treeness and miscellaneous Social and Technological Networks Rik Sarkar University

Completeness of Resolution Theorem (Completeness) If a set of clauses S has no models then S *

Rethinking kubernetes networking with SRv6 and Contiv-VPP Abdelsalam, Cisco Systems ; ; Daniel

The Synergy of Finite State Machines Yuval Emek Technion Israel Institute of Technology The

Syntactic Theory Tree-Adjoining Grammar (TAG) Yi Zhang Department of Computational Linguistics

Efficient Non-Segregated Routing for Reconfigurable Demand-Aware Networks Thomas Fenz, Klaus-Tycho

Sambuz

Useful Links

Newsletter

Mail Us