Managing Containers with Helix - Kanak Biscuitwala, Jason Zhang - PowerPoint PPT Presentation


slide-1
SLIDE 1

Managing Containers with Helix

Kanak Biscuitwala Jason Zhang Apache Helix Committers @ LinkedIn

helix.apache.org @apachehelix

slide-2
SLIDE 2

Intersection of Job Types

Oracle

DB

Oracle

DB

slide-3
SLIDE 3

Intersection of Job Types

Oracle

DB

Oracle

DB

Backup Backup

slide-4
SLIDE 4

Intersection of Job Types

Oracle

DB

Oracle

DB

Backup Backup HDFS ETL ETL

slide-5
SLIDE 5

Intersection of Job Types

Oracle

DB

Oracle

DB

Backup Backup HDFS ETL ETL

Long-running and batch jobs running together!

slide-6
SLIDE 6

Cloud Deployment

A (online)   B (nearline)   C (batch)

A1 A1 A2 A3 B1 C1 C2 C3 B2 B3 C2 B4 B5 C2 C4

Applications with diverse requirements running together in a datacenter

slide-7
SLIDE 7

Cloud Deployment

A B C A1 A1 A2 A3 B1 C1 C2 C3 B2 B3 C2 B4 B5 C2 C4

Applications with diverse requirements running together in a datacenter

DB Backup ETL

slide-8
SLIDE 8

Processes on Machines

Machine

Container Process VM

slide-9
SLIDE 9

Processes on Machines

Task Task Process No Isolation

Machine

Container Process VM

slide-10
SLIDE 10

Processes on Machines

Task Task Process 128 MB 128 MB 128 MB Process Process Process No Isolation VM-based Isolation

Machine

Container Process VM

slide-11
SLIDE 11

Processes on Machines

Task Task Process 256 MB 64 MB 128 MB 128 MB 128 MB Process Process Process Process Process No Isolation VM-based Isolation Container-based Isolation

Machine

Container Process VM

slide-12
SLIDE 12
  • Run as individual processes

– Poor isolation or poor utilization

  • Virtual machines

– Better isolation – Xen, Hyper-V, ESX, KVM

  • Containers

– cgroup – YARN, Mesos – Super lightweight, dynamic based on application requirements

Processes on Machines

slide-13
SLIDE 13

Processes on Machines

Virtualization and containerization significantly improve process isolation and open up possibilities for efficient utilization of physical resources

slide-14
SLIDE 14

Container-Based Solution

slide-15
SLIDE 15

Container-Based Solution

System Requirements

A B C

64 MB 64 MB 64 MB 128 MB 128 MB 256 MB

slide-16
SLIDE 16

Container-Based Solution

Allocation

64 MB 64 MB 128 MB 256 MB 128 MB 64 MB

Machine

Container

slide-17
SLIDE 17

Container-Based Solution

Allocation

64 MB 64 MB 128 MB 256 MB 128 MB 64 MB

Machine

Container A A A B B C Process

slide-18
SLIDE 18

Container-Based Solution

Allocation

64 MB 64 MB 128 MB 256 MB 128 MB 64 MB

Containerization is powerful!

Machine

Container A A A B B C Process

slide-19
SLIDE 19

Container-Based Solution

Allocation

64 MB 64 MB 128 MB 256 MB 128 MB 64 MB

Containerization is powerful!

Machine

Container A A A B B C Process

But do processes always fit so nicely?

slide-20
SLIDE 20

Over-Utilization

256 MB

Container-Based Solution

Machine

Container Process

slide-21
SLIDE 21

Over-Utilization

256 MB Process 1

Container-Based Solution

Machine

Container Process

slide-22
SLIDE 22

Over-Utilization Outcome: Preemption and relaunch

256 MB Process 1

Container-Based Solution

Machine

Container Process

slide-23
SLIDE 23

Over-Utilization Outcome: Preemption and relaunch

Container-Based Solution

384 MB

Machine

Container Process

slide-24
SLIDE 24

Over-Utilization Outcome: Preemption and relaunch

Container-Based Solution

384 MB Process 1

Machine

Container Process

slide-25
SLIDE 25

Under-Utilization

384 MB 128 MB

Container-Based Solution

Machine

Container Process

slide-26
SLIDE 26

Under-Utilization Outcome: Over-provisioned until restart

384 MB Process 1 128 MB

Container-Based Solution

Machine

Container Process Process 2

slide-27
SLIDE 27

Container-Based Solution

Failure

64 MB 64 MB 128 MB 256 MB 128 MB 64 MB

Machine

Container A A A B B C Process

slide-28
SLIDE 28

Container-Based Solution

Failure

64 MB 64 MB 128 MB 128 MB

Machine

Container A A B B Process

slide-29
SLIDE 29

Container-Based Solution

Failure

64 MB 64 MB 128 MB 128 MB

Outcome: Launch containers elsewhere

Machine

Container A A B B Process 256 MB C 64 MB A

What about stateful systems?

slide-30
SLIDE 30

Container-Based Solution

Failure

64 MB 64 MB 128 MB 256 MB 128 MB 64 MB

Machine

Container SLAVE SLAVE MASTER B B C Process

slide-31
SLIDE 31

Container-Based Solution

Failure

64 MB 64 MB 128 MB 128 MB

Without additional information, the master is unavailable until restart

Machine

Container SLAVE SLAVE B B Process

slide-32
SLIDE 32

Scaling

Container-Based Solution

Machine

Container Process 256 MB 50% 256 MB 50%

slide-33
SLIDE 33

Scaling

Container-Based Solution

Machine

Container Process

slide-34
SLIDE 34

Scaling

Container-Based Solution

Machine

Container Process 128 MB 33% 128 MB 33% 128 MB 33%

Outcome: Relaunch with new sharding

slide-35
SLIDE 35

Container-Based Solution

Utilization: Application requirements define container size
Fault Tolerance: New container is started
Scaling: Workload is repartitioned and new containers are brought up
Discovery: Existence

slide-36
SLIDE 36

Container-Based Solution

We need something finer-grained: the container model provides flexibility within machines, but assumes homogeneity of tasks within containers

slide-37
SLIDE 37

Task-Based Solution

slide-38
SLIDE 38

Task-Based Solution

System Requirements

A: complete in less than 5 hours
B: always have 2 containers running
C: response time should be less than 50 ms

slide-39
SLIDE 39

Task-Based Solution

Allocation

Machine

Container A A B Task B C C

slide-40
SLIDE 40

Over-Utilization

Task-Based Solution

Machine

Container Task

slide-41
SLIDE 41

Over-Utilization

Task-Based Solution

Task 1

Machine

Container Task

slide-42
SLIDE 42

Over-Utilization

Task-Based Solution

Task 1

Machine

Container Task

slide-43
SLIDE 43

Over-Utilization

Task-Based Solution

Task 1

Machine

Container Task Task 1

slide-44
SLIDE 44

Over-Utilization

Task-Based Solution

Hide the overhead of a container restart

Machine

Container Task Task 1

slide-45
SLIDE 45

Under-Utilization

384 MB 128 MB

Task-Based Solution

Machine

Container Task

slide-46
SLIDE 46

Under-Utilization

384 MB Task 1 128 MB Task 2

Task-Based Solution

Machine

Container Task

slide-47
SLIDE 47

Under-Utilization Optimize container allocations based on usage

384 MB Task 1 Task 2

Task-Based Solution

Machine

Container Task

slide-48
SLIDE 48

Task-Based Solution

Failure

Task 1 Leader Task 2 Leader Task 3 Leader Task 2 Standby Task 3 Standby Task 1 Standby Task 2 Standby Task 1 Standby Task 3 Standby

Machine

Container

slide-49
SLIDE 49

Task-Based Solution

Failure

Task 1 Leader Task 2 Leader Task 2 Standby Task 3 Standby Task 1 Standby Task 3 Standby Task 3 Leader

Machine

Container

slide-50
SLIDE 50

Task-Based Solution

Failure Some systems cannot wait for new containers to start

Task 1 Leader Task 2 Leader Task 2 Standby Task 3 Standby Task 1 Standby Task 3 Standby Task 3 Leader

Machine

Container

slide-51
SLIDE 51

Task-Based Solution

Discovery

Task 1 Leader Task 2 Leader Task 2 Standby

Machine

Container

Task 1: Leader at N1 Standby at N2

Task 1 Standby

Task 2: Leader at N2 Standby at N1 N1 N2

slide-52
SLIDE 52

Task-Based Solution

Discovery

Task 1 Leader Task 2 Leader Task 2 Standby

Machine

Container

Learn where everything runs, and what state each task is in

Task 1: Leader at N1 Standby at N2

Task 1 Standby

Task 2: Leader at N2 Standby at N1 N1 N2

slide-53
SLIDE 53

Scaling

Task-Based Solution

T4 T5 T6 T1 T2 T3

Machine

Container Task

slide-54
SLIDE 54

Scaling

Task-Based Solution

T4 T5 T6 T1 T2 T3

Machine

Container Task

slide-55
SLIDE 55

Scaling

Task-Based Solution

T4 T5 T6 T1 T2 T3

Machine

Container Task

slide-56
SLIDE 56

Scaling

Task-Based Solution

T4 T5 T6 T1 T2 T3

Machine

Container Task

slide-57
SLIDE 57

Comparing Solutions

Utilization
  Container solution: Application requirements define container size
  Task + container solution: Tasks are distributed as needed to a minimal container set as per SLA

Fault Tolerance
  Container solution: New container is started
  Task + container solution: Existing task can assume a new state while waiting for the new container

Scaling
  Container solution: Workload is repartitioned and new containers are brought up
  Task + container solution: Tasks are moved across containers

Discovery
  Container solution: Existence
  Task + container solution: Existence and state

slide-58
SLIDE 58

Benefits of a Task-Based Solution

Comparing Solutions

  • Container reuse
  • Minimize overhead of container relaunch
  • Fine-grained scheduling

slide-59
SLIDE 59

Benefits of a Task-Based Solution

Comparing Solutions

  • Container reuse
  • Minimize overhead of container relaunch
  • Fine-grained scheduling

Task : Container :: Thread : Process
The task is the right level of abstraction

slide-60
SLIDE 60

Working at task granularity is powerful. We need a reactive approach to resource assignment.

Comparing Solutions

slide-61
SLIDE 61

Working at task granularity is powerful. We need a reactive approach to resource assignment. How can Helix help?

Comparing Solutions

slide-62
SLIDE 62

Working at task granularity is powerful. We need a reactive approach to resource assignment. How can Helix help?

Comparing Solutions

YARN/Mesos: containers bring flexibility in a machine Helix: tasks bring flexibility in a container

slide-63
SLIDE 63

Task Management with Helix

slide-64
SLIDE 64

Application Lifecycle

Capacity Planning: Allocating physical resources for your load
Provisioning: Deploying and launching tasks
Fault Tolerance: Staying available, ensuring success
State Management: Determining what code should be running and where

slide-65
SLIDE 65

Helix Overview

Cluster Roles: Controller, Nodes (Participants), and Spectators; controllers manage the tasks running on nodes

slide-66
SLIDE 66

Helix Controller

High-Level Overview

The Rebalancer takes the Nodes and the Constraints ("single master", "no more than 3 tasks per machine") and produces a Task Assignment

slide-67
SLIDE 67

Helix Controller

Rebalancer

ResourceAssignment computeResourceMapping(
    RebalancerConfig rebalancerConfig,
    ResourceAssignment prevAssignment,
    Cluster cluster,
    ResourceCurrentState currentState);

Based on the current nodes in the cluster and constraints, find an assignment of task to node
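To make that job concrete, here is a minimal, self-contained sketch of what a rebalancer computes. This is not the Helix API; the class name, method name, and the simple string-based task/node model are all illustrative. It greedily assigns tasks to nodes under a "no more than N tasks per machine" constraint:

```java
import java.util.*;

public class SimpleRebalancer {
    // Greedy assignment: walk the task list, placing each task on the first
    // node that still has capacity under the per-node constraint.
    public static Map<String, String> computeMapping(
            List<String> tasks, List<String> nodes, int maxTasksPerNode) {
        Map<String, String> assignment = new LinkedHashMap<>();
        Map<String, Integer> load = new HashMap<>();
        for (String task : tasks) {
            for (String node : nodes) {
                int current = load.getOrDefault(node, 0);
                if (current < maxTasksPerNode) {
                    assignment.put(task, node);
                    load.put(node, current + 1);
                    break; // task placed; move on to the next task
                }
            }
        }
        return assignment; // tasks that fit nowhere are simply left unassigned
    }
}
```

The real Helix rebalancer additionally honors state constraints and tries to minimize movement relative to the previous assignment; this sketch only captures the capacity side.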

slide-68
SLIDE 68

Helix Controller

Rebalancer

ResourceAssignment computeResourceMapping(
    RebalancerConfig rebalancerConfig,
    ResourceAssignment prevAssignment,
    Cluster cluster,
    ResourceCurrentState currentState);

Based on the current nodes in the cluster and constraints, find an assignment of task to node

What else do we need?

slide-69
SLIDE 69

Helix Controller

What is Missing?

  • Dynamic container allocation
  • Container isolation
  • Automated service deployment
  • Resource utilization monitoring

slide-70
SLIDE 70

Helix Controller

Target Provider

Based on some constraints, determine how many containers are required in this system

Fixed CPU Memory Bin Packing

We’re working on integrating with monitoring systems in order to query for usage information

slide-71
SLIDE 71

Helix Controller

Target Provider

Based on some constraints, determine how many containers are required in this system

TargetProviderResponse evaluateExistingContainers(
    Cluster cluster,
    ResourceId resourceId,
    Collection<Participant> participants);

class TargetProviderResponse {
  List<ContainerSpec> containersToAcquire;
  List<Participant> containersToRelease;
  List<Participant> containersToStop;
  List<Participant> containersToStart;
}

Fixed CPU Memory Bin Packing

We’re working on integrating with monitoring systems in order to query for usage information
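As an illustration of the "Fixed" policy, the following toy sketch compares a fixed target count against the currently running containers and decides what to acquire or release. All names here (FixedTargetProvider, Response, the string container ids) are assumptions for illustration, not the actual Helix interfaces:

```java
import java.util.*;

public class FixedTargetProvider {
    // Simplified stand-in for TargetProviderResponse.
    public static class Response {
        public int containersToAcquire;
        public List<String> containersToRelease = new ArrayList<>();
    }

    // Compare the running set against a fixed target count.
    public static Response evaluate(List<String> running, int target) {
        Response r = new Response();
        if (running.size() < target) {
            // Short of the target: ask for the difference.
            r.containersToAcquire = target - running.size();
        } else {
            // At or over the target: release the surplus from the tail.
            r.containersToRelease =
                new ArrayList<>(running.subList(target, running.size()));
        }
        return r;
    }
}
```

The CPU, Memory, and Bin Packing policies would make the same acquire/release decision, but driven by usage metrics rather than a fixed count.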

slide-72
SLIDE 72

Helix Controller

Adding a Target Provider

Rebalancer Task Assignment

Constraints Nodes

Target Provider

slide-73
SLIDE 73

Helix Controller

Adding a Target Provider

Rebalancer Task Assignment

Constraints Nodes

Target Provider

How do we use the target provider response?

slide-74
SLIDE 74

Helix Controller

Container Provider

Given the container requirements, ensure that number of containers is running

YARN Mesos Local

slide-75
SLIDE 75

Helix Controller

Container Provider

Given the container requirements, ensure that number of containers is running

ListenableFuture<ContainerId> allocateContainer(ContainerSpec spec);

ListenableFuture<Boolean> deallocateContainer(ContainerId containerId);

ListenableFuture<Boolean> startContainer(ContainerId containerId, Participant participant);

ListenableFuture<Boolean> stopContainer(ContainerId containerId);

YARN Mesos Local
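A toy "Local" provider can make the allocate/start/stop/deallocate lifecycle concrete. This sketch is a stand-in, not the Helix interface: it swaps Guava's ListenableFuture for the JDK's CompletableFuture so it runs without dependencies, and tracks containers in an in-memory map rather than launching processes:

```java
import java.util.*;
import java.util.concurrent.CompletableFuture;

public class LocalContainerProvider {
    // id -> started? (present in the map means allocated)
    private final Map<String, Boolean> containers = new HashMap<>();
    private int nextId = 0;

    // memoryMb stands in for a full ContainerSpec.
    public CompletableFuture<String> allocateContainer(int memoryMb) {
        String id = "container-" + (nextId++);
        containers.put(id, false); // allocated but not yet started
        return CompletableFuture.completedFuture(id);
    }

    public CompletableFuture<Boolean> startContainer(String id) {
        if (!containers.containsKey(id)) {
            return CompletableFuture.completedFuture(false); // never allocated
        }
        containers.put(id, true);
        return CompletableFuture.completedFuture(true);
    }

    public CompletableFuture<Boolean> stopContainer(String id) {
        if (!Boolean.TRUE.equals(containers.get(id))) {
            return CompletableFuture.completedFuture(false); // not running
        }
        containers.put(id, false);
        return CompletableFuture.completedFuture(true);
    }

    public CompletableFuture<Boolean> deallocateContainer(String id) {
        return CompletableFuture.completedFuture(containers.remove(id) != null);
    }
}
```

A YARN or Mesos implementation would back the same four calls with Resource Manager container requests or Mesos offer responses; the futures let the controller continue reacting to cluster events while allocation is in flight.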

slide-76
SLIDE 76

Helix Controller

Adding a Container Provider

Rebalancer Task Assignment

Constraints Nodes

Target Provider Container Provider

Target Provider + Container Provider = Provisioner

slide-77
SLIDE 77

Application Lifecycle

Capacity Planning: Target Provider
Provisioning: Container Provider
Fault Tolerance: Existing Helix controller (enhanced by Provisioner)
State Management: Existing Helix controller (enhanced by Provisioner)

With Helix and the Task Abstraction

slide-78
SLIDE 78

System Architecture

slide-79
SLIDE 79

System Architecture

Resource Provider

slide-80
SLIDE 80

System Architecture

submit job Resource Provider Client

slide-81
SLIDE 81

System Architecture

submit job Resource Provider Controller Container

Provisioner Rebalancer

Client

App Launcher

slide-82
SLIDE 82

System Architecture

submit job Resource Provider Controller Container

Provisioner Rebalancer

Client container request

App Launcher

slide-83
SLIDE 83

System Architecture

submit job Resource Provider Controller Container

Provisioner Rebalancer

Client container request Participant Container

Participant Launcher Helix Participant App App Launcher

slide-84
SLIDE 84

System Architecture

submit job Resource Provider Controller Container

Provisioner Rebalancer

Client container request Participant Container

Participant Launcher Helix Participant App App Launcher

assign tasks

slide-85
SLIDE 85

HDFS/Common Area

Helix + YARN

YARN Architecture

Client Resource Manager Application Master Container Node Manager Node Manager submit job node status node status container request assign work status App Package grab package

slide-86
SLIDE 86

HDFS/Common Area

Helix + YARN

Helix + YARN Architecture

Client Resource Manager Application Master Container Node Manager Node Manager submit job node status node status container request assign tasks status

Helix Controller Rebalancer Helix Participant App

App Package grab package

slide-87
SLIDE 87

HDFS/Common Area

Scheduler Slave

Helix + Mesos

Mesos Architecture

Scheduler Mesos Master Slave Machine Slave Machine Mesos Slave Mesos Slave

offer resources

node status node status Mesos Executor grab executor Executor Package

offer response
slide-88
SLIDE 88

Scheduler Slave

Helix Controller

Helix + Mesos

Helix + Mesos Architecture

Scheduler Mesos Master Slave Machine Slave Machine Mesos Slave Mesos Slave

offer resources

node status node status assign tasks

HDFS/Common Area

Mesos Executor grab executor Helix Executor Package

offer response

Helix Participant/App

slide-89
SLIDE 89

Example

slide-90
SLIDE 90

Distributed Document Store

Overview

Oracle

Partition 0 Partition 1 Partition 2

Oracle

Partition 0 Partition 1 Partition 2

P1 Backup P2 Backup HDFS ETL ETL

Master Slave

Oracle

Partition 0 Partition 1 Partition 2

P0 Backup ETL

slide-91
SLIDE 91

Distributed Document Store

Overview

Oracle

Partition 0 Partition 1 Partition 2

Oracle

Partition 0 Partition 1 Partition 2

P1 Backup P2 Backup HDFS ETL ETL

Master Slave

P0 Backup

Partition 0 Partition 1 Partition 2

slide-92
SLIDE 92

Distributed Document Store

YARN Example

Client Resource Manager submit job container request assign work status node status Application Master Node Manager

Helix Controller Rebalancer

Container Node Manager node status

Helix Participant

Oracle Partition 0 Partition 1

P1 Backup ETL

slide-93
SLIDE 93

YAML Specification

appConfig: { config: { k1: v1 } }
appPackageUri: 'file://path/to/myApp-pkg.tar'
appName: myApp
services: [DB, ETL]  # the task containers
serviceConfigMap: { DB: { num_containers: 3, memory: 1024 }, ...
                    ETL: { time_to_complete: 5h, ... }, ... }
servicePackageURIMap: { DB: 'file://path/to/db-service-pkg.tar', ... }
...

Distributed Document Store

slide-94
SLIDE 94

YAML Specification

appConfig: { config: { k1: v1 } }
appPackageUri: 'file://path/to/myApp-pkg.tar'
appName: myApp
services: [DB, ETL]  # the task containers
serviceConfigMap: { DB: { num_containers: 3, memory: 1024 }, ...
                    ETL: { time_to_complete: 5h, ... }, ... }
servicePackageURIMap: { DB: 'file://path/to/db-service-pkg.tar', ... }
...

Distributed Document Store

TargetProvider specification

slide-95
SLIDE 95

Service/Container Implementation

public class MyQueuerService extends StatelessParticipantService {
  @Override
  public void init() { ... }

  @Override
  public void onOnline() { ... }

  @Override
  public void onOffline() { ... }
}

Distributed Document Store

slide-96
SLIDE 96

Task Implementation

public class BackupTask extends Task {
  @Override
  public ListenableFuture<Status> start() { ... }

  @Override
  public ListenableFuture<Status> cancel() { ... }

  @Override
  public ListenableFuture<Status> pause() { ... }

  @Override
  public ListenableFuture<Status> resume() { ... }
}

Distributed Document Store
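The start/cancel contract above can be sketched with plain JDK futures. This is a hypothetical stand-in (the SketchBackupTask name, Status enum, and chunk loop are all illustrative, and the real interface also has pause/resume): a task that checks a cancellation flag between units of work so it can stop promptly without being killed:

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.atomic.AtomicBoolean;

public class SketchBackupTask {
    public enum Status { COMPLETED, CANCELLED }

    private final AtomicBoolean cancelled = new AtomicBoolean(false);

    // start() returns a future that completes when the work finishes
    // or when cancellation is observed between chunks.
    public CompletableFuture<Status> start(int chunks) {
        return CompletableFuture.supplyAsync(() -> {
            for (int i = 0; i < chunks; i++) {
                if (cancelled.get()) {
                    return Status.CANCELLED; // honor cancellation between chunks
                }
                // ... copy one chunk of the backup here ...
            }
            return Status.COMPLETED;
        });
    }

    public void cancel() {
        cancelled.set(true);
    }
}
```

Cooperative cancellation like this is what lets the controller move a task to another container without waiting for a hard kill.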

slide-97
SLIDE 97

Distributed Document Store

State Model-Style Callbacks

public class StoreStateModel extends StateModel {
  public void onBecomeMasterFromSlave() { ... }

  public void onBecomeSlaveFromMaster() { ... }

  public void onBecomeSlaveFromOffline() { ... }

  public void onBecomeOfflineFromSlave() { ... }
}
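These callbacks imply an OFFLINE/SLAVE/MASTER state machine in which only the listed transitions are legal (e.g. a replica must pass through SLAVE on its way to MASTER). A minimal sketch of enforcing that transition table, with illustrative names rather than the Helix StateModel API:

```java
import java.util.*;

public class MasterSlaveStateModel {
    // Legal transitions, mirroring the four onBecome* callbacks above.
    private static final Map<String, Set<String>> LEGAL = new HashMap<>();
    static {
        LEGAL.put("OFFLINE", new HashSet<>(Arrays.asList("SLAVE")));
        LEGAL.put("SLAVE", new HashSet<>(Arrays.asList("MASTER", "OFFLINE")));
        LEGAL.put("MASTER", new HashSet<>(Arrays.asList("SLAVE")));
    }

    private String state = "OFFLINE";

    public String getState() { return state; }

    // Moves and returns true if the transition is legal; otherwise the
    // state is left unchanged (the controller would never request it).
    public boolean transitionTo(String target) {
        if (LEGAL.getOrDefault(state, Collections.emptySet()).contains(target)) {
            state = target;
            return true;
        }
        return false;
    }
}
```

It is this table that lets the controller promote an existing SLAVE to MASTER on failure instead of waiting for a fresh container.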

slide-98
SLIDE 98

class RoutingLogic {
  public void write(Request request) {
    partition = getPartition(request.key);
    List<Participant> nodes =
        routingTableProvider.getInstance(partition, "MASTER");
    nodes.get(0).write(request);
  }

  public void read(Request request) {
    partition = getPartition(request.key);
    List<Participant> nodes =
        routingTableProvider.getInstance(partition);
    random(nodes).read(request);
  }
}

Spectator (for Discovery)

Distributed Document Store

slide-99
SLIDE 99

Helix at LinkedIn

slide-100
SLIDE 100

Helix at LinkedIn

Oracle Oracle Oracle

DB

Change Capture

Change Consumers Index Search Index User Writes

Data Replicator

Backup/Restore

In Production

ETL

HDFS

Analytics

slide-101
SLIDE 101

Helix at LinkedIn

In Production

Over 1000 instances covering over 30000 database partitions
Over 1000 instances for change capture consumers
As many as 500 instances in a single Helix cluster
(All numbers are per datacenter)

slide-102
SLIDE 102

Summary

  • The container abstraction has become a huge win
  • With Helix, we can go a step further and make tasks the unit of work
  • With the TargetProvider and ContainerProvider abstractions, any popular provisioner can be plugged in

slide-103
SLIDE 103

Questions?

Jason: zzhang@apache.org
Kanak: kanak@apache.org
Website: helix.apache.org
Dev Mailing List: dev@helix.apache.org
User Mailing List: user@helix.apache.org
Twitter: @apachehelix

slide-104
SLIDE 104