SLIDE 1

An opportunistic HTCondor pool inside an interactive-friendly Kubernetes cluster

Presented by Igor Sfiligoi, UCSD, for the PRP team

HTCondor Week, May 2019

SLIDE 2

Outline

  • Where do I come from?
  • What we did
  • How is it working?
  • Looking ahead

SLIDE 3

The Pacific Research Platform

  • The PRP was originally created as a regional networking project
  • Establishing end-to-end links between 10Gbps and 100Gbps

SLIDE 4

The Pacific Research Platform

  • The PRP was originally created as a regional networking project
  • Establishing end-to-end links between 10Gbps and 100Gbps
  • Expanded nationally since
  • And beyond, too

[Figure: map of the PRPv2 Nautilus cluster. 40G FIONA nodes across the US (UIUC, U Hawaii, NCAR-WY, UWashington, UIC, Internet2 Chicago, NYC and Kansas City; 160-192TB each), transoceanic 10G/35TB nodes at UvA (Netherlands), KISTI (Korea), U of Guam and U of Queensland (Australia), plus Asia-Pacific RP partners including Singapore, connected via the CENIC/Pacific Wave link.]

SLIDE 5

The Pacific Research Platform

  • Recently the PRP evolved into a major resource provider, too
  • Because scientists really need more than bandwidth tests
  • They need to share their data at high speed and compute on it, too
  • The PRP now also provides
  • Extensive compute power – about 330 GPUs and 3.5k CPU cores
  • A large distributed storage area – about 2 PBytes
  • Select user communities now directly use all the resources the PRP has to offer
  • Still doing all the network R&D in the same setup, too
  • We call it the Nautilus cluster

SLIDE 6

Kubernetes as a resource manager

  • Industry standard – large and active development and support community
  • Container based – more freedom for users
  • Flexible scheduling – allows for easy mixing of service and user workloads

SLIDE 7

Designed for interactive use

  • Users expect to get what they need when they need it – makes for very happy users
  • And use is typically short in duration – so congestion happens only very rarely

SLIDE 8

Opportunistic use

  • No congestion → idle compute resources → time for opportunistic use

SLIDE 9

Kubernetes priorities

  • Priorities natively supported in Kubernetes – low priority pods only start if there is no demand from higher priority ones
  • Preemption out of the box – low priority pods are killed the moment a high priority pod needs the resources
  • Perfect for opportunistic use – just keep enough low-priority pods in the system

https://kubernetes.io/docs/concepts/configuration/pod-priority-preemption/
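As a minimal sketch of how this maps onto manifests (the class name, value and images are illustrative, not the actual Nautilus configuration): a low-valued PriorityClass for the opportunistic pods, referenced via priorityClassName in the pod spec.

    apiVersion: scheduling.k8s.io/v1
    kind: PriorityClass
    metadata:
      name: opportunistic              # hypothetical class name
    value: -10                         # below the default of 0: yields to all normal pods
    globalDefault: false
    description: Preemptible opportunistic filler pods
    ---
    apiVersion: v1
    kind: Pod
    metadata:
      name: opportunistic-example
    spec:
      priorityClassName: opportunistic # starts only when nothing else wants the resources
      containers:
      - name: worker
        image: busybox                 # placeholder workload
        command: ["sleep", "3600"]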


SLIDE 10

HTCondor as the OSG helper

PRP wanted to give opportunistic resources to Open Science Grid (OSG) users
  • Since they can tolerate preemption

But OSG does not have native support for Kubernetes
  • It supports only resources provided by batch systems

We thus instantiated an HTCondor pool
  • As a fully Kubernetes/Containerized deployment

SLIDE 11

HTCondor in a (set of) container(s)

Putting HTCondor in a set of containers is not hard
  • Just create an image with HTCondor binaries in it!
  • Configuration injected through Kubernetes pod config

HTCondor deals nicely with ephemeral IPs
  • The Collector must be discoverable – Kubernetes service
  • Everything else just works from there

Persistency needed for the Schedd(s)
  • And potentially for the Negotiator, if long term accounting is desired
  • Everything else can live off ephemeral storage
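A minimal sketch of the two Kubernetes pieces mentioned above, with hypothetical names and namespace (the actual Nautilus manifests are not shown in the slides): a ConfigMap that injects the HTCondor configuration into the pods, and a Service that gives the Collector a stable, discoverable address.

    apiVersion: v1
    kind: ConfigMap
    metadata:
      name: condor-config                  # hypothetical name
      namespace: htcondor                  # hypothetical namespace
    data:
      condor_config.local: |
        # All daemons find the collector via its stable Service DNS name
        CONDOR_HOST = condor-collector.htcondor.svc.cluster.local
        DAEMON_LIST = MASTER, STARTD
    ---
    apiVersion: v1
    kind: Service
    metadata:
      name: condor-collector               # stable name + routing IP for the collector pod
      namespace: htcondor
    spec:
      selector:
        app: condor-collector
      ports:
      - port: 9618                         # HTCondor's default port
        protocol: TCP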


SLIDE 12

Service vs Opportunistic

Collector and Schedd(s) deployed as high priority service pods
  • Should be running at all times
  • Few pods, and not heavy CPU or GPU users, so OK
  • Using a Kubernetes Deployment to restart the pods in case of HW problems and/or maintenance
  • Kubernetes Service used to get a persistent routing IP to the collector pod

Startds deployed as low priority pods – pure opportunistic
  • Hundreds of pods in the Kubernetes queue at all times, many in Pending state
  • HTCondor Startd configured to accept jobs as soon as it starts and forever after
  • If a pod is preempted, HTCondor gets a SIGTERM and has a few seconds to go away
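Putting the pieces together, a low-priority startd Deployment might look roughly like this (image name, priority class and replica count are illustrative):

    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: condor-startd
      namespace: htcondor
    spec:
      replicas: 200                            # keep plenty queued; extras just sit Pending
      selector:
        matchLabels:
          app: condor-startd
      template:
        metadata:
          labels:
            app: condor-startd
        spec:
          priorityClassName: opportunistic     # the hypothetical low-priority class from above
          terminationGracePeriodSeconds: 10    # the SIGTERM-to-SIGKILL window on preemption
          containers:
          - name: startd
            image: example/htcondor-startd:latest   # placeholder image with HTCondor binaries
            resources:
              limits:
                cpu: "2"
                nvidia.com/gpu: 1              # when offering GPU slots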


SLIDE 13

Then came the users

Everything was working nicely, until we let in real users
  • Well, until we had more than a single user

OSG users got used to relying on Containers
  • So they can use any weird software they like

But the HTCondor Startd is already running inside a container!
  • Cannot launch a user-provided container
  • Not without elevated privileges

So I need to provide user-specific execute pods
  • How many of each kind?


SLIDE 15

Dealing with many opportunistic pod types

Having idle Startd pods is not OK anymore
  • A different kind of pod could use that resource
  • A glidein-like setup would solve that

Keeping pods without users is not OK anymore
  • They will just terminate without ever running a job
  • Who should regulate the "glidein pressure"?

How do I manage fair share between different pod types?
  • Kubernetes scheduling is basically just priority-FIFO

How am I to know what Container images users want?
  • Ideally, HTCondor should have native Kubernetes support

SLIDE 16

Dealing with many opportunistic pod types (cont.)

I know how to implement this.

SLIDE 17

Dealing with many opportunistic pod types (cont.)

I was told this is coming.

SLIDE 18

Dealing with many opportunistic pod types (cont.)

In OSG-land, glideinWMS solves this for me.

SLIDE 19

Dealing with many opportunistic pod types (cont.)

No concrete plans on how to address these yet.

SLIDE 20

Dealing with many opportunistic pod types

For now, I just periodically adjust the balance
  • A completely manual process

Currently supporting only a few, well-behaved users
  • Maybe not optimal, but good enough

But looking forward to a more automated future
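In practice the manual knob is just the replica count on each user-specific startd Deployment; rebalancing amounts to a hand-run command along these lines (the deployment name and count are made up for illustration):

    kubectl scale deployment startd-groupA --replicas=150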


SLIDE 21

Are side-containers an option?

Ideally, I do want to use user-provided, per-job Containers
  • Running HTCondor and user jobs in separate pods is not an option, due to the opportunistic nature

But Kubernetes pods are made of several Containers
  • Could I run HTCondor in a dedicated Container?
  • Then start the user job in a side-container?

Pretty sure this is currently not supported
  • But, at least in principle, it fits the architecture
  • Would also need HTCondor native support

[Diagram: a single pod containing an HTCondor container and a user job container]
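In sketch form, the idea would be a single pod whose spec lists the two containers side by side (purely illustrative, since, as noted above, nothing currently drives the second container's lifecycle from HTCondor):

    apiVersion: v1
    kind: Pod
    metadata:
      name: execute-pod                      # hypothetical
    spec:
      priorityClassName: opportunistic       # still a preemptible, low-priority pod
      containers:
      - name: htcondor                       # runs the startd and talks to the pool
        image: example/htcondor-startd:latest
      - name: user-job                       # user-provided, per-job container
        image: example/user-analysis:latest
        command: ["sleep", "3600"]           # placeholder; both containers share the pod's network namespace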


SLIDE 22

Will nested Containers be a reality soon?

It has been pointed out to me that the latest CentOS supports unprivileged Singularity
  • Have not tried it out yet
  • Probably I should

Cannot currently assume all of my nodes have a recent-enough kernel
  • But we will eventually get there

SLIDE 23

Looking ahead

  • Looking forward to a more automated future
  • Will do what I have to myself
  • Would be happier if I could use off-the-shelf solutions

SLIDE 24

A final picture

  • Opportunistic GPU usage over the past few months

SLIDE 25

Summary

  • We created an opportunistic HTCondor pool in the PRP Kubernetes cluster
  • OSG users can now use any otherwise-unused cycles
  • The lack of nested containerization forces us to have multiple execute pod types
  • Some micromanagement is currently needed; hoping for more automation in the future

SLIDE 26

Acknowledgements

This work was partially funded by US National Science Foundation (NSF) awards CNS-1456638, CNS-1730158, ACI-1540112, ACI-1541349, OAC-1826967, OAC-1450871, OAC-1659169 and OAC-1841530.