Brookhaven Laboratory Cloud Activities Update John Hover, Jose - PowerPoint PPT Presentation

Brookhaven Laboratory Cloud Activities Update John Hover, Jose Caballero John Hover, Jose Caballero US ATLAS T2/3 Workshop US ATLAS T2/3 Workshop Indianapolis, Indiana Indianapolis, Indiana John Hover 11 March 2013 1

Outline Addendum to November Santa Cruz Status Report Addendum to November Santa Cruz Status Report http://indico.cern.ch/conferenceProgram.py?confId=201788 Current BNL Status Current BNL Status – Condor Scaling on EC2 – EC2 Spot Pricing and Condor – VM Lifecycle with APF – Cascading multi-target clusters Next Steps and Plans Next Steps and Plans Discussion Discussion John Hover 11 March 2013 2

Condor Scaling 1 RACF recieved a $50K grant from Amazon: Great opportunity to test: RACF recieved a $50K grant from Amazon: Great opportunity to test: – Condor scaling to thousands of nodes over WAN – Empirically determine costs Naive Approach: Naive Approach: – Single Condor host (schedd, collector, etc.) – Single process for each daemon – Password authentication – Condor Connection Broker (CCB) Result: Maxed out at ~3000 nodes Maxed out at ~3000 nodes Result: – Collector load causing timeouts of schedd daemon. – CCB overload? – Network connections exceeding open file limits – Collector duty cycle -> .99. John Hover 11 March 2013 3

Condor Scaling 2 Refined approach: Refined approach: – Split schedd from collector,negotiator,CCB – Run 20 collector processes. Configure startds to randomly choose one. Enable collector reporting, so that all sub-collectors report to one top- level collector (which is not public). – Tune OS limits: 1M open files, – Enable shared port daemon on all nodes: multiplexes TCP connections. Results in dozens of connections rather than thousands. – Enable session auth, so that each connection after the first bypasses password auth check. Result: Result: – Smooth operation up to 5000 startds, even with large bursts. – No disruption of schedd operation on other host. – Collector duty cycle ~.35. Substantial headroom left. Switching to 7-slot startds would get us to 35000 slots, with marginal additional load. John Hover 11 March 2013 4

Condor Scaling 3 Overall results: Overall results: – Ran ~5000 nodes for several weeks. – Production simulation jobs. Stageout to BNL. – Spent approximately $13K. Only $750 was for data transfer. – Moderate failure rate due to spot terminations. – Actual spot price paid very close to baseline, e.g. still less than . $.01/hr for m1.small. – No solid statistics on efficiency/cost yet, beyond a rough appearance of “competitive.” John Hover 11 March 2013 5

EC2 Spot Pricing and Condor On-demand vs. Spot On-demand vs. Spot – On-Demand: You pay standard price. Never terminates. – Spot: You declare maximum price. You pay current,variable spot price. If/when spot price exceeds your maximum, instance is terminated without warning. Note: NOT like priceline.com, where you pay what you bid. Problems: Problems: – Memory provided in units of 1.7GB, less than ATLAS standard. – More memory than needed per “virtual core” – NOTE: On our private Openstack, we created a 1-core, 2GB RAM instance type--avoiding this problem. Condor now supports submission of spot-price instance jobs. Condor now supports submission of spot-price instance jobs. – Handles it by making one-time spot request, then cancelling it when fulfilled. John Hover 11 March 2013 6

EC2 Types Type Memory VCores “CUs” CU/Core $Spot/hr $On- Slots? Typical Demand/hr m1.small 1.7G 1 1 1 .007 .06 - m1.medium 3.75G 1 2 2 .013 .12 1 m1.large 7.5G 2 4 2 .026 .24 3 m1.xlarge 15G 4 8 2 .052 .48 7 Issues/Observations: Issues/Observations: – We currently bid 3 * <baseline>. Is this optimal? – Spot is ~1/10th the cost of on-demand. Nodes are ~1/2 as powerful as our dedicated hardware. Based on estimates of Tier 1 costs, this is competitive. – Amazon provides 1.7G memory per CU, not “CPU”. Insufficient for ATLAS work (tested). – Do 7 slots on m1.xlarge perform economically? John Hover 11 March 2013 7

EC2 Spot Considerations Service and Pricing Service and Pricing – Nodes terminated without warning. (No signal.) – Partial hours are not charged . Therefore, ATLAS needs to consider: Therefore, ATLAS needs to consider: – Shorter jobs. Simplest approach. Originally ATLAS worked to ensure jobs were at least a couple hours, to avoid pilot flow congestion. Now we have the opposite need. – Checkpointing. Some work in Condor world providing the ability to checkpoint without linking to special libraries. (But not promising.) – Per-event stageout (event server). If ATLAS provides sub-hour units of work, we could get significant free time! If ATLAS provides sub-hour units of work, we could get significant free time! John Hover 11 March 2013 8

VM Lifecycle with APF Condor scaling test used manually started EC2/Openstack VMs. Now we want Condor scaling test used manually started EC2/Openstack VMs. Now we want APF to manage this: APF to manage this: 2 AutoPyFactory (APF) Queues 2 AutoPyFactory (APF) Queues – First (standard) observes a Panda queue, submits pilots to local Condor pool. – Second observes a local Condor pool, when jobs are Idle, submits WN VMs to IaaS (up to some limit). Worker Node VMs Worker Node VMs – Condor startds join back to local Condor cluster. VMs are identical, don't need public IPs, and don't need to know about each other. Panda site (BNL_CLOUD) Panda site (BNL_CLOUD) – Associated with BNL SE, LFC, CVMFS-based releases. – But no site-internal configuration (NFS, file transfer, etc). John Hover 13 Nov 2012 9

John Hover 13 Nov 2012 10

VM Lifecycle 2 Current status: Current status: – Automatic ramp-up working properly. – Submits properly to EC2 and Openstack via separate APF queues. – Passive draining when Panda queue work completes. – Out-of-band shutdown and termination via command line tool: Required configuration to allow APF user to retire nodes. Next step: Next step: – Active ramp-down via retirement from within APF. – Adds in tricky issue of “un-retirement” during alternation between ramp-up and ramp-down. – APF issues condor_off -peaceful -daemon startd -name <host> – APF uses condor_q and condor_status to associate startds with VM jobs. Adds in startd status to VM job info and aggregate statistics. Next step 2: Next step 2: – Automatic termination of retired startd VMs. Accomplished by comparing condor_status and condor_status -master output. John Hover 13 Nov 2012 11

Ultimate Capabilities APF's intrinsic queue/plugin architecture, and code in development, will APF's intrinsic queue/plugin architecture, and code in development, will allow: allow: – Multiple targets • E.g., EC2 us-east-1, us-west-1, us-west-2 all submitted to in a load-balanced fashion. – Cascading targets, e.g.: • We can preferentially utilize free site clouds (e.g. local Openstack or other academic clouds) • Once that is full we submit to EC2 spot-priced nodes. • During particularly high demand, submit EC on-demand nodes. • Retire and terminate in reverse order. The various pieces exist and have been tested, but final integration in APF The various pieces exist and have been tested, but final integration in APF is in progress. is in progress. John Hover 13 Nov 2012 12

Next Steps/Plans APF Development APF Development – Complete APF VM Lifecycle Management feature. – Simplify/refactor Condor-related plugins to reduce repeated code. Fault tolerance. – Run multi-target workflows. Is more between-queue coordination is necessary in practice. Controlled Performance/Efficiency/Cost Measurements Controlled Performance/Efficiency/Cost Measurements – Test m1.xlarge (4 “cores”, 15GB RAM) with 4, 5, 6, and 7 slots. – Measure “goodput” under various spot pricing schemes. Is 3*<baseline> sensible? – Google Compute Engine? Other concerns Other concerns – Return to refinement of VM images. Contextualization. John Hover 13 Nov 2012 13

Questions/Discussion John Hover 13 Nov 2012 14

Brookhaven Laboratory Cloud Activities Update John Hover, Jose - PowerPoint PPT Presentation

Brookhaven Laboratory Cloud Activities Update John Hover, Jose Caballero John Hover, Jose Caballero US ATLAS T2/3 Workshop US ATLAS T2/3 Workshop Indianapolis, Indiana Indianapolis, Indiana John Hover 11 March 2013 1 Outline Addendum to

Recent Developments at the Brookhaven Source Development Laboratory Brian Sheehy National

Barbara Chapman Stony Brook University Brookhaven National Laboratory How To Get Tied Up In

Building a Private Cloud Cloud Infrastructure Using Opensource Building a Private Cloud OSCON

KAFKA STREAMS CLOUD MONITORING AWS CLOUD MONITORING AWS APP CLOUD MONITORING AWS HTTP APP

Update Natural and Cultural Resources Brookhaven National Laboratory Community Advisory Council

Natural and Cultural Resources Update Brookhaven National Laboratory Community Advisory Council

SNR SNR- -cloud interaction cloud interaction cloud interaction SNR SNR cloud interaction

Cloud Cloud Cloud Cloud network Edge Edge Edge Edge as a Edge Edge Edge Edge Edge

Cloud Ross Mallace Commercial Director Cloud/SaaS Cloud is here. ALL By 2020 most core

Embracing Cloud Ian Apperley Agenda A little about me What is Cloud and where did it come

Are We Really Cloud-Native? Bert Ertman Cloud-Native Computing What is Cloud-Native? answer:

CS5412: THE CLOUD VALUE PROPOSITION Lecture XXII Ken Birman Cloud Hype 2 The cloud is

SAS and (the) Cloud Dave Annis SAS Solutions onDemand SAS and (the) Cloud Everyones Cloud

Cloud Computing & Cloud Models Cloud Models Topics Defining cloud computing

CS5412: THE CLOUD VALUE PROPOSITION Lecture XXII Ken Birman Cloud Hype 2 The cloud is

Electron Cloud Build Electron Cloud Build- Electron Cloud Build Electron Cloud Build -Up

These are slides with a history. I found them on the web... They are apparently based on Dan

Strategic bidding in electricity markets Pr Holmberg, Research Institute of Industrial Economics

2.6 Gradients and Directional Derivatives Prof. Tesler Math 20C Fall 2018 Prof. Tesler 2.6

Computer Graphics (CS 543) Lecture 6 (Part 1): Lighting, Shading and Materials (Part 1) Prof

International Web Sites Richard Ishida 1 Version: 9 October 2006 International Web Sites This

Vanderbilt University Poll: 2020 Nashville Edition April 9 th May 10 th , 2020 John G. Geer

Guardianship and Speakers Reasonable Pat Cottingham and Marla Fischer-Lempke Alternatives

for Children in Kinship Foster Care: Making Informed Decisions FEBRUARY 27, 2020 WEBINAR 2

Brookhaven Laboratory Cloud Activities Update John Hover, Jose - PowerPoint PPT Presentation

Brookhaven Laboratory Cloud Activities Update John Hover, Jose Caballero John Hover, Jose Caballero US ATLAS T2/3 Workshop US ATLAS T2/3 Workshop Indianapolis, Indiana Indianapolis, Indiana John Hover 11 March 2013 1 Outline Addendum to

Recent Developments at the Brookhaven Source Development Laboratory Brian Sheehy National

Barbara Chapman Stony Brook University Brookhaven National Laboratory How To Get Tied Up In

Building a Private Cloud Cloud Infrastructure Using Opensource Building a Private Cloud OSCON

KAFKA STREAMS CLOUD MONITORING AWS CLOUD MONITORING AWS APP CLOUD MONITORING AWS HTTP APP

Update Natural and Cultural Resources Brookhaven National Laboratory Community Advisory Council

Natural and Cultural Resources Update Brookhaven National Laboratory Community Advisory Council

SNR SNR- -cloud interaction cloud interaction cloud interaction SNR SNR cloud interaction

Cloud Cloud Cloud Cloud network Edge Edge Edge Edge as a Edge Edge Edge Edge Edge

Cloud Ross Mallace Commercial Director Cloud/SaaS Cloud is here. ALL By 2020 most core

Embracing Cloud Ian Apperley Agenda A little about me What is Cloud and where did it come

Are We Really Cloud-Native? Bert Ertman Cloud-Native Computing What is Cloud-Native? answer:

CS5412: THE CLOUD VALUE PROPOSITION Lecture XXII Ken Birman Cloud Hype 2 The cloud is

SAS and (the) Cloud Dave Annis SAS Solutions onDemand SAS and (the) Cloud Everyones Cloud

Cloud Computing &amp; Cloud Models Cloud Models Topics Defining cloud computing

CS5412: THE CLOUD VALUE PROPOSITION Lecture XXII Ken Birman Cloud Hype 2 The cloud is

Electron Cloud Build Electron Cloud Build- Electron Cloud Build Electron Cloud Build -Up

These are slides with a history. I found them on the web... They are apparently based on Dan

Strategic bidding in electricity markets Pr Holmberg, Research Institute of Industrial Economics

2.6 Gradients and Directional Derivatives Prof. Tesler Math 20C Fall 2018 Prof. Tesler 2.6

Computer Graphics (CS 543) Lecture 6 (Part 1): Lighting, Shading and Materials (Part 1) Prof

International Web Sites Richard Ishida 1 Version: 9 October 2006 International Web Sites This

Vanderbilt University Poll: 2020 Nashville Edition April 9 th May 10 th , 2020 John G. Geer

Guardianship and Speakers Reasonable Pat Cottingham and Marla Fischer-Lempke Alternatives

for Children in Kinship Foster Care: Making Informed Decisions FEBRUARY 27, 2020 WEBINAR 2

Cloud Computing & Cloud Models Cloud Models Topics Defining cloud computing