Dynamic Virtual Clusters in a Grid Site Manager - PowerPoint PPT Presentation
Dynamic Virtual Clusters in a Grid Site Manager
Jeff Chase, David Irwin, Laura Grit, Justin Moore, Sara Sprenkle
Department of Computer Science, Duke University
Dynamic Virtual Clusters
Grid Services
Motivation
Next Generation Grid
- Flexibility
Dynamic instantiation of software environments and services
- Predictability
Resource reservations for predictable application service quality
- Performance
Dynamic adaptation to changing load and system conditions
- Manageability
Data center automation
Cluster-On-Demand (COD)
[Diagram: COD Manager running DHCP, DNS, NIS, and NFS, backed by a COD database (templates, status) and hosting Virtual Cluster #1 and Virtual Cluster #2]
Differences:
- OS (Windows, Linux)
- Attached File Systems
- Applications
- User accounts
Goals for this talk
- Explore virtual cluster provisioning
- Middleware integration (feasibility, impact)
Cluster-On-Demand and the Grid
Safe to donate resources to the grid
- Resource peering between companies or universities
- Isolation between local users and grid users
- Balance local vs. global use
Controlled provisioning for grid services
- Service workloads tend to vary with time
- Policies reflect priority or peering arrangements
- Resource reservations
Multiplex many Grid PoPs
- Avaki and Globus on the same physical cluster
- Multiple peering arrangements
Outline
Overview
- Motivation
- Cluster-On-Demand
System Architecture
- Virtual Cluster Managers
- Example Grid Service: SGE
- Provisioning Policies
Experimental Results
Conclusion and Future Work
System Architecture
[Diagram: COD Manager (with provisioning policy) drives three VCMs (vclusters A, B, C) over an XML-RPC interface; each VCM controls a Sun GridEngine batch pool within an isolated vcluster via GridEngine commands at the middleware layer, with node reallocation underneath]
Virtual Cluster Manager (VCM)
Communicates with COD Manager
- Supports graceful resizing of vclusters
Simple extensions for well-structured grid services
- Support already present
Software handles membership changes
- Node failures and incremental growth
- Application services can handle this gracefully
[Diagram: COD Manager invokes add_nodes, remove_nodes, and resize on the VCM, which manages the grid service within its vcluster]
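The slides name three VCM calls carried over XML-RPC: add_nodes, remove_nodes, and resize. A minimal sketch of what such an interface might look like in Python (only the method names and the XML-RPC transport come from the slides; the class shape, state, and return values are assumptions for illustration):

```python
from xmlrpc.server import SimpleXMLRPCServer

class VirtualClusterManager:
    """Mediates between the COD Manager and one grid service instance."""

    def __init__(self):
        self.nodes = set()

    def resize(self):
        # Report how many nodes this vcluster wants; a real VCM would
        # inspect the middleware's job queue here.
        return 0

    def add_nodes(self, hostnames):
        # The COD Manager granted nodes: register them with the service.
        self.nodes.update(hostnames)
        return len(self.nodes)

    def remove_nodes(self, hostnames):
        # The COD Manager reclaimed nodes: drain and release them.
        self.nodes.difference_update(hostnames)
        return len(self.nodes)

if __name__ == "__main__":
    # Expose the three calls over XML-RPC (port 0 picks a free port).
    server = SimpleXMLRPCServer(("localhost", 0), allow_none=True)
    server.register_instance(VirtualClusterManager())
    # server.serve_forever()  # left commented in this sketch
```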
Sun GridEngine
- Ran GridEngine middleware within vclusters
- Wrote wrappers around the GridEngine scheduler
- Did not alter GridEngine
- Most grid middleware can support such modules
[Diagram: COD Manager invokes add_nodes, remove_nodes, and resize on the VCM, which drives GridEngine via qconf and qstat]
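The wrapper approach can be illustrated with a hypothetical qstat parser: shell out to the unmodified GridEngine tools and count pending jobs. Only the qstat/qconf commands come from the slides; real SGE output formats vary by version, so the column layout assumed here is illustrative:

```python
import subprocess

def count_pending_jobs(qstat_output=None):
    """Count queued ('qw' state) jobs from `qstat` output.

    If no output is supplied, invoke qstat itself (requires SGE).
    """
    if qstat_output is None:
        qstat_output = subprocess.run(
            ["qstat"], capture_output=True, text=True, check=True
        ).stdout
    pending = 0
    # Assumed layout: a header line, a separator line, then one job per
    # line with the state in the fifth column.
    for line in qstat_output.splitlines()[2:]:
        fields = line.split()
        if len(fields) >= 5 and fields[4] == "qw":
            pending += 1
    return pending
```

A local policy module could call this each epoch to decide whether the vcluster should request or relinquish nodes.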
Pluggable Policies
Local Policy
- Request a node for every x jobs in the queue
- Relinquish a node after being idle for y minutes
Global Policies
- Simple Policy
Each vcluster has a priority; higher-priority vclusters can take nodes from lower-priority vclusters
- Minimum Reservation Policy
Each vcluster is guaranteed a percentage of nodes upon request; prevents starvation
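The two policy levels above can be sketched as small functions. The rules come from the slides, but the parameter values are illustrative assumptions, and arbitration among simultaneously contending requests is omitted:

```python
def local_desired_nodes(queued_jobs, jobs_per_node=5):
    """Local policy: request one node for every `jobs_per_node` queued
    jobs (at least one node whenever work is pending)."""
    if queued_jobs == 0:
        return 0
    return -(-queued_jobs // jobs_per_node)  # ceiling division

def minimum_reservation(requests, reserved_frac, total_nodes):
    """Global policy sketch: a vcluster that requests nodes is granted
    at least its guaranteed share of the cluster, preventing starvation."""
    grants = {}
    for vc, want in requests.items():
        floor = int(reserved_frac[vc] * total_nodes)
        grants[vc] = max(want, floor) if want > 0 else want
    return grants
```

The idle-timeout half of the local policy (relinquish a node after y idle minutes) would run as a separate periodic check in the VCM.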
Outline
Overview
- Motivation
- Cluster-On-Demand
System Architecture
- Virtual Cluster Managers
- Example Grid Service: SGE
- Provisioning Policies
Experimental Results
Conclusion and Future Work
Experimental Setup
Live Testbed
- Devil Cluster (IBM, NSF)
71 node COD prototype
- Trace-driven; sped up traces to execute in 12 hours
- Ran synthetic applications
Emulated Testbed
- Emulates the output of SGE commands
- Invisible to the VCM that is using SGE
- Trace driven
- Facilitates fast, large scale tests
Real batch traces
- Architecture, BioGeometry, and Systems groups
Live Test
[Figure: nodes allocated to the Systems, Architecture, and BioGeometry vclusters (up to ~80 nodes) over eight days]
[Figure: jobs submitted to the Systems, Architecture, and BioGeometry vclusters (up to ~2500 jobs) over the same eight days]
Architecture Vcluster
Emulation Architecture
COD Manager
Each Epoch
- 1. Call resize module
- 2. Pushes emulation forward one epoch
- 3. qstat returns new state of cluster
- 4. add_node and remove_node alter the emulator
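The four epoch steps might loop like this in the emulation harness. Only the step order comes from the slide; the object names and interfaces are invented for illustration:

```python
def run_emulation(cod_manager, vcms, emulator, epochs):
    """Drive the emulated cluster forward one epoch at a time."""
    for _ in range(epochs):
        # 1. The COD Manager calls each VCM's resize module; the VCMs
        #    consult the (emulated) qstat state to form their requests.
        requests = {vcm.name: vcm.resize() for vcm in vcms}
        # 2. The resize call pushes the emulator forward one epoch of
        #    the trace, so qstat will report the cluster's new state
        #    next time around (step 3).
        emulator.advance_epoch()
        # 4. The COD Manager's grants call add_node / remove_node,
        #    which alter the emulator's view of each vcluster.
        for name, delta in cod_manager.allocate(requests).items():
            if delta > 0:
                emulator.add_node(name, delta)
            elif delta < 0:
                emulator.remove_node(name, -delta)
```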
[Diagram: the COD Manager communicates over the XML-RPC interface with three VCMs; an emulated GridEngine front end answers qstat from per-vcluster traces (Architecture, Systems, BioGeometry) that drive load generation]
COD Manager and VCM are unmodified from the real system
Minimum Reservation Policy
Emulation Results
Minimum Reservation Policy
- Example policy change
- Removed starvation problem
Scalability
- Ran the same experiment with 1000 nodes in 42 minutes, making all node transitions that would have occurred in 33 days
- There were 3.7 node transitions per second, resulting in approximately 37 database accesses per second
- Database is scalable to large clusters
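The quoted numbers are internally consistent, as a quick check shows: 33 days compressed into 42 minutes is roughly an 1131x speedup, and 37 database accesses per second at 3.7 transitions per second is about 10 accesses per transition.

```python
days_emulated, run_minutes = 33, 42
speedup = days_emulated * 24 * 60 / run_minutes   # ~1131x compression
transitions = 3.7 * run_minutes * 60              # ~9300 transitions total
accesses_per_transition = 37 / 3.7                # ~10 per transition
```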
Related Work
Cluster Management
- NOW, Beowulf, Millennium, Rocks
- Homogeneous software environment for specific applications
Automated Server Management
- IBM’s Oceano and Emulab
- Target specific applications (Web services, network emulation)
Grid
- COD can support GARA for reservations
- SNAP combines SLAs of resource components
COD controls resources directly
Future Work
Experiment with other middleware
Economic-based policy for batch jobs
Distributed market economy using vclusters
- Maximize profit based on utility of applications
- Trade resources between Web services, grid services, batch schedulers, etc.
Conclusion
No change to GridEngine middleware
Important for Grid services
- Isolates grid resources from local resources
- Enables policy-based resource provisioning
Policies are pluggable
Prototype system
- Sun GridEngine as middleware
Emulated system
- Enables fast, large-scale tests
- Test policy and scalability
Example Epoch
[Diagram: COD Manager coordinating Sun GridEngine batch pools within three isolated vclusters (Architecture, Systems, and BioGeometry nodes), each with its own VCM]
- 1abc. COD Manager calls resize on each VCM
- 2a, 2b, 2c. Each VCM polls its pool with qstat
- 3a. Architecture requests nothing; 3b. Systems requests a node; 3c. BioGeometry removes a node
- 4, 6. Requests are formatted and forwarded
- 5. COD Manager makes allocations, updates the database, and configures nodes
- 7b. add_node; 7c. remove_node
- 8b. qconf add_host; 8c. qconf remove_host
- Node reallocation completes the epoch