Urgent Computing, Sharing Grid Resources, and Elastic Computing
Pete Beckman
Argonne National Laboratory University of Chicago http://www.mcs.anl.gov/~beckman
Urgent Computing, Sharing Grid Resources, and Elastic Computing - - PowerPoint PPT Presentation
Urgent Computing, Sharing Grid Resources, and Elastic Computing Pete Beckman Argonne National Laboratory University of Chicago http://www.mcs.anl.gov/~beckman SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 Argonne Natl Lab/U
Argonne National Laboratory University of Chicago http://www.mcs.anl.gov/~beckman
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
3 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
4 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
5 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
6 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
7 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
8 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
9 Argonne Nat’l Lab/U Chicago
GETS USER GETS USER ORGANIZATION
GETS priority is invoked GETS priority is invoked “ “call-by-call call-by-call” ” Calling cards are in widespread use and easily understood by the NS/EP User, simplifying GETS usage
GETS is a "ubiquitous" service in the Public Switched Telephone Network…if you can get a DIAL TONE, you can make a GETS call
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
10 Argonne Nat’l Lab/U Chicago Automated Trigger Human Trigger Right-of-Way Token
2 1
SPRUCE Science Gateway
First Responder
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
11 Argonne Nat’l Lab/U Chicago
Conventional Job Submission Parameters
Urgent Computing Parameters
Urgent Computing Job Submission SPRUCE Job Manager Supercomputer Resource Local Site Policies Priority Job Queue User Team Authentication
3
Choose a Resource
4 5
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
12 Argonne Nat’l Lab/U Chicago
Supercomputer Resource
Domain Specialist Interpreter
6
Results Decision Maker
7
Student fun with AJAX…
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
15 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
16 Argonne Nat’l Lab/U Chicago
Deadline Urgency Level
59% 78% 98% 95% Reliability … … … … … 45 days ago City Airflow SDSC::Elimidata SDSC::Elimidata NCSA::Cobalt NCSA::Cobalt Platform 8 days ago Tornado 30 days ago Influenza 14 days ago City Airflow Validated App. Normal priority, no SPRUCE support SDSC::Datastar Automated, immediate access, kill existing jobs, 10 min turnaround Automated, next job Human-in-the-loop, immediate access, kill existing jobs, 15 min. turnaround Policy … … … … SDSC::Elimidata PSC::Rachel NCSA::Cobalt Platform
Warm Standby Validation History Site Policies
(5.3 hrs, 1024 nodes) SDSC::Datastar Immediate Immediate Next Available Job (Policy Based) … … … PSC::Rachel NCSA::Cobalt Platform
Live Job/Queue Data User Team MDS4 Service SPRUCE Data Advisor Best HPC Resource Urgent Computation Request
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
18 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
19 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
20 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
22 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
23 Argonne Nat’l Lab/U Chicago
SPRUCE Urgent Computing Flat Rock, North Carolina, 2006 http://www.mcs.anl.gov/~beckman
24 Argonne Nat’l Lab/U Chicago