OSG and the Campus Rob Gardner University of Chicago Research - - PowerPoint PPT Presentation

osg and the campus
SMART_READER_LITE
LIVE PREVIEW

OSG and the Campus Rob Gardner University of Chicago Research - - PowerPoint PPT Presentation

OSG and the Campus Rob Gardner University of Chicago Research Professor of Physics, Enrico Fermi Institute Senior Fellow, Computation Institute Towards Security Assured Cyberinfrastructure in Pennsylvania (SAC-PA) CI Cybersecurity Workshop,


slide-1
SLIDE 1

Rob Gardner • University of Chicago

Research Professor of Physics, Enrico Fermi Institute Senior Fellow, Computation Institute

OSG and the Campus

Towards Security Assured Cyberinfrastructure in Pennsylvania (SAC-PA) CI Cybersecurity Workshop, June 22, 2017

1

slide-2
SLIDE 2

What is the Open Science Grid?

2

  • Helps researchers speed up their research using high

throughput computing methods

  • Helps campus HPC administrators share resources for

multi-campus and national collaborative research

  • Last 30 days: 100M core-hours
  • Last 12 months: 200 Million jobs consumed 1 Billion

hours of computing involving 1.5 Billion data transfers to move >200 Petabytes

  • Accomplished by federating 114 clusters providing

1h-100M hours each

slide-3
SLIDE 3

OSG is Open to All

3

  • Open to providers at all scales

○ from small colleges to large national labs

  • Open to user communities at all scales

○ from individual students to large research communities ■ domain science specific and across many campuses ■ campus specific and across many domain sciences

  • Open to any business model

○ sharing, allocations, purchasing ○ preemption is an essential part of operations

slide-4
SLIDE 4

OSG Magic

4

slide-5
SLIDE 5

OSG supports computing across different types of resources

5

Seamless integration is they key to our success!

slide-6
SLIDE 6

OSG Tools to Match Diversity of Scale

  • OSG Connect

○ OSG hosts the service on OSG hardware

  • OSG Cluster in a Box

○ OSG manages services on hardware placed inside campus SciDMZs

  • OSG Compute Element

○ Gateway software that campuses deploy or OSG hosts

6

In all cases seamless integration is key

slide-7
SLIDE 7

OSG Connect Service (login.osgconnect.net)

Campus identity (CILogon) ‣ OSG Connect identity (Globus)

  • virtual organization (OSG) ‣ HTCondor to sites

⇒ Virtual HTC cluster experience

7

slide-8
SLIDE 8

OSG Connect - easy way to get started

OSG as a campus cluster

★ Login host ★ Job scheduler ★ Software ★ Storage

8

slide-9
SLIDE 9

OSG Connect Service

  • For users without an institutional submission point
  • login node for job management,

login.osgconnect.net

  • Stash is a temporary storage service

○ Globus Online, HTTP, Xrootd ○ Posix accessible from login nodes ○ Origin server for StashCache

  • Uses OASIS software repository for user-installed

software

9

slide-10
SLIDE 10

Applications Repository: OASIS

  • Repository for common user software
  • Accessed with a module command

○ identical software on all clusters ○ apps/libraries installed

#!/bin/bash switchmodules oasis module load R module load matlab ...

10

slide-11
SLIDE 11

11

slide-12
SLIDE 12

12

31 fields of science

slide-13
SLIDE 13

13

by wall hour usage

slide-14
SLIDE 14

14

diversity by discipline

slide-15
SLIDE 15

15

diversity by institution

slide-16
SLIDE 16

16

diversity by job scale

Usage by person 8,000,000 to 1 hrs

slide-17
SLIDE 17

How can OSG can help?

  • We can provide software and services that allow you to share your resources

with a specific set of other institutions, or the nation at-large. Who you share with is entirely under your control. In some cases OSG can host these services on your behalf

  • We can provide software and services that allow your scientists access to

shared resources at a specific set of other institutions, or the nation at large. Whose resources your scientists access is under the control of the scientists,

  • nce enabled by you and us.
  • We can help you with your perfSONAR configuration - to include in mesh

testing with other universities and archival of measurements for troubleshooting

17

slide-18
SLIDE 18

18

OSG User Support

CI Connect ATLAS Midwest Tier2 Center Emelie Mats Suchandra Dave Ken Bala Team

http://support.opensciencegrid.org/

Benedikt

slide-19
SLIDE 19

19

User Support

Intro to HTC on OSG Connect

Training

slide-20
SLIDE 20

20

Thank you!

slide-21
SLIDE 21

user-support@opensciencegrid.org support.opensciencegrid.org www.opensciencegrid.org/links

  • pensciencegrid

21

slide-22
SLIDE 22

22

Science sampler

With apologies for the many projects not included....

slide-23
SLIDE 23

Large Scale Genomics

  • FASTQ files are mapped to a

reference genome and converted to a BAM alignment file.

  • BAM files can be mined for gene

expression vectors that can be bundled into a gene expression matrix (GEM).

  • GEMs are a stable data structure that

can be mined for differentially expressed genes (DEGs) or used to construct Gene Co-expression Networks (GCNs)

23

Genomics

William Poehlman, Alex Feltus • Clemson University •• Stephen Ficklin, Washington State University

slide-24
SLIDE 24

Large Scale Genomics..

24

Genomics

William Poehlman, Alex Feltus • Clemson University •• Stephen Ficklin, Washington State University

slide-25
SLIDE 25

25

William Poehlman, Alex Feltus • Clemson University •• Stephen Ficklin, Washington State University

Genomics

slide-26
SLIDE 26

26

William Poehlman, Alex Feltus • Clemson University •• Stephen Ficklin, Washington State University

Genomics

slide-27
SLIDE 27

27

Medical Science

Functional Neuroimaging

Don Krieger • University of Pittsburgh

slide-28
SLIDE 28

28

Medical Science

Functional Neuroimaging..

Don Krieger • University of Pittsburgh

  • Don Krieger has been working with

TEAM TBI at the University of Pittsburgh ○ Targeted Evaluation, Action and Monitoring of Tramatic Brain Injury

  • TEAM TBI investigates the complexity
  • f brain injury, and how targeted

interventional strategies may improve

  • utcome and function.
slide-29
SLIDE 29

Large Scale Metagenomics.

29

Computational Biology

Jiang Shu • University of Nebraska Lincoln

slide-30
SLIDE 30

Large Scale Metagenomics..

30

Computational Biology

Jiang Shu • University of Nebraska Lincoln

slide-31
SLIDE 31

Large Scale Metagenomics...

31

Computational Biology

Jiang Shu • University of Nebraska Lincoln

slide-32
SLIDE 32

Counterfactual Analysis.

  • Economic analysis

& public policy

  • Considering "what

if" scenarios in microeconomics

  • Simulate

firm/consumer behaviors

32

Economics

Fernando Luco • Texas A&M University • Project:DemandSC

slide-33
SLIDE 33

Counterfactual Analysis..

33

Economics

Fernando Luco • Texas A&M University • Project:DemandSC

slide-34
SLIDE 34

Simulating Source Coding.

  • Data deluge - much
  • f it mobile traffic
  • Optical data

compression

  • Important for digital

space and satellite communication & wireless data transmission

34

Engineering

Ahmad Golmohammadi • New Mexico State University • Project:SourceCoding

slide-35
SLIDE 35

Simulating Source Coding..

  • Whole system

simulations: transmitter, decoder, receiver & stochastic noise

  • Data compression &

reconstruction algorithms

35

Engineering

Ahmad Golmohammadi • New Mexico State University • Project:SourceCoding

slide-36
SLIDE 36

Simulating Source Coding...

  • Sparse graphs can

approach fundamental limits

  • To verify the results,

large Monte Carlo samples needed - "not possible without the OSG"

36

Engineering

Ahmad Golmohammadi • New Mexico State University • Project:SourceCoding

slide-37
SLIDE 37

Evolving Strategies for Life.

  • Understanding

evolution at molecular scale in DNA with combination of mathematical modeling and simulation

  • How quickly does a

genome fix a mutation?

  • Role of randomness

versus natural selection?

37

Evolutionary Biology

Oana Carja • University of Pennsylvania • Project:EvolSims

slide-38
SLIDE 38

Evolving Strategies for Life..

38

Evolutionary Biology

Oana Carja • University of Pennsylvania • Project:EvolSims

slide-39
SLIDE 39

Models of Prebiotic Evolution

  • Protein first origin of life

model

  • Network of interacting

molecules assumed to be polymers

  • Perhaps solve Eigen's

paradox (low probability of randomly constructing "starter gene")

39

Biophysics

Ben Intoy • University of Minnesota • Project:PreBioEvo

slide-40
SLIDE 40

Models of Prebiotic Evolution..

40

Biophysics

Ben Intoy • University of Minnesota • Project:PreBioEvo

slide-41
SLIDE 41

Protein Evolution

41

Biophysics

Milo Lin • UT Southwestern • Project:EvProtDrug Understand the fundamental physical bottlenecks and dynamical behavior of protein evolution. Important questions include the extent of dominant pathways (convergent evolution) and phase transitions in evolutionary rates (punctuated equilibrium). These principals and their structural underpinnings can also be used to inform rational design of antibiotics that exploit bottlenecks in pathogen mutational response.

slide-42
SLIDE 42

Analysis of Brain Rhythms

42

Neuroscience

Scott Cole • UCSD • Project:NeurOscillation

slide-43
SLIDE 43

Analysis of Brain Rhythms..

43

Neuroscience

Scott Cole • UCSD • Project:NeurOscillation

slide-44
SLIDE 44

A FreeSurfer Workflow Service

  • Widely used software

suite for analysis of human brain MRI scans.

  • Neurophysiology of

depression, examining possible anatomical differences involved in ADHD, and studying autism

44

Neuroscience

Suchandra Thapa • University of Chicago • Project:fsurf

slide-45
SLIDE 45

A FreeSurfer Workflow Service

  • Working with Don

Krieger (Pittsburgh) to develop an OSG-based execution service

  • Uses Pegasus
  • Handles "standard"

transforms and user

  • ptions
  • To be released this

week!

45

Neuroscience

Suchandra Thapa • University of Chicago • Project:fsurf

slide-46
SLIDE 46

46

VOs

slide-47
SLIDE 47

VO Highlights: From the smallest scales...

47

MINOS+: limits on LEDs

NOvA: Fermilab-based neutrino experiment

Mu2e: Lepton-flavor violation experiment Nearly 60M opportunistic hours on OSG and counting >500,000 in one day!

slide-48
SLIDE 48

VO Highlights: From the smallest scales...

48

STAR: Heavy Ion Physics GlueX: probing exotic mesons predicted by LQCD

slide-49
SLIDE 49

...to the largest...

49

Dark Energy Survey:

Discovery of dwarf planet- Second-most distant known object in solar system Techniques applied to ongoing Planet 9 search

Ice Cube: Neutrino Observatory, also sensitive to extremely high energy cosmic rays

slide-50
SLIDE 50

...the completed to the still in planning...

50

Infrared Processing and Analysis Center: NASA’s archive for a host of IR/sub-mm astronomy missions, galaxy catalogs, Keck Observatory, and more! LIGO India: Additional detector will greatly Improve localization of gravitational wave sources

slide-51
SLIDE 51

...and working in all corners of the globe.

51

South Pole Telescope: Microwave-millimeter telescope VERITAS: 4 12m Cerenkov telescopes for gamma ray astronomy: Arizona, USA XENON1T: Dark matter detector at Gran Sasso National Laboratory, Italy

slide-52
SLIDE 52

And in space!

52

Alpha Magnetic Spectrometer (AMS) mounted at the ISS Photo credit: NASA