University of California, San Diego: Prism@UCSD (presentation, 4/30/2014)



slide-1
SLIDE 1

4/30/2014

slide-2
SLIDE 2

University of California, San Diego

  • Prism@UCSD – Science DMZ

– PI: P. Papadopoulos, co-PI: L. Smarr
– 01/01/2013 to 12/31/2014

  • CHERuB – 100G campus gateway

– PI: M. Norman, co-PIs: T. Hutton, V. Polichar
– 01/01/2014 to 12/31/2015

slide-3
SLIDE 3

UCSD and its environment

Map labels: Scripps Institution of Oceanography, Salk Institute, Venter Institute, General Atomics, Calit2, SDSC, Physics, Medical School, Skaggs, NCMIR, Stem Cell Institute

In addition to the 3 main UCSD units:

  • General Campus
  • Medical School
  • Scripps I.O.

there are many other research organizations on and around campus.

slide-4
SLIDE 4

Connecting YOU on UCSD Campus with the World By Creating a Big Data Freeway System

NSF CC-NIE Has Awarded Prism@UCSD Optical Switch

Phil Papadopoulos, SDSC, Calit2, PI

CHERuB

slide-5
SLIDE 5

Prism@UCSD: A Researcher Defined 10 and 40Gbit/s Campus Scale Data Carrier

Project in Brief

  • high-bandwidth end-to-end optical connections
  • routed by next-generation Arista switches (7504)
  • connects lab “data producers” with SDSC data-intensive computing & storage resources
  • 10 Terabit/s of aggregate bandwidth; has full bisection bandwidth similar to in-machine-room clusters, but is deployed at campus scale
  • builds upon and upgrades the Quartzite "campus-scale network laboratory" NSF MRI (awarded 2006)
  • adds IPv6 and OpenFlow
  • existing optical fiber connection to SDSC is being expanded to 120Gbps as a high-bandwidth bridge to cloud/parallel storage and NSF XSEDE resources

slide-6
SLIDE 6

PRISM Puts SDSC’s Big Data Gordon Supercomputer and Data Oasis Storage Into Your Lab


slide-7
SLIDE 7

PRISM is Connecting CERN’s CMS Experiment To Our Physics Department

80 Gbps PRISM Connection Has Been Made

slide-8
SLIDE 8

UCSD is a Tier-2 LHC Data Center: CMS Flow into UCSD Physics Dept. Peaks at 2.4 Gbps

Source: Frank Wuerthwein, Physics UCSD

slide-9
SLIDE 9

Dan Cayan, USGS Water Resources Discipline; Scripps Institution of Oceanography, UC San Diego

much support from Mary Tyree, Mike Dettinger, Guido Franco and other colleagues

Sponsors: California Energy Commission, NOAA RISA program, California DWR, DOE, NSF

Planning for climate change in California

substantial shifts on top of already high climate variability

SIO Campus Climate Researchers Need to Download Results from Remote Supercomputer Simulations to Make Regional Climate Change Forecasts

slide-10
SLIDE 10

Figure (two panels): average summer afternoon temperature; GFDL A2 1km downscaled to 1km. Credits: Hugo Hidalgo, Tapash Das, Mike Dettinger

slide-11
SLIDE 11

Ultra High Resolution Microscopy Images Created at the National Center for Microscopy and Imaging Research (NCMIR)

slide-12
SLIDE 12

NIH National Center for Microscopy & Imaging Research Integrated Infrastructure of Shared Resources

Source: Steve Peltier, Mark Ellisman, NCMIR

Diagram labels: Local SOM Infrastructure, Scientific Instruments, End User FIONA Workstation, Shared Infrastructure

slide-13
SLIDE 13

PRISM Links Calit2’s VROOM to NCMIR to Explore Confocal Light Microscope Images of Rat Brains

slide-14
SLIDE 14

Protein Data Bank (PDB) Needs Bandwidth to Connect Resources and Users

  • Archive of experimentally determined 3D structures of proteins, nucleic acids, complex assemblies
  • One of the largest scientific resources in life sciences

Source: Phil Bourne and Andreas Prlić, PDB

Image labels: Hemoglobin, Virus

slide-15
SLIDE 15

PDB Usage Is Growing Over Time

  • More than 300,000 Unique Visitors per Month
  • Up to 300 Concurrent Users
  • ~10 Structures are Downloaded per Second 24/7/365
  • Increasingly Popular Web Services Traffic

Source: Phil Bourne and Andreas Prlić, PDB

slide-16
SLIDE 16

2010 FTP Traffic

  • RCSB PDB: 159 million entry downloads
  • PDBe: 34 million entry downloads
  • PDBj: 16 million entry downloads

Source: Phil Bourne and Andreas Prlić, PDB
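
To put the three 2010 FTP figures side by side, the short sketch below totals them and computes each site's share. It uses only the download counts quoted on the slide; the variable and function names are illustrative, not part of any PDB tooling.

```python
# 2010 FTP entry downloads, in millions, as quoted on the slide.
FTP_DOWNLOADS_MILLIONS = {
    "RCSB PDB": 159,
    "PDBe": 34,
    "PDBj": 16,
}


def download_shares(downloads):
    """Return each site's percentage share of the combined downloads."""
    total = sum(downloads.values())
    return {site: 100.0 * count / total for site, count in downloads.items()}


total_millions = sum(FTP_DOWNLOADS_MILLIONS.values())
print(f"Total: {total_millions} million entry downloads")
for site, share in download_shares(FTP_DOWNLOADS_MILLIONS).items():
    print(f"{site}: {share:.1f}%")
```

On these figures, RCSB PDB carries roughly three quarters of the combined FTP traffic.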

slide-17
SLIDE 17
  • Why is it Important?

– Enables PDB to Better Serve Its Users by Providing Increased Reliability and Quicker Results

  • How Will it be Done?

– By More Evenly Allocating PDB Resources at Rutgers and UCSD
– By Directing Users to the Closest Site

  • Need High Bandwidth Between Rutgers & UCSD Facilities

PDB Plans to Establish Global Load Balancing

Source: Phil Bourne and Andreas Prlić, PDB
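
"Directing users to the closest site" can be sketched as a nearest-site lookup. The example below is only an illustration under stated assumptions: it uses great-circle distance as a stand-in for network proximity, and the site coordinates are approximate values chosen for the example. The slides do not say how PDB measures closeness; a production load balancer would more likely use DNS or measured latency.

```python
import math

# Approximate (latitude, longitude) in degrees; assumed values for illustration only.
SITES = {
    "Rutgers": (40.52, -74.46),
    "UCSD": (32.88, -117.23),
}

EARTH_RADIUS_KM = 6371.0


def great_circle_km(a, b):
    """Haversine distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def closest_site(user_location, sites=SITES):
    """Return the name of the site with the smallest great-circle distance."""
    return min(sites, key=lambda name: great_circle_km(user_location, sites[name]))


print(closest_site((40.71, -74.01)))   # a user near New York
print(closest_site((34.05, -118.24)))  # a user near Los Angeles
```

Geographic distance is a crude proxy, which is one reason the slide also calls for high bandwidth between the two facilities: the sites must stay synchronized regardless of where a user is sent.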

slide-18
SLIDE 18

PRISM Will Link Computational Mass Spectrometry and Genome Sequencing Cores to the Big Data Freeway

  • ProteoSAFe: Compute-intensive discovery MS at the click of a button
  • MassIVE: repository and identification platform for all MS data in the world

Source: proteomics.ucsd.edu

slide-19
SLIDE 19

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO

http://cherub.ucsd.edu

slide-20
SLIDE 20

CHERuB*: SDSC-ACT partner to bring 100Gbps connectivity to UCSD

Map: UCSD/SDSC new 100G path. Sites shown: LBL (CMMAP), UNL (OSG), UWisc Madison (OSG), NICS (CMMAP), UCR, FNAL (Tier-1 LHC), Austin/TACC, UCSB, NERSC (POLARBEAR, CAIDA)

Legend:

  • Pink line – New CENIC 100G
  • Blue lines – Existing/planned ANI 100G
  • Green lines – Existing PacWave 100G
  • Maroon lines – XSEDE 10G network
  • Thin lines – Other existing 10G or lower

Production late 2014

*Configurable, High-speed, Extensible Research Bandwidth

slide-21
SLIDE 21

The Plumbing (ask Tom Hutton)

Network diagram (labels recovered from the figure):

  • External networks: PacWave, CENIC, Internet2, NLR, ESnet, StarLight, XSEDE & other R&E networks
  • DWDM 100G transponders at 818 W. 7th, Los Angeles, CA and at 10100 Hopkins Drive, La Jolla, CA; up to 3 add'l 100G transponders can be attached at each end (to CENIC/PacWave switch L2)
  • UCSD/SDSC Gateway Juniper MX960 "MX0": new 2x100G/8x10G line card + optics, new 40G line card + optics
  • SDSC Juniper MX960 "Medusa": new 100G card/optics; other SDSC resources
  • UCSD Primary Node Cisco 6509 "Node B"
  • PRISM@UCSD Arista 7504: many UCSD big data users, mult. 40G+ connections
  • UCSD production users: mult. 10G connections
  • GORDON compute cluster: mult. 40G connections
  • UCSD/SDSC Cisco 6509; UCSD DYNES; SDSC DYNES; add'l 10G card/optics
  • Equinix/L3/CENIC POP; SDSC NAP; existing CENIC fiber; existing ESnet SD router
  • Dual Arista 7508 "Oasis": 128x10G, 256x10G; DataOasis/SDSC Cloud
  • Link labels: 2x40G, 4x10G, 100G, Nx10G, 10G

Key: Green/dashed lines - new component/equipment in proposal; Pink/black - existing UCSD infrastructure

slide-22
SLIDE 22

CENIC/ESnet 100G Connection enables Big Data science collaborations between NERSC and SDSC

Map: UCSD/SDSC new 100G path. Sites shown: LBL (CMMAP), UNL (OSG), UWisc Madison (OSG), NICS (CMMAP), UCR, FNAL (Tier-1 LHC), Austin/TACC, UCSB, NERSC (POLARBEAR, CAIDA)

Legend:

  • Pink line – New CENIC 100G
  • Blue lines – Existing/planned ANI 100G
  • Green lines – Existing PacWave 100G
  • Maroon lines – XSEDE 10G network
  • Thin lines – Other existing 10G or lower

slide-23
SLIDE 23

A Unique, Powerful, Data-Intensive Testbed for Scientific Discovery

  • EDISON HPC SYSTEM: 2 PF, 434 TB RAM
  • GORDON HPD SYSTEM: 0.3 PF, 364 TB RAM+SSD
  • Link between the two systems' DTNs: ESnet/CENIC 100 Gb/s

Other diagram labels: 6 PB, 150 GB/s, 100 GB/s, 4.5 PB

slide-24
SLIDE 24

POLARBEAR Cosmology Telescope

UC Berkeley/NERSC-UCSD/SDSC

  • Goal: Measure B-mode polarization in the CMB from inflation era
  • Data path: Chile (obs) - UCB/NERSC (analysis) - UCSD/SDSC (analysis)
  • Data acquisition rates:
    – 22 GB/mo. (current)
    – 3 TB/mo. (2014-2016)
  • Map making data analysis: NERSC & SDSC
    – 100 MC realizations of 100 TB data = 10 PB

Atacama Desert, Chile
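
The 10 PB figure above follows from 100 Monte Carlo realizations of 100 TB each. The sketch below reproduces that arithmetic and estimates how long moving that volume would take at a few link speeds. It assumes decimal units (1 TB = 10^12 bytes) and an ideal link running at full line rate with no protocol overhead, so real transfers would take longer. The function name and the set of link speeds are illustrative choices.

```python
TB = 10**12  # bytes, decimal units (assumption)
PB = 10**15  # bytes, decimal units (assumption)
SECONDS_PER_DAY = 86400


def transfer_days(data_bytes, link_bits_per_second):
    """Ideal transfer time in days: no overhead, link fully utilized."""
    return data_bytes * 8 / link_bits_per_second / SECONDS_PER_DAY


# 100 MC realizations x 100 TB each, as stated on the slide.
total_bytes = 100 * 100 * TB
print(f"Total volume: {total_bytes / PB:.0f} PB")

for gbps in (10, 40, 100):
    days = transfer_days(total_bytes, gbps * 10**9)
    print(f"{gbps:>3} Gb/s link: {days:.1f} days")
```

Under these assumptions the full 10 PB takes on the order of three months at 10 Gb/s and a little over nine days at 100 Gb/s, which illustrates why the NERSC to SDSC path benefits from the 100G connection.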

slide-25
SLIDE 25

Next Generation Network Measurement

CAIDA (SDSC)-NERSC

  • CAIDA operates the UCSD Network Telescope, which collects Internet Background Radiation
  • Data paths: global internet, ESnet
  • Data rates: 3-4 TB/mo
  • Using NERSC tape archive to replicate 100 TB historical data
  • Other projects: network measurement tools, Future Internet Architecture

Figure labels: 100’s TB archival data SDSC/NERSC; unassigned IPv4 addresses

slide-26
SLIDE 26

High Energy Physics LHC/US-CMS

UCSD Tier-2—US-CMS collaboration

  • Goals: Higgs boson, supersymmetry, BSM
  • Data Paths: CERN - FNAL (Tier 1) - UCSD (Tier 2) via ESnet and CENIC/I2
  • Peak Bandwidths:
    – Current: 10+5 Gbps
    – 2015: 40 Gbps when LHC operates @ 14 TeV

slide-27
SLIDE 27

Education & Training: UCSD Telemedicine Center

slide-28
SLIDE 28

CHERuB Implementation Status

  • January, 2014
    – Project funded, equipment on order
  • February, 2014
    – Equipment received
    – Production network switch upgraded
  • March, 2014
    – Campus gateway upgraded, connected to regional 100G feed
    – Successful border-to-regional test @100Gbps
  • Next steps (April/May):
    – Connect Prism switch, test @2x40Gbps
    – Connect SDSC infrastructure, test @100Gbps
    – Connect production switch, test @4x10Gbps
  • Production Goal: September 2014

slide-29
SLIDE 29

Comet is a ~2000 TeraFLOP System Architected for the “Long Tail of Science”

  • NSF Track 2 award to SDSC
  • $12M NSF award to acquire
  • $3M/yr x 4 yrs to operate
  • Production early 2015