Interoperation with Interoperation with Infrastructures: - - PowerPoint PPT Presentation

interoperation with interoperation with infrastructures
SMART_READER_LITE
LIVE PREVIEW

Interoperation with Interoperation with Infrastructures: - - PowerPoint PPT Presentation

Interoperation with Interoperation with Infrastructures: Infrastructures: NDGF-EGEE NDGF-EGEE Michael Grnager, PhD Technical Coordinator, NDGF International Symposium on Grid Computing 08 Taipei , April 11 th 2008 Outline Outline


slide-1
SLIDE 1

Interoperation with Interoperation with Infrastructures: Infrastructures: NDGF-EGEE NDGF-EGEE

Michael Grønager, PhD Technical Coordinator, NDGF International Symposium on Grid Computing 08 Taipei, April 11th 2008

slide-2
SLIDE 2

ISGC08, Taipei, April, 2008 2

Outline Outline

 History, Motivation and Goal  A Job Lifetime Tour

gLite ARC

 Interoperability

slide-3
SLIDE 3

ISGC08, Taipei, April, 2008 3

History History

 NorduGrid ARC

Initiated in 2001 as “Nordic EDG” In production in 2002 Only grid in CERN ATLAS DC1

 EGEE gLite

Started in 2000 as EDG Operated by EGEE since 2004 Today worlds largest grid

slide-4
SLIDE 4

ISGC08, Taipei, April, 2008

NDGF Facility - 2008Q1 NDGF Facility - 2008Q1

slide-5
SLIDE 5

ISGC08, Taipei, April, 2008

Motivation Motivation

 Accounting export from

SGAS to APEL

Biggest EU Tier-1 for

ATLAS in 2007

 Service Availability

Monitoring – via WLCG SAM sensors for ARC-CE

Top reliable Tier-1

worldwide

 Operation integrated in

CIC-on-Duty

slide-6
SLIDE 6

ISGC08, Taipei, April, 2008

 Accounting export from

SGAS to APEL

Biggest EU Tier-1 for

ATLAS

 Service Availability

Monitoring – via WLCG SAM sensors for ARC-CE

Top reliable Tier-1

worldwide

Motivation Motivation

The Nordic infrastructure has a high degree of compatibility with the EGEE infrastructure. The resources contributed via NDGF was in 2007 the biggest North European EGEE site with 40% of all computations. - that is 4% of entire EGEE and the 5th biggest European EGEE site. Only missing part is job submission !

slide-7
SLIDE 7

ISGC08, Taipei, April, 2008 7

Motivation Motivation

 Why not just install gLite ?

ARC is deployed at 70 sites (~20000CPUs) Runs on several OS'es Optimal resource usage

 Why not use ARC directly ?

ATLAS can - and also through PanDa... Hard for smaller VOs to integrate a new grid Simplify deployment with only one m/w

 Be an integral part of the European grid !

Operation, Monitoring, Accounting already

there...

slide-8
SLIDE 8

ISGC08, Taipei, April, 2008 8

Many OS'es ! Many OS'es !

slide-9
SLIDE 9

ISGC08, Taipei, April, 2008 9

gLite resource usage gLite resource usage

slide-10
SLIDE 10

ISGC08, Taipei, April, 2008 10

gLite resource usage gLite resource usage

 CE role:  submission to LRMS  some status checks  WN role:  data handling  some status checks  logging and bookeeping  run the job  Many nodes idle when handling data (up/download)  Not in line with site requirements (WN world access,

specific OS etc)

slide-11
SLIDE 11

ISGC08, Taipei, April, 2008 11

ARC resource usage ARC resource usage

slide-12
SLIDE 12

ISGC08, Taipei, April, 2008 12

ARC resource usage ARC resource usage

 CE role:  submission to LRMS  status checks  logging and book keeping  WN role:  run the job!  CE handles all data up and download:  Wall time = CPU time !  No connections needed from/to WN  No software needed on WN

slide-13
SLIDE 13

ISGC08, Taipei, April, 2008 13

Job Lifetime Tour Job Lifetime Tour

 Job description language – not enough  Submission protocol – not enough  Information Index – not enough  We need to understand the entire Job Lifetime

Cycle!

 ... And we also need integration with:

Operation Monitoring Accounting

slide-14
SLIDE 14

ISGC08, Taipei, April, 2008 14

JobCycle: gLite JobCycle: gLite

slide-15
SLIDE 15

ISGC08, Taipei, April, 2008 15

JobCycle: gLite JobCycle: gLite

glite-wms-job-submit

slide-16
SLIDE 16

ISGC08, Taipei, April, 2008 16

JobCycle: gLite JobCycle: gLite

WMProxy

slide-17
SLIDE 17

ISGC08, Taipei, April, 2008 17

JobCycle: gLite JobCycle: gLite

Brokering

slide-18
SLIDE 18

ISGC08, Taipei, April, 2008 18

JobCycle: gLite JobCycle: gLite

LDAP Query Brokering

slide-19
SLIDE 19

ISGC08, Taipei, April, 2008 19

JobCycle: gLite JobCycle: gLite

Jobwrapper Brokering

slide-20
SLIDE 20

ISGC08, Taipei, April, 2008 20

JobCycle: gLite JobCycle: gLite

Condor-C Submission

slide-21
SLIDE 21

ISGC08, Taipei, April, 2008 21

JobCycle: gLite JobCycle: gLite

blah-submit

slide-22
SLIDE 22

ISGC08, Taipei, April, 2008 22

JobCycle: gLite JobCycle: gLite

JobWrapper

slide-23
SLIDE 23

ISGC08, Taipei, April, 2008 23

JobCycle: gLite JobCycle: gLite

JobWrapper Staging Files Staging Files

slide-24
SLIDE 24

ISGC08, Taipei, April, 2008 24

JobCycle: gLite JobCycle: gLite

JobWrapper Staging Files Staging Files WN Idle !

slide-25
SLIDE 25

ISGC08, Taipei, April, 2008 25

JobCycle: gLite JobCycle: gLite

JobWrapper Running

slide-26
SLIDE 26

ISGC08, Taipei, April, 2008 26

JobCycle: gLite JobCycle: gLite

JobWrapper Reporting Status: Maradona Reporting Status: BLAH status Reporting Status: LB

slide-27
SLIDE 27

ISGC08, Taipei, April, 2008 27

JobCycle: gLite JobCycle: gLite

JobWrapper Staging Files Staging Files

slide-28
SLIDE 28

ISGC08, Taipei, April, 2008 28

JobCycle: gLite JobCycle: gLite

JobWrapper Staging Files Staging Files WN Idle !

slide-29
SLIDE 29

ISGC08, Taipei, April, 2008 29

JobCycle: gLite JobCycle: gLite

JobWrapper Reporting Status: Maradona Reporting Status: BLAH status Reporting Status: LB

slide-30
SLIDE 30

ISGC08, Taipei, April, 2008 30

JobCycle: gLite JobCycle: gLite

Finished

slide-31
SLIDE 31

ISGC08, Taipei, April, 2008 31

JobCycle: gLite JobCycle: gLite

Finished glite-wms-job-status WMProxy LB

slide-32
SLIDE 32

ISGC08, Taipei, April, 2008 32

JobCycle: ARC JobCycle: ARC

slide-33
SLIDE 33

ISGC08, Taipei, April, 2008 33

JobCycle: ARC JobCycle: ARC

arcsub

slide-34
SLIDE 34

ISGC08, Taipei, April, 2008 34

JobCycle: ARC JobCycle: ARC

brokering LDAP Query

slide-35
SLIDE 35

ISGC08, Taipei, April, 2008 35

JobCycle: ARC JobCycle: ARC

submission gsiftp

slide-36
SLIDE 36

ISGC08, Taipei, April, 2008 36

JobCycle: ARC JobCycle: ARC

preparing staging

slide-37
SLIDE 37

ISGC08, Taipei, April, 2008 37

JobCycle: ARC JobCycle: ARC

submission to LRMS running

slide-38
SLIDE 38

ISGC08, Taipei, April, 2008 38

JobCycle: ARC JobCycle: ARC

FINISHING uploading

slide-39
SLIDE 39

ISGC08, Taipei, April, 2008 39

JobCycle: ARC JobCycle: ARC

FINISHED reporting status

slide-40
SLIDE 40

ISGC08, Taipei, April, 2008 40

JobCycle: ARC JobCycle: ARC

arcstat gsiftp

slide-41
SLIDE 41

ISGC08, Taipei, April, 2008 41

Interoperability Interoperability

 Protocol  Unify protocols  WMS  Implement ARC submission in WMS  Gateway  Introduce a gateway between gLite and ARC  Co-installation  deploy ARC and gLite simultaneous

slide-42
SLIDE 42

ISGC08, Taipei, April, 2008 42

Protocol Protocol

 Current candidate: OGSA BES  During implementation by INFN for CREAM CE  During implementation by KnowARC for ARC v.1  HOWEVER:  handles only small part of the job cycle

 (still LB, Maradona, staging...)

 no unification of data handling  will work for “hello world”  a lot more work needed

slide-43
SLIDE 43

ISGC08, Taipei, April, 2008 43

WMS WMS

 Work initiated in 2006  Uses Condor to submit to ARC  Functional today  Requires:  Special gLite-WN Runtime Environment  Proxies on WNs  outbound connections open

slide-44
SLIDE 44

ISGC08, Taipei, April, 2008 44

WMS WMS

slide-45
SLIDE 45

ISGC08, Taipei, April, 2008 45

Gateway Gateway

slide-46
SLIDE 46

ISGC08, Taipei, April, 2008 46

Gateway Gateway

 Modified gLite-CE to submit to ARC  Add ARC as another LRMS to BLAH  Parses the JobWrapper

 Handles data up and download  Handles LB events  Handles Maradona status events

 Adapts gLite job to run on ARC  No data handling by WN  No need for proxy on WN  No need for “outbound” WNs  Functional today – needs some tweaks on infosystem

slide-47
SLIDE 47

ISGC08, Taipei, April, 2008 47

Conclusions Conclusions

 Interoperability is many tasks:

Job life-cycle Data flow Status Monitoring Accounting Operation

slide-48
SLIDE 48

ISGC08, Taipei, April, 2008 48

Conclusions Conclusions

 Interoperability is many tasks:

Job life-cycle Data flow Status Monitoring Accounting Operation

 Two functional schemes exists for NDGF-EGEE

WMS Gateway

slide-49
SLIDE 49

ISGC08, Taipei, April, 2008 49

Acknowledgements Acknowledgements

Thanks to: Tord Ekelöf, Mattias Ellert, Laurence Field, Claudio Grandi, Daniel Johannson, Oliver Keeble, Josva Kleist, Balazs Konya, Erwin Laure, Francesco Preltz, Di Qing, Markus Schultz, Anders Selander, Oxana Smirnova, David Smith, Christian Søttrup, Mattias Wadenstein, Rod Walker, Anders Wäänänen and many others...

slide-50
SLIDE 50

ISGC08, Taipei, April, 2008 50

Thanks! Thanks!

Questions?