Introduction L. Poggioli, LAL ATLAS latest news Actions list - - PowerPoint PPT Presentation




SLIDE 1

CAF_T2_150514 Luc 1

Introduction

  • L. Poggioli, LAL
  • ATLAS latest news
  • Actions list

Inputs from recent meetings

– LCG-TECH ftf 16/04

https://indico.in2p3.fr/conferenceDisplay.py?confId=9731

– Pre-GDB 13/05

http://indico.cern.ch/event/272787/

SLIDE 2

CAF_T2_150514 Luc 2

Pledges: C-RSG feedback (1)

CPU reduction: HLT usage should be pledged

SLIDE 3

CAF_T2_150514 Luc 3

Pledges: C-RSG feedback (2)

Potential budget crisis?

  • Run-2 LHC parameters likely to be pessimistic
  • Flat budget model limitations (C-RSG study ongoing)
SLIDE 4

CAF_T1_150514 Luc 4

Activities since last CAF

OK but big fluctuations from production

SLIDE 5

CAF_T1_150514 Luc 5

Production last month (1)

SW not ready / Lack of jobs to process

SLIDE 6

CAF_T1_150514 Luc 6

Production forecast

  • Based on delivery of new sw release 19.x.x end May
  • Till then NO MCORE activities

Claire, Wolfgang ADC weekly 29/04

SLIDE 7

CAF_T2_150514 Luc 7

MCORE

  • Today

– MC12 simul & reco 1core
– MC14 xcore
– Xcore: 25-30% total

  • Soon

– Only MC14
– Xcore: 60-70% total

  • “Big” sites

– Asked to deploy xcore dynamically (a priori easy in Torque)
– Timescale: 1-2 months (i.e. to be ready for DC14)

After clarification with Andrej, Simone, Andreu

SLIDE 8

CAF_T2_240314 Luc 8

JIRA, RUCIO

  • Savannah -> JIRA migration

– Almost done (DPD project remains to move)

  • RUCIO

– Migrated clouds: ALL but US, CERN, NDGF
– Full commissioning stress test (on real but not official data) 20/05-End June
– All DDM endpoints have new naming except /SAM/ subdirectory

  • Files and directories not following this convention are out of DDM catalogs (dark data) and can be removed
  • To be done per site/squad
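
The dark-data definition above (files on storage that are absent from the DDM catalogs) amounts to a set difference. A minimal Python sketch, assuming a site can export one plain list of paths from its storage and another from the catalog; the function name, the sample paths, and the explicit /SAM/ exemption are illustrative assumptions, not part of any RUCIO or DDM tool:

```python
def find_dark_data(storage_paths, catalog_paths, exempt_fragments=("/SAM/",)):
    """Return storage paths that no catalog entry accounts for.

    Paths containing one of `exempt_fragments` are skipped (assumption:
    the /SAM/ subdirectory is exempt from the naming convention and
    should not be flagged for removal).
    """
    catalog = set(catalog_paths)
    dark = []
    for path in storage_paths:
        if any(fragment in path for fragment in exempt_fragments):
            continue  # exempt area, never reported as dark
        if path not in catalog:
            dark.append(path)
    return sorted(dark)


# Hypothetical dumps, for illustration only.
storage_dump = [
    "/atlas/rucio/mc14/aa/bb/file1.root",
    "/atlas/old_naming/file2.root",
    "/atlas/SAM/testfile",
]
catalog_dump = ["/atlas/rucio/mc14/aa/bb/file1.root"]

dark_candidates = find_dark_data(storage_dump, catalog_dump)
```

The result is only a candidate list; each site or squad would still review it before deleting anything.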
SLIDE 9

CAF_T2_240314 Luc 9

perfSonar

  • All FR-sites have deployed perfSonar
  • Monitoring http://maddash.aglt2.org/maddash-webui/
  • More a site tool than a VO tool
  • F. Schaer is following

– e.g. firewall issues, asymmetries, inconsistencies

Figure panels: Bandwidth, Latency

SLIDE 10

CAF_T2_240314 Luc 10

XrootD/FAX/HTTP (1)

  • ATLAS priorities to sites (for T1s/T2Ds)

1) Enable xrootD data access
2) Enable FAX
3) Enable HTTP/WebDAV data access

  • Done for most FR-T2Ds & T1

– LPC ongoing, LPSC?

  • FAX: 48 sites in

– Failover mode

  • 241 queues (prod & ana)
  • Tiny % of network used

– Overflow mode

  • Leave data, move job
  • Tested for US sites
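
The failover mode listed above can be pictured as an ordered list of data sources: a job tries its local storage first and turns to the federation only if that fails. A minimal Python sketch of that control flow; the two opener functions are hypothetical stand-ins, and no real XRootD or FAX client call is used:

```python
def open_with_failover(filename, openers):
    """Try each (name, opener) pair in order.

    Returns (source_name, handle) from the first opener that succeeds
    and raises OSError if every source fails.
    """
    errors = []
    for name, opener in openers:
        try:
            return name, opener(filename)
        except OSError as exc:
            errors.append(f"{name}: {exc}")
    raise OSError("all sources failed: " + "; ".join(errors))


def local_open(filename):
    # Hypothetical local storage access that fails for this file.
    raise FileNotFoundError(f"{filename} not on local storage")


def federation_open(filename):
    # Hypothetical federated access; returns a placeholder handle.
    return f"remote handle for {filename}"


source, handle = open_with_failover(
    "data.root",
    [("local", local_open), ("federation", federation_open)],
)
```

Overflow mode differs in that the choice is made at brokering time (the job moves, the data stays), so it is not covered by this sketch.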

SSB dashboard Rob, pre-GDB

SLIDE 11

CAF_T2_240314 Luc 11

XrootD/FAX/HTTP (2)

  • WebDAV

– All functionalities to manipulate data

  • RUCIO aware of sites DT
  • RUCIO knows where to find replicas

– Supports FTS (candidate to replace SRM)

  • When final RUCIO migration done

– Possible to access files via WebDAV
– 62 sites today, 329 endpoints
– Access via RUCIO redirector or Metalinks

  • Efforts needed

– SAM tests, Monitoring

Cédric, pre-GDB

SLIDE 12

LCG-TECH_16042014 Luc Poggioli 12

Remote access to local storage

  • ATLAS recommendation

– xrootd direct access for analysis
– xrdcp for production (copy-to-scratch)

  • For dpm sites

– Most sites are using copy-to-scratch with lcg-cp, but should be encouraged to move to xrootd

  • More efficient than gridftp copy-to-scratch
  • RFIO broken for directIO
  • For dCache

– Analysis: Xrootd & dcap similar performance
– Prod: xrdcp (e.g. at CC)
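
The two access patterns recommended above differ in what the job is handed: a root:// URL to open in place (direct access) or a local copy made with xrdcp before processing (copy-to-scratch). A minimal Python sketch that only builds the URL and the command line, without running anything; the host name, file path, scratch directory, and helper names are illustrative assumptions:

```python
import posixpath


def xrootd_url(host, path):
    """Build a root:// URL; the absolute path follows a double slash."""
    return "root://{}/{}".format(host, "/" + path.lstrip("/"))


def copy_to_scratch_command(host, path, scratch_dir):
    """Return the argument list for `xrdcp <source URL> <local file>`."""
    local_copy = posixpath.join(scratch_dir, posixpath.basename(path))
    return ["xrdcp", xrootd_url(host, path), local_copy]


# Hypothetical endpoint and file, for illustration only.
host = "xrootd.example.org"
path = "/atlas/data/file.root"

# Analysis: the job opens this URL directly.
direct_access_url = xrootd_url(host, path)

# Production: the job copies the file to scratch first.
command = copy_to_scratch_command(host, path, "/scratch/job1")
```

A wrapper could pass `command` to `subprocess.run` once a real endpoint is configured.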

Campana, Elmheuser, Manoulis

SLIDE 13

CAF_T2_240314 Luc 13

Actions List (1)

  • FAX

– Follow test jobs results & understand

  • perfSonar

– Follow BW & latency performance

  • Xrootd DA for analysis queues

– If no objection from sites, proceed

  • MCORE

– Try with IRFU, TOKYO
– Dynamic

  • Pledges deployment?
SLIDE 14

CAF_T2_240314 Luc 14

Actions List (2)

  • ARC CE

– Machines in parallel; try CPPM, LPSC?

  • Sites

– Analysis queue at Beijing: done
– Romania: RO-07 as T2D

  • Support

– DAST situation improved (e.g. Laurent has joined)
– Squad

  • No news bad news
  • Recontact all sites (eg Saclay, LAPP, LPC)
  • Next week: HEPIX 19-23 May at LAPP