ARC Storage Solution, NGIn School Jon K. Nilsen, Dept. Of Physics, - - PowerPoint PPT Presentation

arc storage solution ngin school
SMART_READER_LITE
LIVE PREVIEW

ARC Storage Solution, NGIn School Jon K. Nilsen, Dept. Of Physics, - - PowerPoint PPT Presentation

ARC Storage Solution, NGIn School Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo Outline KnowARC storage solution Current ARC Data Management pros and cons Requirements for KnowARC solution Storage solution by KnowARC


slide-1
SLIDE 1

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

ARC Storage Solution, NGIn School

slide-2
SLIDE 2

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Outline

  • KnowARC storage solution

– Current ARC Data Management – pros and cons – Requirements for KnowARC solution – Storage solution by KnowARC

  • NGIn School

– Program – Experiences – Summary

slide-3
SLIDE 3

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Grid Data Management Services

  • What services do we need?

– Storage Manager – Storage Elements – Indexing services – File transfer services

  • Internal and external communication through web

services

slide-4
SLIDE 4

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Current ARC – pros and cons

  • Good

– Easy to install and configure – Multi-OS, multi back-end, non-intrusive – Stable, high performance services – Good error handling

  • Bad

– Non Service Oriented Architecture – No native indexing service – No consistent data management solution – Limited data management user interface – No native file transfer service

slide-5
SLIDE 5

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Required capabilities

(from KnowARC design document)

  • Scalable, reliable indexing service capable of operating
  • n collections
  • Indexing service should not appear as a single point of

failure

  • Metadata should be extendable to have application-

specific information

  • Must have standard interfaces and API
  • Must be aware of VO-specific storage solutions, and

use them for storage

– E.g., WLCG has SRM as standard solution, so ARC needs to support SRM

slide-6
SLIDE 6

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Required Capabilities (cont'd)

  • Must handle bulk data manipulation
  • Must be aware of physical file replicas and use this

information to optimize file transfer

  • Must support secondary storage facilities and flexible

data staging

  • User should see single access point to data (ref.

Google)

  • Should provide simple (POSIX-like) access to data for

applications

  • Easy to install
  • Easy to use
slide-7
SLIDE 7

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

KnowARC storage design

slide-8
SLIDE 8

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Catalog

  • Maintains all metadata

– GUID – Logical -> physical filename – Is it replicated – Is it marked for deletion – Access control list

  • Metadata stored in a

Distributed Hash Table

slide-9
SLIDE 9

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Catalog

  • Interface

– New file – New collection – Get metadata – Remove – Change metadata – Traverse logical name

slide-10
SLIDE 10

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Storage Elements

  • Where the physical files

are stored

  • With possibility to mount

third-party storage

  • Interface

– Get file – Put file – Delete file – Copy file to a URL

slide-11
SLIDE 11

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Storage Manager

  • Provides high-level

interface to user

  • Communicates with

catalog to get metadata and create new files and collections

  • Communicates with

Storage Elements to initiate transfers

slide-12
SLIDE 12

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Storage Manager

  • Interface

– Put file – Get file – Delete file – Make new collection – List collection – Move file or collection – Modify metadata – Stat (get metadata)

slide-13
SLIDE 13

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Download

slide-14
SLIDE 14

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

LFN, PFN, GUID, Source URL, Transfer URL

  • Logical file name: some/filename
  • Physical file name:

a.nice.site:4242/tmp/some/filename

  • Global Unique Identifier: 63047220-2dd5-11d9-9669-

0800200c9a66

  • Source URL:

srm://a.nice.site:4242/tmp/some/filename

  • Transfer URL:

gridftp://a.nice.site:4242/tmp/some/filename or

gridftp://yet.another.site:4242/MyDocuments/tuid

slide-15
SLIDE 15

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

ARC Storage Discussions

  • Builds on an already working solution

– Grid Underground Storage System – Deployed in Hungary – with one catalog, one storage element and

  • ne manager (but no problem to expand it)
  • The distributed hash catalog is the key element

– One promising algorithm (Etna) is being considered – No suitable implementations of it yet

  • What is the intended use of this system?

– One ring (of catalogs) to rule them all? – Or yet another dCache solution?

slide-16
SLIDE 16

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

NGIn School

slide-17
SLIDE 17

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

NGIn School

  • NGin: Innovative tools and services for NorduGrid
  • Project funded by the NorduNet3 program
  • Goal is twofold

– Extend existing Grid middleware – Train new Grid experts, securing future technology development

  • Training program includes Grid PhD positions and a

Nordic Grid School, the NGIn School

  • First Grid School at NorduGrid07
slide-18
SLIDE 18

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

NGIn School program

  • Day 1 (General Grid Introduction)

– Intro to Grid – Intro to ARC – First steps with ARC (tutorials)

  • Day 2 (Specialized tutorials)

– HEP distributed analysis – ARC in bioinformatics, ARC in medical imaging – ARC-gLite interoperability – Grid Job Manager – Dynamic Runtime Environments

  • Day 3 (ARC Development)

– Sys-admin and developer training

slide-19
SLIDE 19

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

Experiences

  • Day 1

– ~20 participants – Interesting introductions to Grid and ARC middleware – Very general topics

  • Day 2

– ~8 active participants – Quite specific topics -> more fluctuating participants –

  • Day 3

– Merged into NorduGrid parallel session – Training consisted of Ferenc writing an ARC service on the beamer

dN participants/dt0

slide-20
SLIDE 20

Jon K. Nilsen, Dept. Of Physics, Univ. Of Oslo

NGIn School Summary

  • A very nice school for newcomers
  • From newbie to specialized user in two days
  • Maybe not so useful for NGIn PhDs
  • Could have been more emphasis on developer training
  • Who was the audience of the school?

– Recruiting new Grid users? – Training NGIn PhDs? – Both?