S N he e ur a title
Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA
S N he e ur a title Operated by Los Alamos National Security, - - PowerPoint PPT Presentation
S N he e ur a title Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA S Los Alamos National Laboratory LA-UR-17-24107 MarFS and DeltaFS @ LANL you e User Level FS Challenges and Opportunities logo
S N he e ur a title
Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA
S you e logo and delete wo e is
Los Alamos National Laboratory
Brad Settlemyer, LANL HPC May 16, 2017
LA-UR-17-24107
Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA
Los Alamos National Laboratory 5/16/17 | 3
Los Alamos National Laboratory 5/16/17 | 4
Los Alamos National Laboratory 5/16/17 | 5
Los Alamos National Laboratory 5/16/17 | 6
Los Alamos National Laboratory 5/16/17 | 7
security is not the same as POSIX security
Los Alamos National Laboratory 5/16/17 | 8
Obj001 Obj repo 1 Obj repo 2 Obj001 Obj002 /GPFS-MarFS-md1 /GPFS-MarFS-md2 Dir1.1 Dir2.1 UniFile-1 All md is just normal except mtime and size which are set by pftool/fuse
Additional meta: Xattr-objid repo=1 id=Obj001
chunksize=256M
MultiFile-1 Obj repo1 access methods info Obj repo2 access methods info Config file/db
Additional meta: Xattr-repo=2 chunksize=256M,
(means it’s a multi-part file and the obj/offset list is in the GPFS mdfile File: list of obj name space/objname/offset/ length (obj name space=2, Obj001 offs/ length, Obj002 … Xattr-restart
trashdir trashdir Lazy Tree Info Lazy Tree Info
Scality, S3, erasure, etc.
GPFS MarFS Metadata File System(s). /MarFS top level namespace aggregation
Los Alamos National Laboratory 5/16/17 | 9
Load Balancer Scheduler Reporter Stat Readdir Stat Copy/Rsync/ Compare
D
e Q u e u e
Dirs Queue Stat Queue Cp/R/C Queue
Los Alamos National Laboratory 5/16/17 | 10
Scratch1 (78 PB) Store1 (~3PB) Store2 (38PB) Store3 (38PB)
FTA FE FTA1 FTA2 FTA3 FTA4 FTA5 FTA6
User: Submit batch job pfcp –r /scratch1/fs1 /marfs/fs1
FTA Cluster A collection of pftool worker nodes capable
data movement in parallel
Los Alamos National Laboratory 5/16/17 | 11
storage (1 RING), 1GPFS cluster)
Los Alamos National Laboratory 5/16/17 | 12
Los Alamos National Laboratory 4/19/17 | 12
File Transfer Agent Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 D D D D D D D D D P P Each Zpool is a 17+3 Parity of 10+2 Storage Node Zpool 1 Zpool 2 Zpool 3 Zpool 4 D Meta-data servers Storage nodes in separate racks Multiple JBODs per Storage Node Data and Parity are round-robined to storage nodes Storage Nodes NFS export to FTAs
Los Alamos National Laboratory 5/16/17 | 13
storage (1 RING), 1GPFS cluster)
Los Alamos National Laboratory 4/19/17 | 16
Trinity Open Science Key 4 PB Trinity Production (current) Future MarFS deployment (Crossroads 2020)
Los Alamos National Laboratory 5/16/17 | 14
Los Alamos National Laboratory 5/16/17 | 15
1Trillion particles (S. Byna)
Los Alamos National Laboratory 5/16/17 | 16
range of 0.1 – 10%
trajectory of the particles with highest energy at end of simulation
easy to dump the highest energy particle ids
Los Alamos National Laboratory 5/16/17 | 17
Los Alamos National Laboratory 5/16/17 | 18
Los Alamos National Laboratory 5/16/17 | 19
Los Alamos National Laboratory 5/16/17 | 20
Los Alamos National Laboratory 5/16/17 | 21
Los Alamos National Laboratory 5/16/17 | 22
Los Alamos National Laboratory 5/16/17 | 23
Los Alamos National Laboratory 5/16/17 | 24
Los Alamos National Laboratory 5/16/17 | 25
Burst Buffer(3.7 PB, 3.3 TB/s)
Tightly Coupled Parallel Application
Parallel File System(78PB, 1.145 TB/s) Tape Archive
Object Storage (300PB*, 100GB/s) Trinity Platform
Los Alamos National Laboratory 5/16/17 | 26
***/Fast Storage Loosely-coupled Parallel Application (DeltaFS)
MarFS/Object Storage Future Platform Vision
Los Alamos National Laboratory 5/16/17 | 27
in-situ (both seemed right to me)
Los Alamos National Laboratory 5/16/17 | 28