CS 5412/LECTURE 13. CEPH: A SCALABLE HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM
Ken Birman Spring, 2020
HTTP://WWW.CS.CORNELL.EDU/COURSES/CS5412/2020SP 1
HDFS LIMITATIONS
Although many applications are designed to use the normal POSIX
Storage capacity, throughput, client performance. Emphasis on HPC.
“…failures are the norm rather than the exception…”
Dynamic workloads
CRUSH
Files striped onto predictably named objects
CRUSH maps objects to storage devices
Dynamic subtree partitioning
Distributes metadata amongst MDSs
OSDs handle migration, replication, failure detection and recovery
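The striping bullet above can be illustrated with a minimal sketch. It assumes the simplest possible layout (one fixed-size object per stripe) and a hypothetical name format of inode number plus stripe number in hex; the function name, the 4 MiB object size, and the exact format are assumptions for illustration, not taken from the slides. The point is that any client can compute the object name from the inode and offset alone, with no lookup table.

```python
def object_for_offset(ino: int, offset: int, object_size: int = 4 * 1024 * 1024):
    """Map a byte offset in a file to (object name, offset inside that object).

    Hypothetical naming scheme: "<inode in hex>.<stripe number as 8 hex digits>".
    """
    if offset < 0:
        raise ValueError("offset must be non-negative")
    # Which fixed-size object holds this byte, and where inside it.
    stripe_no, within = divmod(offset, object_size)
    return f"{ino:x}.{stripe_no:08x}", within


# Byte 5 of the second 4 MiB stripe of inode 0x1234.
name, within = object_for_offset(0x1234, 4 * 1024 * 1024 + 5)
print(name, within)
```

Because the name is a pure function of the inode and the offset, the metadata servers never need to store a per-file block list.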
Nearly POSIX
Decoupled data and metadata operation
FUSE or directly linked
Consistency / correctness by default
Optionally relax constraints via extensions
Extensions for both data and metadata
Repetitive metadata updates handled in memory
Optimizes on-disk layout for read access
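One way to picture these two points is a journal that appends every update sequentially while an in-memory table keeps only the latest state per entry, so repeated updates to the same file collapse before anything is written to its long-term location. The sketch below is a toy model under that assumption; the class `MetadataJournal` and its methods are hypothetical names, not the MDS implementation.

```python
from collections import defaultdict


class MetadataJournal:
    """Toy model: append-only journal plus an in-memory table of dirty entries."""

    def __init__(self):
        self.log = []    # sequential journal: every update is appended here
        self.dirty = {}  # (directory, name) -> merged latest attributes

    def update(self, directory, name, attrs):
        # Appending to the journal is cheap and sequential.
        self.log.append((directory, name, dict(attrs)))
        # Repeated updates to the same entry are merged in memory.
        self.dirty.setdefault((directory, name), {}).update(attrs)

    def flush(self):
        """Return the coalesced state grouped by directory, then trim the journal.

        Grouping by directory stands in for a read-friendly on-disk layout,
        where a whole directory can be fetched together.
        """
        by_dir = defaultdict(dict)
        for (directory, name), attrs in self.dirty.items():
            by_dir[directory][name] = attrs
        self.log.clear()
        self.dirty.clear()
        return dict(by_dir)


journal = MetadataJournal()
journal.update("/a", "f", {"size": 1})
journal.update("/a", "f", {"size": 2})   # supersedes the previous size
journal.update("/a", "g", {"mtime": 5})
print(len(journal.log), journal.flush())
```

Three journal appends result in only two entries written at flush time, both grouped under their directory.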
Inputs
x is the placement group
Hierarchical cluster map
Placement rules
Outputs a list of OSDs
Anyone can calculate object location
Cluster map infrequently updated
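The inputs and outputs listed above can be sketched as a pure function. The code below is not the CRUSH algorithm: it substitutes rendezvous (highest-random-weight) hashing for CRUSH's bucket selection, uses a two-level map (racks containing OSDs) as a stand-in for the hierarchical cluster map, and hard-codes a single placement rule ("one OSD from each of n distinct racks"). All names (`object_to_pg`, `place`, the rack and OSD labels) are hypothetical. It does show the key property: anyone holding the same cluster map computes the same OSD list without asking a central server.

```python
import hashlib


def _score(*parts) -> int:
    """Deterministic pseudo-random score derived from the given inputs."""
    key = "/".join(str(p) for p in parts).encode()
    return int.from_bytes(hashlib.sha256(key).digest()[:8], "big")


def object_to_pg(object_name: str, pg_count: int) -> int:
    """Hash an object name to a placement group number."""
    return _score("obj", object_name) % pg_count


def place(pg: int, cluster_map: dict, replicas: int) -> list:
    """Return one OSD from each of `replicas` distinct racks for placement group `pg`.

    cluster_map maps rack name -> non-empty list of OSD names (assumed layout).
    """
    if replicas > len(cluster_map):
        raise ValueError("not enough racks for the requested replica count")
    # Rule step 1: pick the racks with the highest scores for this PG.
    racks = sorted(cluster_map, key=lambda r: _score(pg, r), reverse=True)[:replicas]
    # Rule step 2: inside each chosen rack, pick the highest-scoring OSD.
    return [max(cluster_map[r], key=lambda o: _score(pg, r, o)) for r in racks]


cluster_map = {
    "rack-a": ["osd.0", "osd.1"],
    "rack-b": ["osd.2", "osd.3"],
    "rack-c": ["osd.4", "osd.5"],
}
pg = object_to_pg("1234.00000001", pg_count=64)
print(pg, place(pg, cluster_map, replicas=2))
```

With this kind of scoring, removing a rack that was not selected for a placement group leaves that group's OSD list unchanged, which is the sort of stability that makes an infrequently updated cluster map workable.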
Client is oblivious to replication
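A minimal sketch of what this can look like, assuming a primary-copy scheme in which the client sends its write to the first OSD in the placement list and that OSD forwards it to the others before acknowledging. The `OSD` class and `client_write` function are hypothetical, and the sketch ignores failures, ordering, and durability.

```python
class OSD:
    """Toy object storage device holding objects in a dict."""

    def __init__(self, name):
        self.name = name
        self.store = {}

    def apply(self, obj, data):
        self.store[obj] = data
        return True

    def handle_client_write(self, obj, data, replicas):
        # The primary applies the write, forwards it to every replica,
        # and acknowledges only after all of them have applied it.
        self.apply(obj, data)
        acks = [replica.apply(obj, data) for replica in replicas]
        return all(acks)


def client_write(placement, obj, data):
    """The client talks only to the primary; replication happens behind it."""
    primary, *replicas = placement
    return primary.handle_client_write(obj, data, replicas)


osds = [OSD(f"osd.{i}") for i in range(3)]
print(client_write(osds, "1234.00000001", b"hello"))
```

The client issues a single request and receives a single acknowledgement, regardless of how many copies exist.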
Compare latencies of (a) an MDS where all metadata is stored in a shared OSD cluster and (b) an MDS that has a local disk containing its journal
Figure: air traffic control (ATC) system architecture. Labeled components:
Air traffic controllers update flight plans
Flight plan manager tracks current and past flight plan versions
Message bus
Microservices for various tasks, such as checking future plane separations, scheduling landing times, predicting weather issues, offering services to the airlines
WAN link to other ATC centers
Flight plan update broadcast service
Figure: timeline (time →, overhead →) across ATC controller, Version Mgr, Message Bus, ATC rules checker, . . ., marking points at which we might do serialization/deserialization.
Figure: timeline (time →, overhead →) across ATC controller, Version Mgr, Message Bus, ATC rules checker, . . ., marking points at which we might do serialization/deserialization. Annotation: Wasted work!
Figure: timeline (time →, overhead →) across ATC controller, Version Mgr, Message Bus, ATC rules checker, . . ., with each step marked A or B. Dual scheme reduces overheads! Annotation at one step: Here we fetch the full data for the flight plan from the flight plan database.
Figure: ATC system architecture (air traffic controllers, flight plan manager, message bus, microservices, WAN link to other ATC centers, flight plan update broadcast service). Annotation: If this one component is correct, the whole system can be proved safe!