A “Hitchhiker’s” Guide to Fast and Efficient Data Reconstruction in Erasure-coded Data Centers
- K. V. Rashmi, Nihar Shah, D. Gu,
- H. Kuang, D. Borthakur, K. Ramchandran
[Figure: data units a through j stored on machines under TOR switches, which connect through an AS/router]
Scheme        block 1  block 2  block 3  block 4  Overhead  Fault tolerance
Replication   a        b        a        b        2x        tolerates any one failure
Erasure code  a        b        a+b      a+2b     2x        tolerates any two failures

In the erasure code, blocks 1 and 2 are data blocks and blocks 3 and 4 are parity blocks.
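The claim that the four blocks a, b, a+b, a+2b tolerate any two failures can be checked with a minimal sketch. It assumes ordinary rational arithmetic for readability, whereas deployed erasure codes typically work over a finite field; the names `COEFFS`, `encode`, and `decode` are illustrative and not taken from the slides.

```python
from fractions import Fraction
from itertools import combinations

# Each block is c_a * a + c_b * b. Rows correspond to a, b, a+b, a+2b.
COEFFS = [(1, 0), (0, 1), (1, 1), (1, 2)]

def encode(a, b):
    """Return the four stored blocks for data units a and b."""
    return [ca * a + cb * b for ca, cb in COEFFS]

def decode(survivors):
    """Recover (a, b) from any two surviving blocks.

    survivors maps block index (0..3) to the stored value.
    Solves the 2x2 linear system with Cramer's rule.
    """
    (i, x), (j, y) = list(survivors.items())[:2]
    (p, q), (r, s) = COEFFS[i], COEFFS[j]
    det = p * s - q * r  # nonzero for every pair of rows in COEFFS
    a = Fraction(x * s - q * y, det)
    b = Fraction(p * y - r * x, det)
    return a, b

# Lose any two blocks; the remaining two always suffice.
blocks = encode(7, 3)
for i, j in combinations(range(4), 2):
    assert decode({i: blocks[i], j: blocks[j]}) == (7, 3)
```

Replication with the same 2x overhead (a, b, a, b) fails this check whenever both copies of the same unit are lost, which is why it tolerates only one arbitrary failure.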
Borthakur, “HDFS and Erasure Codes (HDFS-RAID)”
Fan, Tantisiriroj, Xiao and Gibson, “DiskReduce: RAID for Data-Intensive Scalable Computing”, PDSW 09
[Figure: data units a through j, shown with and without parity units P1 through P4]
Reconstructing a lost block a:
Replication (block 1 to block 4: a, b, a, b): read the surviving copy of a. Network Transfer & disk IO = 1x
Erasure code (block 1 to block 4: a, b, a+b, a+2b): read b and a+b. Network Transfer & disk IO = 2x
[Figure: machines 1 through 4 under TOR switches connected through a router; machine 1 holds a]
Rashmi et al., “A Solution to the Network Challenges of Data Recovery in Erasure-coded Storage: A Study
[Figure sequence: a stripe of blocks 1 through 4 shown at 1-byte granularity, built up step by step and including a subtract step]
* K. V. Rashmi, Nihar Shah, K. Ramchandran, “A Piggybacking Design Framework for Read- and Download-efficient Distributed Storage Codes”, in IEEE International Symposium on Information Theory, 2013.
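A toy sketch of the piggybacking idea on a four-block stripe follows. The specific construction here is an assumption made for illustration and is not recoverable from the slide text: two bytes per block, each byte position encoded as a, b, a+b, a+2b, with a1 added (piggybacked) onto the last parity byte of the second position. Integer arithmetic stands in for the finite-field arithmetic a real code would use, and the function names are hypothetical.

```python
def encode(a1, b1, a2, b2):
    """Return four blocks, each holding two bytes (substripe 1, substripe 2)."""
    return [
        (a1, a2),                         # block 1 (data)
        (b1, b2),                         # block 2 (data)
        (a1 + b1, a2 + b2),               # block 3 (parity)
        (a1 + 2 * b1, a2 + 2 * b2 + a1),  # block 4 (parity, a1 piggybacked)
    ]

def repair_block1(blocks):
    """Rebuild block 1 = (a1, a2) and report how many bytes were read."""
    b2 = blocks[1][1]          # read 1 byte from block 2
    parity = blocks[2][1]      # read 1 byte from block 3: a2 + b2
    piggy = blocks[3][1]       # read 1 byte from block 4: a2 + 2*b2 + a1
    a2 = parity - b2
    a1 = piggy - a2 - 2 * b2   # subtract the known terms to expose a1
    return (a1, a2), 3

blocks = encode(5, 9, 4, 6)
recovered, bytes_read = repair_block1(blocks)
assert recovered == (5, 4) and bytes_read == 3
```

Without the piggyback, rebuilding both bytes of block 1 would read two bytes from each of two surviving blocks, four bytes in total, so this toy layout reads 3 bytes instead of 4.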
[Figure sequence: a stripe of 14 blocks of 256 MB each, blocks 1 through 14, labeled as data and parity blocks]
System          Data transfer   Connectivity (#machines)
RS              2.56 GB         10
HH blocks 1-9   1.67 GB         11
HH block 10     1.67 GB         13
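The data-transfer figures can be cross-checked with a few lines of arithmetic. This is a sketch under the assumptions that 1 GB = 1000 MB and that RS reconstruction reads one full 256 MB block from each of the 10 machines it contacts; the variable names are illustrative.

```python
BLOCK_MB = 256        # block size shown on the slides
RS_BLOCKS_READ = 10   # RS connectivity: one full block from each machine (assumed)

rs_gb = RS_BLOCKS_READ * BLOCK_MB / 1000   # RS data transfer in GB
hh_gb = 1.67                               # Hitchhiker (HH) figure from the slides

saving = 1 - hh_gb / rs_gb                    # fraction of transfer avoided
equivalent_blocks = hh_gb * 1000 / BLOCK_MB   # HH transfer in units of blocks

print(f"RS transfer: {rs_gb:.2f} GB")
print(f"HH saving over RS: {saving:.1%}")
print(f"HH transfer in blocks: {equivalent_blocks:.2f}")
```

The saving comes out at roughly 35%, with Hitchhiker moving the equivalent of about six and a half blocks instead of ten, while contacting more machines (11 or 13 instead of 10).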
Code metrics:
- Storage requirement: same (optimal)
- Supported parameters: all
- Fault tolerance: same (optimal)

Reconstruction:
- Network transfers: 35% less
- Disk IO: 35% less
- Data read and transfer time (median): 31.8% less
- Data read and transfer time (95th %ile): 30.2% less
- Computation time (median): 36.1% less

Encoding:
- Encoding time (median): 72.1% more
[Figure: two ways of choosing the coupled bytes (encoded together) within data units 1 through 10 and parity units 11 through 14. (a) coupling adjacent bytes to form stripes (b) hop-and-couple, where coupled bytes are separated by a hop length]
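The difference between the two coupling strategies can be sketched as index pairs within one unit. This is a minimal illustration, assuming bytes are coupled in pairs and that the unit length is a multiple of twice the hop length; the function names are hypothetical and the on-disk layout used in the actual system may differ.

```python
def couple_adjacent(n):
    """Pair each byte with its immediate neighbour: (0, 1), (2, 3), ..."""
    return [(i, i + 1) for i in range(0, n, 2)]

def hop_and_couple(n, hop):
    """Pair byte i with byte i + hop, in consecutive groups of 2 * hop bytes."""
    if n % (2 * hop) != 0:
        raise ValueError("n must be a multiple of 2 * hop")
    pairs = []
    for start in range(0, n, 2 * hop):
        for i in range(start, start + hop):
            pairs.append((i, i + hop))
    return pairs

# Hop length 1 degenerates to adjacent coupling.
assert hop_and_couple(8, 1) == couple_adjacent(8)

# With a hop of half the unit, the second members of all pairs (4, 5, 6, 7)
# sit in one contiguous run instead of at every other byte.
assert hop_and_couple(8, 4) == [(0, 4), (1, 5), (2, 6), (3, 7)]
```

A plausible reading of the figure is that a larger hop length lets a reconstruction that needs only one byte from each pair read contiguous ranges rather than alternating bytes.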