Austere Flash Caching with Deduplication and Compression
Qiuping Wang*, Jinhong Li*, Wen Xia# Erik Kruus^, Biplob Debnath^, Patrick P. C. Lee*
*The Chinese University of Hong Kong (CUHK) #Harbin Institute of Technology, Shenzhen ^NEC Labs
1
Austere Flash Caching with Deduplication and Compression Qiuping - - PowerPoint PPT Presentation
Austere Flash Caching with Deduplication and Compression Qiuping Wang * , Jinhong Li * , Wen Xia # Erik Kruus ^ , Biplob Debnath ^ , Patrick P. C. Lee * * The Chinese University of Hong Kong (CUHK) # Harbin Institute of Technology, Shenzhen ^ NEC
*The Chinese University of Hong Kong (CUHK) #Harbin Institute of Technology, Shenzhen ^NEC Labs
1
2
3
4
SSD Chunking I/O Deduplication and compression LBA à FP FP à CA, length FP-index LBA-index RAM HDD … Dirty list Variable-size compressed chunks (after deduplication) Fixed-size chunks LBA, CA LBA, CA Read/write
5
LBA (8B) à CA (8B) LBA (8B) à FP (20B) FP (20B) à CA (8B) + Length (4B)
6
7
8
Bucket mapping / data …
…
slot
…
9
Bucket LBA-index LBA-hash prefix FP hash Flag
…
FP-index … … FP-hash prefix Flag slot Bucket
…
… … slot
10
Metadata region … … Data region FP List of LBAs Chunk Bucket … … slot Bucket … … slot
11
32KiB 20KiB Compress Slice and Pad 8KiB each
12
FP-index … … SSD RAM … … FP List of LBAs Length FP-hash prefix Flag … …
Chunk
Bucket
Metadata Region Data Region
Subchunk
13
Slot …
LBA-index
Slot … … 2 3
Reference Counter
Old … … …
FP-index
… Recent
recency in LBAs
14
+1 +1 +1 FP-hash
w h
15
16
AC-D AC-DC CD-LRU-D CD-ARC-D CD-ARC-DC
1 10 100 1000 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Memory (MiB) 1 10 100 1000 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Memory (MiB) 1 10 100 1000 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Memory (MiB)
(a) WebVM (b) Homes (c) Mail
17
AC-D AC-DC CD-LRU-D CD-ARC-D CD-ARC-DC
25 50 75 100 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Read Hit (%) 10 20 30 40 50 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Read Hit (%) 25 50 75 100 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Read Hit (%)
(a) WebVM (b) Homes (c) Mail
18
AC-D AC-DC CD-LRU-D CD-ARC-D CD-ARC-DC
20 40 60 80 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Write Rd. (%) 20 40 60 80 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Write Rd. (%) 20 40 60 80 12.5 25 37.5 50 62.5 75 87.5 100 Cache Capacity (%) Write Rd. (%)
(a) WebVM (b) Homes (c) Mail
19
AC-D AC-DC CD-LRU-D CD-ARC-D CD-ARC-DC
25 50 75 100 20 40 60 80 I/O Dedup Ratio (%) Thpt (MiB/s)
(a) Throughput vs. I/O dedup ratio (write-to-read ratio 7:3)
25 50 75 100 9:1 7:3 5:5 3:7 1:9 Write-to-Read Ratio Thpt (MiB/s)
(b) Throughput vs. write-to-read ratio (I/O dedup ratio 50%)
20
25 50 75 100 Latency (us) 5975 6000 6025 Fingerprint Compression Lookup Update SSD HDD
50 100 150 200 250 1 2 4 6 8 Number of threads Thpt (MiB/s)
50% dedup 80% dedup
21
22