SLIDE 1
Wayne State University Cluster and Internet Computing Laboratory
Xing Lin Song Jiang Cluster and Internet Computing Laboratory - - PowerPoint PPT Presentation
SS-CDC : A Two-stage Parallel Content-Defined Chunking Method for Data Deduplicating Fan Ni Xing Lin Song Jiang Cluster and Internet Computing Laboratory Wayne State University Data is Growing Rapidly From storagenewsletter.com Most
Wayne State University Cluster and Internet Computing Laboratory
2
From storagenewsletter.com
3
4
Logical Physical File1 File2
– How to deduplicate more data? – How to deduplicate faster?
Chunking and fingerprinting Remove duplicate chunks
6
7
8
9
10
11
12
14
15
16
17
18
19
20
Cassandra Redis Debian Linux-src Neo4j Wordpress Node
512KB segments 1MB segments 2MB segments
23