Cloud Analytics Data Warehousing
Marco Serafini
COMPSCI 532 Lecture 18
Trivia
How does Amazon make money? Selling books? Entertainment?
Migrating to the Cloud
ELASTICITY COST Pay-as-you-go HW procurement at
STORAGE            PERFORMANCE  ACCESS        APPENDS  AVAILABILITY  PRICE
OBJECT (S3)                                   X        ✓             Low
FILE SYSTEM (EFS)                             ✓        ✓             High
BLOCK (EBS)        +            Instance (*)  ✓        X             Mid
INSTANCE-LOCAL     ++           Instance      ✓        X             High (**)
(*) Can be detached from an instance and reattached to another
(**) Storage-heavy instances are expensive
[Diagram: four COMPUTE nodes, each with its own LS]
Principle: move computation to data
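The principle can be illustrated with a small scheduling sketch. This is only an illustration under stated assumptions, not something taken from the slides: the function name `assign_tasks`, the block and node names, and the least-loaded tie-breaking rule are all hypothetical. Each task is placed on a node that already holds its input block, and falls back to any node only when no local copy exists.

```python
def assign_tasks(block_locations, nodes):
    """Place each block's task on a node that stores the block locally.

    block_locations: dict mapping block id -> list of nodes holding a copy
    nodes: list of available compute nodes
    (Both structures are hypothetical, for illustration only.)
    """
    load = {n: 0 for n in nodes}  # tasks assigned per node so far
    assignment = {}
    for block, replicas in block_locations.items():
        # Prefer nodes that already have the data (computation moves to data).
        local = [n for n in replicas if n in load]
        # Fall back to any node only if no available node holds the block.
        candidates = local if local else list(nodes)
        chosen = min(candidates, key=lambda n: load[n])  # least-loaded candidate
        assignment[block] = chosen
        load[chosen] += 1
    return assignment


if __name__ == "__main__":
    locations = {"b0": ["n0"], "b1": ["n1"], "b2": ["n0", "n1"], "b3": ["n9"]}
    print(assign_tasks(locations, ["n0", "n1", "n2"]))
```

In this sketch only `b3`, whose single copy sits on a node outside the cluster, would need its data shipped over the network.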
[Diagram: four COMPUTE nodes, each with its own LS, connected to a shared STORAGE SERVICE]
Compute nodes: arbitrary computation
Storage service: Read/Write only
Cannot move computation to data!
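A toy model may make the consequence concrete. The `StorageService` class below is a hypothetical in-memory stand-in for a read/write-only storage service, not a real cloud API: because it offers only `put` and `get`, a filter has to run on the compute side after the whole object has been transferred.

```python
class StorageService:
    """Hypothetical read/write-only object store (no server-side filtering)."""

    def __init__(self):
        self._objects = {}
        self.bytes_read = 0  # total bytes shipped to compute nodes

    def put(self, key, data: bytes) -> None:
        self._objects[key] = data

    def get(self, key) -> bytes:
        data = self._objects[key]
        self.bytes_read += len(data)  # the whole object crosses the network
        return data


def count_matching_lines(store, key, needle: bytes) -> int:
    # The filter runs on the compute node, after the full object was fetched.
    return sum(1 for line in store.get(key).splitlines() if needle in line)


if __name__ == "__main__":
    store = StorageService()
    store.put("log", b"ok\nerror a\nok\nerror b\n")
    print(count_matching_lines(store, "log", b"error"), store.bytes_read)
```

Even though only two lines match, `bytes_read` grows by the full object size, which is the cost of not being able to push the filter into the storage layer.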
latency
data, most data cold)
versioning
store
data scanned
Athena
cluster cost
EBS: very expensive
Instance storage + S3 backup: cheaper
[Diagram: four COMPUTE nodes, each with its own LS, connected to a shared STORAGE SERVICE]
Compute nodes: arbitrary computation
Storage service: Read/Write only