Towards General-Purpose Resource Management in Shared Cloud Services
Jonathan Mace, Brown University; Peter Bodik, MSR Redmond; Rodrigo Fonseca, Brown University; Madanlal Musuvathi, MSR Redmond
Shared-tenant cloud services
Ideally: manage resources to provide end-to-end guarantees and isolation
Challenge: OS/hypervisor mechanisms are insufficient
✘ Shared threads & processes
✘ Application-level resource bottlenecks (locks, queues)
✘ Resources across multiple processes and machines
Today: lack of guarantees and isolation; some ad-hoc solutions
[Diagram: three HDFS DataNodes]
hardware resources: disk, network, cpu, …
software resources: locks, queues, …
data structures: transaction logs, shared batches, …
e.g., if a tenant is causing contention, throttle it
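The example policy can be made concrete with a small sketch. Everything here is an assumption for illustration and is not taken from the talk: the class name `ContentionPolicy`, the `adjust` method, the per-tenant usage and limit maps, and the choice to halve the heaviest tenant's limit.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical policy sketch; names and the halving factor are illustrative only. */
class ContentionPolicy {
    /**
     * Returns updated per-tenant rate limits. If the resource is contended,
     * the tenant with the largest measured usage has its limit halved;
     * all other limits are left unchanged.
     */
    static Map<String, Double> adjust(Map<String, Double> usage,
                                      Map<String, Double> limits,
                                      boolean contended) {
        Map<String, Double> updated = new HashMap<>(limits);
        if (!contended || usage.isEmpty()) {
            return updated;
        }
        // Find the tenant with the highest usage of the contended resource.
        String heaviest = null;
        double max = Double.NEGATIVE_INFINITY;
        for (Map.Entry<String, Double> e : usage.entrySet()) {
            if (e.getValue() > max) {
                max = e.getValue();
                heaviest = e.getKey();
            }
        }
        // Throttle only that tenant, and only if it has a limit to lower.
        Double current = updated.get(heaviest);
        if (current != null) {
            updated.put(heaviest, current / 2.0);
        }
        return updated;
    }
}
```

A real policy would likely react gradually and restore limits once contention subsides; this sketch shows only the single "detect, then throttle" step.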
[Diagram: HDFS DataNodes with admission control]
while (!Thread.currentThread().isInterrupted()) {
    sendPacket();
}
while (!Thread.currentThread().isInterrupted()) {
    rate_limit();
    sendPacket();
}
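The slides do not show what `rate_limit()` does internally. One common way to implement such a call is a token bucket, sketched below under that assumption. The class `TokenBucket`, its parameters, and the injectable clock are hypothetical and are not the mechanism described in the talk.

```java
import java.util.concurrent.TimeUnit;
import java.util.function.LongSupplier;

/** Hypothetical token bucket, one instance per tenant; not taken from the talk. */
class TokenBucket {
    private final double ratePerSec;        // tokens added per second
    private final double capacity;          // maximum burst size
    private final LongSupplier clockNanos;  // injectable clock, eases testing
    private double tokens;
    private long lastRefillNanos;

    TokenBucket(double ratePerSec, double capacity, LongSupplier clockNanos) {
        this.ratePerSec = ratePerSec;
        this.capacity = capacity;
        this.clockNanos = clockNanos;
        this.tokens = capacity;             // start with a full bucket
        this.lastRefillNanos = clockNanos.getAsLong();
    }

    /** Adds tokens for the time elapsed since the last refill, capped at capacity. */
    private void refill() {
        long now = clockNanos.getAsLong();
        double elapsedSec = (now - lastRefillNanos) / 1e9;
        tokens = Math.min(capacity, tokens + elapsedSec * ratePerSec);
        lastRefillNanos = now;
    }

    /** Non-blocking: consumes one token if available and reports success. */
    synchronized boolean tryAcquire() {
        refill();
        if (tokens >= 1.0) {
            tokens -= 1.0;
            return true;
        }
        return false;
    }

    /** Blocking variant, playing the role of rate_limit() in the send loop. */
    void rateLimit() throws InterruptedException {
        while (!tryAcquire()) {
            long waitNanos;
            synchronized (this) {
                // Time until one whole token has accumulated.
                waitNanos = (long) Math.ceil((1.0 - tokens) / ratePerSec * 1e9);
            }
            TimeUnit.NANOSECONDS.sleep(Math.max(waitNanos, 1L));
        }
    }
}
```

In a send loop this could be created as `new TokenBucket(100.0, 10.0, System::nanoTime)` and invoked as `bucket.rateLimit()` before each `sendPacket()`; the rate and burst values here are placeholders.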
[Figure: normalized throughput and normalized latency of HDFS vs. HDFS w/ Retro for the Open, Read, Create, Rename, and Delete operations]
0.01% to 2% average overhead (latency, throughput)