Fighting with Unknowns: Estimating the Performance of Scalable Distributed Storage Systems with Minimal Measurement Data
Moo-Ryong Ra and Hee Won Lee1
AT&T Labs Research
May 23, 2019
1Presenter at MSST 2019
Fighting with Unknowns: Estimating the Performance of Scalable - - PowerPoint PPT Presentation
Fighting with Unknowns: Estimating the Performance of Scalable Distributed Storage Systems with Minimal Measurement Data Moo-Ryong Ra and Hee Won Lee 1 AT&T Labs Research May 23, 2019 1 Presenter at MSST 2019 Motivation Goal To
1Presenter at MSST 2019
◮ To estimate the performance of scalable distributed storage
◮ Mathematical modeling or black-box approach needs a
◮ We propose a simple, yet accurate performance estimation
◮ Our technique aims to identify max IOPS for an arbitrary
100%read
rw = P1 · kS1
SN 100%read
SN rw
◮ Block Storage, Strong Consistency, 3x Replication ◮ FIO: 104 OpenStack VMs, each running 8 FIO jobs
Converged Server
Host 1
Ceph-mon OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
Host 4
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
Host 3
Ceph-mon OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
Host 2
Ceph-mon OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
Host 8
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
Host 9
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
OSD
1.6TB NVMe
vm-01 vm-09 vm-02 vm-10 vm-03 vm-11 vm-04 vm-12 vm-08 vm-16 Logging Monitoring Alerting vm-97 vm-98 vm-99 vm-100 vm-104
VMs Ceph Service Daemons
25 GbE Network Link
◮ Object Storage, Eventual Consistency, 3x Replication ◮ COSBench: 32 workers
10 GbE Network Link
Host 1
Object Server
480GB SSD 480GB SSD
Host 2
Object Server
480GB SSD 480GB SSD
Host 3
Object Server
480GB SSD 480GB SSD
Host 4
Object Server
480GB SSD 480GB SSD
Host 5
Proxy Server
Host 6
10 GbE Network Link Swift Service Daemons Swift Client Daemons Client
1 2 3 4 5 6 7 8 9 10 1000 2000 3000 4000 5000
f_rw IO Size
Ceph (block size) Swift (object size) 4KB 4MB 2MB 1MB 512KB 512KB 1MB 2MB 4MB 4KB
100000 200000 300000 400000 500000 600000 700000 800000 10 20 30 40 50 60 70 80 90 100
C Read Ratio (%)
4KB 8KB 16KB 32KB 64KB 128KB 256KB 512KB 1MB 2MB
200 400 600 800 1000 1200 1400 1600 1800 2000 10 20 30 40 50 60 70 80 90 100
C Read Ratio (%)
4KB 8KB 16KB 32KB 64KB 128KB 256KB 512KB 1MB 2MB
200 400 600 800 1000 1200 1400 1600 10 30 50 70 90
T_total (IOPS) The ratio of 16KB objects (%)
Measured Estimated
200 400 600 800 1000 1200 10 30 50 70 90
T_total (IOPS) The ratio of 16KB objects (%)
Measured Estimated