Position: Synergetic Effects of Software and Hardware Parameters on the LSM System
Authors: Jinghuan Yu, Heejin Yoon*, Sam H. Noh*, Young-ri Choi*, Chun Jason Xue*
Log Structured Merge-tree (LSM)
What is the critical factor deciding performance? Does the working principle of LSM still fit these new mediums?
Specific designs for HDD and write-intensive workload. Periodical compaction with various resource occupation.
PMM
parallelism
NVMe SSD
higher requirement for CPU
by write granularity SATA SSD
foreground garbage collection

Media Type    Average Access Latency (µs)
SATA SSD      37.78
NVMe SSD      11.77
PMM           2.61
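To put the latency figures reported above in proportion, the ratios between the devices can be computed directly from the three averages. This is only an illustrative sketch: the dictionary and function names are assumptions made for the example and are not part of the authors' tooling.

```python
# Average access latencies in microseconds, copied from the figures above.
AVG_LATENCY_US = {"SATA SSD": 37.78, "NVMe SSD": 11.77, "PMM": 2.61}


def latency_ratio(slower: str, faster: str) -> float:
    """Return how many times higher the average latency of `slower` is
    compared with `faster` (hypothetical helper for illustration)."""
    return AVG_LATENCY_US[slower] / AVG_LATENCY_US[faster]


if __name__ == "__main__":
    pairs = [("SATA SSD", "NVMe SSD"), ("NVMe SSD", "PMM"), ("SATA SSD", "PMM")]
    for slower, faster in pairs:
        print(f"{slower} vs {faster}: {latency_ratio(slower, faster):.1f}x")
```

The ratios describe access latency only; they say nothing about throughput, which the following slides measure separately.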
Performance Comparison of Devices in RocksDB
CPUs causes IO congestion, decreasing performance
benefits from larger batch size
tends to be stable as the number of CPUs increases
size
the number of CPUs or batch size
difference is far from bandwidth comparison
[Figure: throughput (kOps/sec) on SATA SSD, NVMe SSD, and PMM with 2, 4, and 8 CPUs, for operation batch sizes of 16 MB, 32 MB, 64 MB, and 128 MB]
Existing Solutions Our Targets
VS
Resource Utilization, Heterogeneous Storage, Parameter Tuning
predictable
analysis
Strength
Weakness
compaction
Design Opportunities
[Figure: Cumulative IO Time in Compaction Runs. IO time (s) on SATA SSD with 2, 4, and 8 CPUs, for operation batch sizes of 16 MB, 32 MB, 64 MB, and 128 MB]
On SATA SSD, IO time increases dramatically as the number of CPUs increases.
Strength
Weaknesses
performance
Design Opportunities
the data sink too quickly and decreases the performance.
works
[Figure: Performance of NVMe SSD with different bandwidth (limited by cgroup). Bandwidth = 400 MB, 800 MB, 1200 MB, 1600 MB, and unlimited bandwidth; operation batch size]
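The slides do not say how the cgroup limit was configured. As one possible way to reproduce such a cap, the sketch below builds the line that the cgroup v2 `io.max` interface file expects (`MAJ:MIN rbps=... wbps=...`, in bytes per second). Everything specific here is an assumption: the use of cgroup v2 rather than v1, the reading of "400MB" as 400 MiB per second, the same cap on reads and writes, and the device number `259:0`, which is hypothetical.

```python
from typing import Optional

MIB = 1024 * 1024  # assumption: the slide's "MB" is treated as MiB per second


def io_max_line(major: int, minor: int, mb_per_s: Optional[int]) -> str:
    """Format one cgroup v2 io.max entry for a block device.

    `mb_per_s=None` removes the cap by writing the keyword "max".
    Hypothetical helper; it only builds the string and writes nothing.
    """
    limit = "max" if mb_per_s is None else str(mb_per_s * MIB)
    return f"{major}:{minor} rbps={limit} wbps={limit}"


if __name__ == "__main__":
    # 259:0 is a placeholder device number, not taken from the slides.
    for cap in (400, 800, 1200, 1600, None):
        print(io_max_line(259, 0, cap))
```

The resulting string would be written to the `io.max` file of the cgroup that runs the benchmark; the cgroup path itself depends on the system and is not given in the slides.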
Strength
Weaknesses
which can be solved by changing the size ratio between L0 and L1 files.
Design Opportunities
such as WAL
Size ratio here means (total size of L0 files) / (total size of L1 files); it is controlled by the compaction scheduling parameters.
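The size ratio defined above is a simple quotient of two totals. A minimal sketch of the calculation follows; the function name and the example file sizes are hypothetical and are not taken from the authors' code or measurements.

```python
from typing import Iterable


def size_ratio(l0_file_sizes: Iterable[int], l1_file_sizes: Iterable[int]) -> float:
    """Return (total size of L0 files) / (total size of L1 files).

    Sizes may be in any unit as long as both levels use the same one.
    """
    total_l1 = sum(l1_file_sizes)
    if total_l1 == 0:
        raise ValueError("L1 is empty, so the size ratio is undefined")
    return sum(l0_file_sizes) / total_l1


if __name__ == "__main__":
    # Hypothetical example: four 64 MB L0 files against a single 256 MB L1 file.
    print(size_ratio([64, 64, 64, 64], [256]))
```

In a real system the per-level file sizes would come from the store's own statistics rather than hard-coded lists.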
Challenges Solutions DOTA: Device Oriented Tuning Advisor
Workload adapting
Global resource management
Online Tuning
Thread pool and resource allocation
Amplification reduction and data reuse
Environment detecting and monitoring
Data placement and migration
Online modeling
Email Address: jinghuayu2-c@my.cityu.edu.hk
Github link: https://github.com/supermt/utils_for_lsm.git