HP-Mapper: A High Performance Storage Driver for Docker Containers

Fan Guo (1), Yongkun Li (1), Min Lv (1), Yinlong Xu (1), John C.S. Lui (2)
(1) University of Science and Technology of China
(2) The Chinese University of Hong Kong
Outline
1. Background & Motivation
2. HP-Mapper Design
3. Evaluation
4. Conclusion
Container and Docker

- Container
  - Process-level virtualization
    - Shares the host kernel
    - Built on namespaces and cgroups
  - Better performance
    - Fast deployment
    - Low resource usage
    - Near bare-metal performance
- Docker
  - Most popular container engine
  - Extensively used in production

[Figure: VM vs. container architecture]
Storage Management of Containers

- Container images
  - Store all requirements for running the containers
  - Hierarchical, read-only, sharable
- Storage drivers
  - Support cross-level lookup and copy-on-write (COW)

[Figure: Docker storage drivers: containers with writable layers on top of shared read-only image layers (Debian base image, nginx, emacs, Apache), presented as a unified view via cross-level lookup and copy-on-write]
Docker Storage Drivers

- Block-based drivers (vblk → blk mapping)
  - Block-level COW (≥64KB)
  - Cannot share cached data
  - Examples: DeviceMapper, ZFS, BtrFS
- File-based drivers (path → file mapping)
  - File-level COW
  - Share cached data
  - Examples: Overlay2, AUFS

[Figure: image layers (Layer1 read-only, Layer2 read-only, Layer3 writable); file-based drivers map paths to files, block-based drivers map virtual blocks (vblk) to physical blocks (blk)]
High COW Latency

- File-based storage drivers incur large COW latency (especially for large files)
  - Incur large write overhead
  - Degrade write performance

Copy-on-write latency (ms):

  File Size      4KB    64KB   1MB    16MB
  DeviceMapper   0.12   0.74   0.96   1.39   (block-based)
  BtrFS          0.09   0.09   0.09   0.10   (block-based)
  Overlay2       1.99   2.49   7.14   61.7   (file-based)
Disk I/O Redundancy

- Block-based drivers introduce many redundant I/Os when reading data from a shared file
  - Degrade I/O performance
  - Waste I/O bandwidth

[Figure: total amount of data read during the startup of 64 containers, block-based vs. file-based drivers]
Cache Redundancy

- Both kinds of storage drivers generate a lot of redundant cached data
  - Block-based: multiple copies of the same data are read into the cache (as the cache cannot be shared)
  - File-based: unchanged data in a file are also copied when performing copy-on-write
Motivation

- Limitations of current storage drivers
  - Tradeoff between write and read performance
  - Low cache efficiency
- Our goal: develop a new storage driver for Docker containers
  - Low COW overhead (or high write performance)
  - High read I/O performance
  - High cache efficiency
Outline
1. Background & Motivation
2. HP-Mapper Design
3. Evaluation
4. Conclusion
HP-Mapper: A High Performance Storage Driver

- Key idea: HP-Mapper works at the block level and manages physical blocks
  - Why block level: low COW overhead
  - Why physical blocks: able to detect redundant I/Os and redundant cached data
- Three modules
  - Address mapper
  - I/O interceptor
  - Cache manager
Address Mapper

- A tradeoff exists in block-based management
  - Large blocks: high COW latency
  - Small blocks: high lookup and storage overhead due to large metadata size
- HP-Mapper uses a two-level mapping tree
  - Supports two different block sizes
  - Differentiates requests with on-demand allocation (see the sketch below)
    - New writes: large sequential I/O, so a large block size is used
    - COW: block size depends on the request size
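The on-demand allocation policy can be pictured with a small sketch. This is only a minimal illustration of the idea on the slide, not HP-Mapper's actual code; the concrete sizes (64 KB / 4 KB) and all identifiers (hpm_choose_block_size, hpm_req_type, ...) are assumptions.

```c
/* Minimal sketch of on-demand block-size selection; sizes and names are assumptions. */
#include <stddef.h>

#define LARGE_BLOCK (64u * 1024u)   /* assumed size for new, large sequential writes */
#define SMALL_BLOCK (4u * 1024u)    /* assumed size for fine-grained COW              */

enum hpm_req_type { HPM_REQ_NEW_WRITE, HPM_REQ_COW };

/* Choose the block size to allocate for an incoming write request. */
static size_t hpm_choose_block_size(enum hpm_req_type type, size_t req_size)
{
    if (type == HPM_REQ_NEW_WRITE)
        return LARGE_BLOCK;                 /* new writes: large sequential I/O */

    /* COW: the block size depends on how much data the request touches. */
    return (req_size >= LARGE_BLOCK) ? LARGE_BLOCK : SMALL_BLOCK;
}
```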
Address Mapper

- Two-level mapping tree design (see the sketch after the figure)
[Figure: two-level mapping tree. The virtual block number is split into indices (index1 + offset for Level-1, index2 for Level-2); Level-1 nodes point to the roots of Level-2 trees, and Level-2 leaves store physical block numbers (PBNs). Storage efficiency: separated block placement + defragmentation.]
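To make the figure concrete, here is a minimal sketch of how a two-level VBN-to-PBN lookup might be organized. The fan-out, the index split, and all structure and function names (hpm_l1_node, hpm_lookup, ...) are assumptions for illustration, not the paper's actual data layout.

```c
/* Minimal sketch of a two-level VBN -> PBN lookup (layout and names are hypothetical). */
#include <stdint.h>
#include <stddef.h>

#define HPM_FANOUT 512u

struct hpm_l2_node {                     /* Level-2 node: indices -> PBNs          */
    uint64_t pbn[HPM_FANOUT];            /* 0 means "not mapped" in this sketch    */
};

struct hpm_l1_node {                     /* Level-1 node: indices -> Level-2 nodes */
    struct hpm_l2_node *l2[HPM_FANOUT];
};

/* Split the virtual block number into a Level-1 index and a Level-2 index,
 * then walk both levels; returns 0 if the block has no physical mapping yet. */
static uint64_t hpm_lookup(const struct hpm_l1_node *root, uint64_t vbn)
{
    size_t idx1 = (size_t)((vbn / HPM_FANOUT) % HPM_FANOUT);  /* high part -> Level-1 */
    size_t idx2 = (size_t)(vbn % HPM_FANOUT);                 /* low part  -> Level-2 */

    const struct hpm_l2_node *l2 = root->l2[idx1];
    return l2 ? l2->pbn[idx2] : 0;
}
```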
I/O Interceptor – Metadata Management

- How to detect redundant I/O
  - Indexing with the physical block number (PBN)
- PBN-indexed hash table
  - Entries are linked in a two-dimensional list (LRU)
    - Head entry (latest copy) + tail entries (other copies)
  - Each entry records: u64 PBN (key), u16 copies_num, u64 Super_Block, u64 Inode_ID, void *next, void *tail (see the struct sketch below)

[Figure: hash-table buckets of head entries, each head entry chained to the tail entries for the same PBN]
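A sketch of what such a hash-table entry could look like in C, directly following the fields listed on the slide; the struct name and the exact field semantics are assumptions.

```c
/* Sketch of a PBN-indexed hash-table entry, following the fields on the slide. */
#include <stdint.h>

struct hpm_pbn_entry {
    uint64_t pbn;                   /* physical block number, used as the hash key    */
    uint16_t copies_num;            /* number of cached copies of this block           */
    uint64_t super_block;           /* identifies the backing file system              */
    uint64_t inode_id;              /* identifies the file this copy belongs to        */
    struct hpm_pbn_entry *next;     /* next entry in the same hash bucket / LRU list   */
    struct hpm_pbn_entry *tail;     /* head entry only: chain of older copies (tails)  */
};
```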
I/O Interceptor – Workflow

- Detect redundant I/O with the PBN index (see the sketch below)
- Avoid unnecessary checks: write I/Os to the writable layer skip the interception
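One way to picture the workflow is the following sketch: before issuing an I/O to a shared layer, look the target PBN up in the hash table; if another container already brought the same physical block into the cache, reuse that copy instead of hitting the disk. Every helper here (hpm_find_entry, copy_from_cached_page, submit_io, ...) is a hypothetical placeholder, not an interface from the paper.

```c
/* Sketch of intercepting one I/O request; all helpers are hypothetical placeholders. */
#include <stdbool.h>
#include <stdint.h>

struct hpm_pbn_entry;                                           /* see the entry sketch above */
struct hpm_pbn_entry *hpm_find_entry(uint64_t pbn);             /* PBN hash-table lookup      */
bool copy_from_cached_page(struct hpm_pbn_entry *e, void *buf); /* reuse an existing copy     */
void submit_io(uint64_t pbn, void *buf);                        /* normal disk I/O path       */
void hpm_insert_entry(uint64_t pbn);                            /* record the new cached copy */

void hpm_intercept_io(uint64_t pbn, void *buf, bool write_to_writable_layer)
{
    /* Write I/Os to the container's own writable layer can never be redundant
     * across containers, so they skip the check ("avoid unnecessary check"). */
    if (write_to_writable_layer) {
        submit_io(pbn, buf);
        return;
    }

    /* For I/Os on shared layers, look the PBN up first: if another container
     * already brought this physical block into the cache, reuse that copy. */
    struct hpm_pbn_entry *e = hpm_find_entry(pbn);
    if (e && copy_from_cached_page(e, buf))
        return;                              /* redundant disk read avoided */

    submit_io(pbn, buf);                     /* fall back to a real disk read        */
    hpm_insert_entry(pbn);                   /* remember the copy for later readers  */
}
```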
Cache Manager

- How to remove redundant copies in the cache
  - Periodically scan the hash table to locate cached pages
  - Maintain the hotness of each page (multiple LRU lists)
- Page eviction: cache hit ratio vs. cache usage (see the sketch below)
  - Limit the number of copies per block: utilization-aware adjustment
  - Hotness-aware eviction

[Figure: scanning the cached copies of each PBN (hot/cold pages) and evicting copies beyond the per-block limit, e.g., copies_limit = 3]
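A rough sketch of the hotness-aware, copy-limited eviction step described above. The names (trim_copies, page_is_hot, evict_page, copies_limit) and the second-pass policy are placeholders; the slide only states that the number of cached copies per block is limited and that cold copies are preferred for eviction.

```c
/* Sketch of hotness-aware eviction with a per-PBN copy limit (names are placeholders). */
#include <stdbool.h>
#include <stddef.h>

struct page;                                      /* cached page, opaque here        */
bool page_is_hot(struct page *pg);                /* from the per-page LRU tracking  */
void evict_page(struct page *pg);                 /* drop one redundant cached copy  */

/* Scan the cached copies of one physical block and trim them down to the
 * current copy limit, evicting cold copies first. */
static void trim_copies(struct page **copies, size_t count, size_t copies_limit)
{
    size_t remaining = count;

    /* First pass: evict cold copies while we are over the limit. */
    for (size_t i = 0; i < count && remaining > copies_limit; i++) {
        if (copies[i] && !page_is_hot(copies[i])) {
            evict_page(copies[i]);
            copies[i] = NULL;
            remaining--;
        }
    }

    /* Second pass (assumption): if still over the limit, evict hot copies too. */
    for (size_t i = 0; i < count && remaining > copies_limit; i++) {
        if (copies[i]) {
            evict_page(copies[i]);
            copies[i] = NULL;
            remaining--;
        }
    }
}
```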
Outline
1. Background & Motivation
2. HP-Mapper Design
3. Evaluation
4. Conclusion
Experiment Setup

- Prototype
  - Implemented as a plugin module in Linux kernel 3.10.0
  - Backing file system: Ext4
- Workloads
  - Container images: Tomcat, Nextcloud, Vault
- Overhead of HP-Mapper

[Table: overhead of HP-Mapper (MT = Mapping Tree, HT = Hash Table)]
Reduction of COW Latency

- Copy-on-write (COW) latency

Copy-on-write latency (ms):

  File Size      4KB    64KB   1MB    16MB
  DeviceMapper   0.13   0.74   0.96   1.39
  BtrFS          0.09   0.09   0.09   0.10
  Overlay2       1.99   2.49   7.14   61.7
  HP-Mapper      0.07   0.12   0.55   0.57

HP-Mapper reduces COW latency by up to over 90% compared with DeviceMapper and Overlay2.
Reduction of Redundant I/Os

- Total amount of data read/written when launching 64 containers from a single image

HP-Mapper efficiently removes redundant read I/Os and also reduces the amount of written data by more than 50% on average.
Improvement of Cache Efficiency

- Cache usage when starting 64 containers from a single image

HP-Mapper reduces cache usage by more than 65% on average.
Improvement of Startup Time

- Total startup time when launching 64 containers from a single image on SSD/HDD

HP-Mapper achieves 2.0× to 7.2× faster startup than the other three storage drivers.
Improvement of Startup Time

- Total startup time when launching 64 containers in memory-scarce systems

HP-Mapper achieves a larger improvement as the available memory decreases.

[Figure: total startup time (s) vs. total available memory for DM, BtrFS, Overlay2, and HP-Mapper; left: launching 64 Nextcloud containers (12GB, 8GB, 4GB), right: launching 64 Vault containers (6GB, 4GB, 2GB)]
Outline
1. Background & Motivation
2. HP-Mapper Design
3. Evaluation
4. Conclusion
Conclusion

- Tradeoffs exist for Docker storage drivers
  - File-based: high COW overhead
  - Block-based: low cache efficiency and redundant I/O
- We develop HP-Mapper, which achieves
  - Low COW overhead, by following a block-based design with differentiated block sizes
  - High I/O efficiency, by intercepting redundant I/Os
  - High cache efficiency, by enabling cache sharing and hotness-aware management
Q&A

Yongkun Li
ykli@ustc.edu.cn
http://staff.ustc.edu.cn/~ykli