current status of cvmfs
play

Current Status of CVMFS Fast, scalable, and reliable software - PowerPoint PPT Presentation

Current Status of CVMFS Fast, scalable, and reliable software distribution Jakob Blomer July 3rd, 2009 1 / 20 Outline 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 2 / 20 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 3


  1. Current Status of CVMFS Fast, scalable, and reliable software distribution Jakob Blomer July 3rd, 2009 1 / 20

  2. Outline 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 2 / 20

  3. 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 3 / 20

  4. Software Distribution for Virtual Machines • “Ready to run” binaries, i. e. /mnt/cvmfs mirrors destination of make install • Read-only, public files 4 / 20

  5. Repository Statistics Latest Release, CernVM x86 Platform • Many (many) small files • In the repository: significant amount of redundancy 10 th / 90 th # Files Size Median ZIP-Ratio Percentile ALICE 38 000 1.6 GB 3.3 KB 330 Byte / 24 KB 2.8 ATLAS 150 000 6.5 GB 2.2 KB 217 Byte / 36 KB 10.7 CMS 110 000 5 GB 2.1 KB 320 Byte / 20 KB 1.9 LHCb 166 000 4.5 GB 1.5 KB 32 Byte / 16 KB 2.6 5 / 20

  6. Infrastructure Reference setup: Apache + Squid Key: Set of web caches serving static HTTP content from memory 6 / 20

  7. 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 7 / 20

  8. The CernVM File System Evolved from GROW-FS Fuse Modules CVMFS S CVMFS Components Quota / LRU Trace Capturing Catalog Cache Prefetcher Building Blocks SQLite libcurl libcrypto SHA1 MD5 zlib 8 / 20

  9. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  10. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  11. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  12. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  13. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  14. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  15. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  16. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  17. Distinctive Features • Satisfied with outgoing HTTP(S) connection, i. e. works with practically every Internet connection • Every downloaded file is verified by SHA1 • Chain of forward proxy servers • Pre-built cache packages • Offline mode • Trace file system operations • Nested catalogs • Catalogs can be signed by X.509 certificate • Catalogs store TTL, thereby stay up to date 9 / 20

  18. S CVMFS Catalog, Compression S CVMFS make install • S CVMFS provides a frontend to a repository directory • Missing part: atomic switch between repository versions (e. g. with ZFS) 10 / 20

  19. Performance • Locally SHA1-Cache • Multi-Threaded • SQLite for catalogs instead of in-memory tree representation • Pure data cache, decoupled from catalog Protocol Overhead • Network Traffic Web Server • HTTP Keep-Alive T S F K E Y 200 OK I G C N N N SYN ACK A • Prefetching A , Y , P C , A S A N K T C • Compression C T I F K K H CernVM 70 MB repository = ⇒ 20 MB network traffic 11 / 20

  20. Performance Prefetcher High predictability: ≈ 10-50 shared libraries per binary Construct prefetch hints: 1 Follow ldd -dependencies 2 Automatic construction from traces Analysis of open() sequence Automatic Construction (ROOT Extract) libGraf.so.5.22 (7) libGraf3d.so.5.22 (7) 100% 100% 100% 100% 100% libNet.so.5.22 (7) libHist.so.5.22 (7) 100% 100% 100% 18% 100% libCint.so.5.22 (7) libRIO.so.5.22 (7) 100% 100% 18% libMinuit.so (11) libCore.so.5.22 (7) 12 / 20

  21. Performance Prefetcher High predictability: ≈ 10-50 shared libraries per binary Construct prefetch hints: 1 Follow ldd -dependencies 2 Automatic construction from traces Analysis of open() sequence Automatic Construction (ROOT Extract) libGraf.so.5.22 (7) libGraf3d.so.5.22 (7) 100% 100% 100% 100% 100% libNet.so.5.22 (7) libHist.so.5.22 (7) 100% 100% 100% 18% 100% libCint.so.5.22 (7) libRIO.so.5.22 (7) 100% 100% 18% libMinuit.so (11) libCore.so.5.22 (7) 12 / 20

  22. 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 13 / 20

  23. Setup We measure extra time for application benchmark stressHepix residing on CVMFS / AFS 14 / 20

  24. High Latency Extra Running Time 800 AFS 700 CVMFS + Keep-Alive 600 + Prefetching 500 + Compression ∆ t [s] 400 300 200 100 0 LAN 25 50 100 150 Round Trip Time [ms] 15 / 20

  25. WAN Simulation Extra Running Time (RTT 100ms) 800 AFS 700 CVMFS + Keep-Alive 600 + Prefetching, Compression 500 ∆ t [s] 400 300 200 100 0 0 5 10 15 20 25 iPerf-measured Throughput [mbit/s] 16 / 20

  26. Gbit Ethernet 187 186 185 184 Running Time [s] 183 182 181 180 179 178 177 Local Storage AFS CVMFS Current 17 / 20

  27. 1 Overview 2 Recent Development 3 Benchmarks 4 Next Steps 18 / 20

  28. Next Steps • Testing • Zero-Conf, in particular nearby proxy server • “Share My Cache”: Replace Squid by co-operative peers 19 / 20

  29. Current Status of CVMFS Fast, scalable, and reliable software distribution Jakob Blomer July 3rd, 2009 20 / 20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend