scoutfs posix archiving at extreme scale
play

ScoutFS: POSIX Archiving at Extreme Scale Zach Brown, Versity MSST - PowerPoint PPT Presentation

ScoutFS: POSIX Archiving at Extreme Scale Zach Brown, Versity MSST 2019 POSIX Archiving with ScoutFS POSIX: NFS / SAMBA / RSYNC / CUSTOM ScoutFS SAN Fabric ScoutAM Archive Manager Archival Storage Archival Filesystem Differences Built


  1. ScoutFS: POSIX Archiving at Extreme Scale Zach Brown, Versity MSST 2019

  2. POSIX Archiving with ScoutFS POSIX: NFS / SAMBA / RSYNC / CUSTOM ScoutFS SAN Fabric ScoutAM Archive Manager Archival Storage

  3. Archival Filesystem Differences • Built for archive transfer rate, not for total data capacity • Almost all metadata at rest, file data stored on archive tiers • File count no longer constrained by data capacity • Support both user-facing file transfer and internal tier management load • Files have metadata which describes locations in archives • Archive tier management software needs to search through files • Strong desire for open source implementation

  4. Archival Filesystem Challenges • Exhaustive file searches are in the critical path - Are there new or modified files that need to be archived? - Which files were on that archive media that just caught fire? - Which large archived files were least recently used and can be released? - (.. and users would love efficient searching of their files!) • Must saturate streaming archive tier throughput - Efficient large file IO - Small files need high metadata rates to produce sufficient archive data

  5. ScoutFS Design Highlights • Start with an in-kernel coherent key/value item store: - “Logical” locking protects item consistency and governs caching - Log-based “physical” storage allows concurrent item reads and writes - Item writes grouped into atomic log fragment writes - Fundamental unit of metadata IO is large log fragments • Build a robust POSIX filesystem out of these items: - Full POSIX semantics, data extents, atomic metadata transactions • Maintain persistent file index items along with FS metadata items: - Sort files by metadata: size, mtime, xattrs, etc - Index items modified in the same transaction as primary metadata items - Concurrent write lock mode avoids global serialization

  6. Concurrent Metadata Reads and Writes A B C D 1 2 3 4 5 6 7 8

  7. 4 Nodes All Search While Creating

  8. Thank You zab@versity.com @versitysoftware

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend