cloud file services for academics
play

Cloud File Services for Academics Harry Mangalam, RCS - PowerPoint PPT Presentation

Cloud File Services for Academics Harry Mangalam, RCS <harry.mangalam@uci.edu> x40084 Full Report: http://goo.gl/fCNSIa BigData is here Most of what we see is unstructured data Or semi-structured (large ASCII files) 100TB


  1. Cloud File Services for Academics Harry Mangalam, RCS <harry.mangalam@uci.edu> x40084 Full Report: http://goo.gl/fCNSIa

  2. BigData is here ● Most of what we see is unstructured data ● Or semi-structured (large ASCII files) ● 100TB datasets now on HPC ● More coming..

  3. Bandwidth & Latency ● BW to many commercial services tops out at 1-5MB/s. ● Latency increases over distance and hops. ● Academic/Data networks can be 10-100X faster. ● LANs are fastest, often 100X faster than WANs. ● WANs block some protocols, ports, etc. ● Small file operations make everything worse.

  4. Raw Storage Pricing ● Raw storage halves in price every ~14mo ● NAS-quality 4TB disk is $170 ($4.25/100GB) ● 250TB Chassis is $35K ($14/100GB) ● Lifetime of Chassis is 4yrs ($3.5/100GB)

  5. Commercial Pricing ● Cheapest is ~$24/100GB/yr (Google, iCloud) ● Most expensive is ~$100/100GB/yr (some business services: Box, Druva) ● Pricing varies by business model (Backblaze is unlimited) ● Long Term Backup (Glacier @ $1/100GB/yr is cheapest but horrible interface. ● Since raw storage halves in price every 14mo, pricing changes very rapidly.

  6. 5 overlapping uses ● Interactive file storage ● File Sharing ● File Syncing (change to a file is propagated to all copies of that file) ● Short Term Backup (incr'l backup, disaster recovery, client-initiated recovery) ● Long Term Backup (dataset archives w. infrequent recovery)

  7. Venn Diagram of previous slide

  8. Important Features ● Ability to scale greatly. ● Support for very large files. ● Ease of user interface, informative ● Watch edits / co-editing. ● Easy to set/show permissions who can read, write, edit & delete

  9. And More.. ● Pay only for what you use. ● Sync/autosync across lots of devices. ● Direct web-availability. ● File versioning like Apple’s TimeMachine or ZFS snapshots. ● High bandwidth. ● Highly reliable.

  10. And Even More.. ● Multiple protocols (NFS, CIFS, HTTP/HTPPS, Globus, FTP/SFTP, etc). ● Mac/iOS/Win/Linux/Android clients ● Logging to show who has accessed files and how. ● Security (file sharing vs backups) ● Integrated Apps ● Stateless/Connectionless storage

  11. Local vs Commercial ● Local resources tends to offer more, more flexible, faster services at a cheaper long-term cost with less lock-in. ● Commercial services tend to offer less HR, DC req, more stability, decent options, at a higher aggregate cost. ● Legal advantages to both. ● Admin and users often have different and often opposing requirements; Admin tends to make the final decisions. ● Best solution is probably “Both”.

  12. Local Sharing Services ● OwnCloud – 'Open Source DropBox' Mac/iOS/Win/Linux/Android clients ● UCLA/CASS – Very large campus storage system, supports Globus/Grid, SMB/CIFS, NFS, CrashPlan.

  13. Commercial Services & more.. ● See: <http://goo.gl/fCNSIa> ● Harry Mangalam <harry.mangalam@uci.edu> x40084

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend