kunal lillaney advisor dr randal burns
play

Kunal Lillaney Advisor: Dr. Randal Burns Johns Hopkins University - PowerPoint PPT Presentation

Kunal Lillaney Advisor: Dr. Randal Burns Johns Hopkins University HBP CodeJamWorkshop #7 13 January 2016 Overview Human visualization drives analysis in this field y Visualization of petascale neuroscience imaging Stored on the


  1. Kunal Lillaney Advisor: Dr. Randal Burns Johns Hopkins University HBP CodeJamWorkshop #7 13 January 2016

  2. Overview • Human visualization drives analysis in this field y • Visualization of petascale neuroscience imaging – Stored on the cloud or at data center – Internet latencies ruin user experience • Deploy distributed caching – To offload server I/O and rendering – To reduce network latency • Customized to neuroscience data patterns – Combination of multi-channel data – High selectivity and reduced-dimension projects

  3. Open-science, data-intensive analysis of the brain • Peta-scale storage linked with HPC • Computational vision of brain structure • Spatial queries (clusters, volumes, distributions)

  4. ARCHITECTURE AR • Asdfjkl;

  5. Spatial D Database • Dense 3D or 4D spatial array pationtionedinto cuboids • Space filling curve and Multi-resolution zoom pyramid • Support for Neuron, Synapse, Segment and more annotation types • Store ~100TB of imaging data Z order space filling curve

  6. CATMAID CA • Asdfjkl;

  7. • Asdfjkl;

  8. \ • t6

  9. System G Goals • Visual flow (24+ frames per second) • Tolerable latency: ~100ms initial load (must be < 1 second) • Need to deliver: – Up to 30 512x512 image tiles for each view – 6 per layer, up to 5 layers • Can’t do it with Internet latencies Must p push data t to network edge, , near b browser!

  10. Co Content-Distribution N Network? • Ingest content, push toward consumer – Requires knowledge of content to be consumed • Does not match our data usage

  11. Spatial Data a and U Usage Patterns • Small regions of interest in massive data sets • Dynamic materializations of 2-d tiles – From 3-d or 4-d databases – Any (axis orthogonal) cutting plane

  12. Spatial Data a and U Usage Patterns • Exponentially many combinations of channels from the same data set (flattened for performance) Must p push data t to network edge AND m must dynamically manage data c contents ( (Caching)!

  13. Caching Architecture y Local Network Cloud memcached Cluster Data Store (PBs) Disk cache (TBs)

  14. Tile Request: Initial/Cache Miss y memcached Cluster render read

  15. Cache Prefetch: Background Load y memcached Cluster render

  16. Disk Cache y • Local performance to remote data • No computation – Tiles pre-rendered • Visual flow – When scrolling back and forth through tiles

  17. Deployment Options • Tile cache collocated with server y – Reduce I/O load on data servers – Offload rendering • Tile cache in Amazon West, servers in Amazon east – All of above and content distribution – Reduce Internet latencies • Tile cache on laptop/workstation with SSD – Maximize frame rates – Create user experience needed to visualize complex neural structures

  18. Why memcached? y • Background loading is not memcached instantaneous Cluster • Avoids server load – No computation for rendering – No I/O or NoSQL queries • Consistent interfaces for dynamic data don’t use tile cache

  19. So What? • Local performance to remote data y – Eliminate Internet latency – Terabyte cache (on workstation) of petascale data • User experience – Internet latency to first images – Local performance for most usage – Occasional stall for cache miss • Open-source, tile caching for spatial data – https://github.com/openconnectome/tilecache – Not used outside of OCP managed installations today

  20. OP OPEN CO CONNECT CTEA EAM Alex Baden Eric Perlman Sean Hill Daniel Berger Carey Priebe Bobby Kasthuri Randal Burns Clay Reid Misha Kazhdan Davi Bock Stefan Saalfeld Greg Kiar Albert Cardona Guillermo Sapira Mark Chevillet Dean Kleissas Anish Simhal Kwanghun Chung Ayushi Sinha Kwame Kutten Ming Chuang Stephen Smith Wei-Chung Allen Lee Forrest Collman Alexander Szalay Jeff Lichtman Steven Cook Raju Tomer Kunal Lillaney Karl Deisseroth R. Jacob Vogelstein Scott Emmons Larry Lindsey Joshua Vogelstein Jeremy Freeman Nick Weiler Priya Manavalan Will Gray Roncal Li Ye Disa Mhembere Logan Grosenick Da Zheng Michael Miller Greg Hager thatweare Dan Naiman Kristen Harris Patrick Parker

  21. Questions?

  22. • Website: neurodata.io • Documentation : docs.neurodata.io • Github: openconnectome • CATMAID : openconnecto.me/catmaid/ • support@neurodata.io

  23. Image Used for Demonstrational and Educational Purposes • http://upload.wikimedia.org/wikipedia/comm ons/d/d2/Internet_map_1024.jpg • http://broabandtrafficmanagement.blogspot.c om/2011/08/resource-cdn-explained.html • http://stopthecap.com/wp- content/uploads/2014/02/netflix-cdn.png

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend