nucleus eight gpu platform for visual simulation
play

Nucleus: Eight GPU Platform for Visual Simulation David Morgan - PowerPoint PPT Presentation

Nucleus: Eight GPU Platform for Visual Simulation David Morgan Principal Engineer Aechelon Technology S9224 Session Trajectory Visual Simulation Background Monsters, Clusters, and Moores Law Nucleus Architecture Challenges


  1. Nucleus: Eight GPU Platform for Visual Simulation David Morgan Principal Engineer Aechelon Technology S9224

  2. Session Trajectory • Visual Simulation Background • Monsters, Clusters, and Moore’s Law • Nucleus Architecture • Challenges • Demo

  3. Aechelon Technology

  4. Image Generation

  5. 2000: RealityMonster • 5 Racks • 8 Graphics “Pipelines” (GPUs) • 24 CPUs • 9GB RAM (NUMA) • 140GB Storage • Single IRIX OS • 15kW • $2.8M

  6. Scalability Matters • 128 CPUs • 256GB RAM • 16 GPUs

  7. 2001: GeForce 3

  8. 2002-Today: PC Clusters • 1-3 Racks per IG • 1 GPU per node • 1U Diskless Renderers • 3U Pager w/88TB Storage • Windows OS Per Node • Ethernet Interconnect • 7000W (8ch) • Unlimited Scalability

  9. Moore’s Law is Dead 3.8 3.6 Cluster 3.4 3.2 Skylake 2017 3 Nucleus Broadwell 2016 GHz 2.8 Haswell 2014 Ivy Bridge 2013 2.6 Sandy Bridge 2012 2.4 2.2 2 4 6 8 10 12 14 16 18 20 22 24 26 28 Cores

  10. 2016: 8-GPU Support

  11. Multi-GPU is Hard

  12. Nucleus • 4U • 8 Quadro GPUs • One display per GPU • 36 CPU Cores • 192GB RAM • 36TB Storage • One Windows OS • $100-200K • Operates up to 35C • 2000W • Limited Scalability

  13. Dual Root Complex

  14. Single Root Complex

  15. GPU Affinity • Exposed in OpenGL through WGL_NV_gpu_affinity extension • Quadro feature necessary to address individual GPUs on Windows • pC-Nova Maps GPU device handles to screens in the Windows virtual desktop • Beware driver crashes enumerating more than 4 screens per GPU!

  16. EDID Management http://johnsciacca.webs.com/apps/blog/show/16852621-installation-nightmares-9-professional-horror-stories

  17. DWM Is… • Independent GPUs’ video timings phase shift. • Normally correctable by tracking the phase • “Full -Screen Exclusive Mode” is gone. • DWM intermediates all drawing on multi-display systems. • One display is Primary.

  18. DWM Is Evil GPU 1 GPU 2 https://www.pandza.xyz/article/16/dwm,-dxgi,-swap-chains,-latency,-throughput-and-you

  19. Workaround: Framelock • Quadro Sync II supports 8 GPUs per system • Shared oscillator ensures displays remain in phase with Primary • DWM placated! • Downside: Video timings must all match • Downside: Wiring is delicate

  20. Future Work • GPU Multicast • Or Dual Root Complex? • VR Direct? • Clusters of Nuclei

  21. Thanks • Doug Traill • John Chaney • Tim Woodard • Steve Nash • Ian Williams

  22. Demo

  23. Questions?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend