Nucleus: Eight GPU Platform for Visual Simulation David Morgan - - PowerPoint PPT Presentation

nucleus eight gpu platform for visual simulation
SMART_READER_LITE
LIVE PREVIEW

Nucleus: Eight GPU Platform for Visual Simulation David Morgan - - PowerPoint PPT Presentation

Nucleus: Eight GPU Platform for Visual Simulation David Morgan Principal Engineer Aechelon Technology S9224 Session Trajectory Visual Simulation Background Monsters, Clusters, and Moores Law Nucleus Architecture Challenges


slide-1
SLIDE 1

Nucleus: Eight GPU Platform for Visual Simulation

David Morgan Principal Engineer Aechelon Technology

S9224

slide-2
SLIDE 2

Session Trajectory

  • Visual Simulation Background
  • Monsters, Clusters, and Moore’s Law
  • Nucleus Architecture
  • Challenges
  • Demo
slide-3
SLIDE 3

Aechelon Technology

slide-4
SLIDE 4

Image Generation

slide-5
SLIDE 5

2000: RealityMonster

  • 5 Racks
  • 8 Graphics “Pipelines” (GPUs)
  • 24 CPUs
  • 9GB RAM (NUMA)
  • 140GB Storage
  • Single IRIX OS
  • 15kW
  • $2.8M
slide-6
SLIDE 6

Scalability Matters

  • 128 CPUs
  • 256GB RAM
  • 16 GPUs
slide-7
SLIDE 7

2001: GeForce 3

slide-8
SLIDE 8

2002-Today: PC Clusters

  • 1-3 Racks per IG
  • 1 GPU per node
  • 1U Diskless Renderers
  • 3U Pager w/88TB Storage
  • Windows OS Per Node
  • Ethernet Interconnect
  • 7000W (8ch)
  • Unlimited Scalability
slide-9
SLIDE 9

Moore’s Law is Dead

Cluster Nucleus 2 2.2 2.4 2.6 2.8 3 3.2 3.4 3.6 3.8 4 6 8 10 12 14 16 18 20 22 24 26 28 GHz Cores Skylake 2017 Broadwell 2016 Haswell 2014 Ivy Bridge 2013 Sandy Bridge 2012

slide-10
SLIDE 10

2016: 8-GPU Support

slide-11
SLIDE 11

Multi-GPU is Hard

slide-12
SLIDE 12

Nucleus

  • 4U
  • 8 Quadro GPUs
  • One display per GPU
  • 36 CPU Cores
  • 192GB RAM
  • 36TB Storage
  • One Windows OS
  • $100-200K
  • Operates up to 35C
  • 2000W
  • Limited Scalability
slide-13
SLIDE 13

Dual Root Complex

slide-14
SLIDE 14

Single Root Complex

slide-15
SLIDE 15

GPU Affinity

  • Exposed in OpenGL through WGL_NV_gpu_affinity extension
  • Quadro feature necessary to address individual GPUs on Windows
  • pC-Nova Maps GPU device handles to screens in the Windows

virtual desktop

  • Beware driver crashes enumerating more than 4 screens per GPU!
slide-16
SLIDE 16

EDID Management

http://johnsciacca.webs.com/apps/blog/show/16852621-installation-nightmares-9-professional-horror-stories

slide-17
SLIDE 17

DWM Is…

  • Independent GPUs’ video

timings phase shift.

  • Normally correctable by

tracking the phase

  • “Full-Screen Exclusive

Mode” is gone.

  • DWM intermediates all

drawing on multi-display systems.

  • One display is Primary.
slide-18
SLIDE 18

DWM Is Evil

https://www.pandza.xyz/article/16/dwm,-dxgi,-swap-chains,-latency,-throughput-and-you

GPU 2 GPU 1

slide-19
SLIDE 19

Workaround: Framelock

  • Quadro Sync II supports 8 GPUs

per system

  • Shared oscillator ensures

displays remain in phase with Primary

  • DWM placated!
  • Downside: Video timings must

all match

  • Downside: Wiring is delicate
slide-20
SLIDE 20

Future Work

  • GPU Multicast
  • Or Dual Root Complex?
  • VR Direct?
  • Clusters of Nuclei
slide-21
SLIDE 21

Thanks

  • Doug Traill
  • John Chaney
  • Tim Woodard
  • Steve Nash
  • Ian Williams
slide-22
SLIDE 22

Demo

slide-23
SLIDE 23

Questions?