LOW-LATENCY GPGPU: A 5-minute intro and investigation, Matheus Vitti Santos



SLIDE 1

LOW-LATENCY GPGPU

A 5-minute intro and investigation

Matheus Vitti Santos @ Meeting C++ 2019

SLIDE 2

Disclaimer

■ These findings reflect the point of view of someone who’s been courting only CUDA in a hobbyist setting since 2010, and in a (lightweight) professional setting since 2017
■ I’d love to hear the viewpoints of AMD, Intel, Direct3D, Metal, and Vulkan folks on this - hit me up afterwards!
– And maybe give me some hardware to play with…?

SLIDE 3

So, GPGPUs have latency issues…

■ Calling GPU functions takes time
■ Moving memory around takes a lot of time
■ The GPU driver takes its sweet, sweet time
■ …right?

SLIDE 4

SLIDE 5

SLIDE 6

10+ years of GPGPU

■ APIs are slimmer!
– (D3D12 / Vulkan / Metal vs. OpenGL / D3D9)
■ GPUs are faster!
■ PCI Express is faster!
■ Most of all, drivers are faster!

SLIDE 7

Literally 2 weeks ago: NVIDIA introduces “Ultra Low Latency Mode”

SLIDE 8

“Latency” is relative

Domain                          Acceptable Latency
Protein Folding Simulation      Days
Weather Simulation              Hours
Radar Signal Convolution        200~300 ms
Videogame                       10~30 ms
Audio Processing                5~10 ms
High Frequency Stock Trading    <1 ms

SLIDE 10

Experiment: Real-time FM Synth

Solar Modulation - Savaged Regime
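
The slides do not show the synth's code, so the following is only a minimal CPU-side sketch of what a two-operator FM voice could look like, assuming plain phase modulation of a sine carrier. The names `FmVoice` and `renderFrame`, and all parameters, are hypothetical and do not come from the talk. Each output sample depends only on its own index, which is why this kind of workload maps naturally onto one GPU thread per sample.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical parameters for one two-operator FM voice (not from the talk).
struct FmVoice {
    double carrierHz;    // carrier frequency in Hz
    double modulatorHz;  // modulator frequency in Hz
    double modIndex;     // modulation index: peak phase deviation in radians
};

// Renders one audio frame of `frameSize` samples, starting at absolute
// sample position `startSample`. Every sample is computed independently
// of the others, so the loop body could become the body of a GPU kernel.
std::vector<float> renderFrame(const FmVoice& v, double sampleRate,
                               std::size_t startSample, std::size_t frameSize) {
    const double twoPi = 6.283185307179586;
    std::vector<float> out(frameSize);
    for (std::size_t i = 0; i < frameSize; ++i) {
        const double t = static_cast<double>(startSample + i) / sampleRate;
        // The modulator output is added to the carrier's phase.
        const double mod = v.modIndex * std::sin(twoPi * v.modulatorHz * t);
        out[i] = static_cast<float>(std::sin(twoPi * v.carrierHz * t + mod));
    }
    return out;
}
```

Calling `renderFrame(FmVoice{440.0, 220.0, 2.0}, 48000.0, 0, 256)` would produce one 256-sample frame; a real-time synth has to finish each such call before the previous frame has finished playing.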

SLIDE 11

Test subjects

GeForce 640M (this computer)

■ Kepler Architecture, SM 3.0, 2012
■ 2GB GDDR5 / 128bit / 900MHz
■ ~390 Gflops
– 2x PlayStation 3
– Intel UHD 620
■ ~25 Giops

Quadro P400 (office workstation)

■ Pascal Architecture, SM 6.1, 2017
■ 2GB GDDR5 / 64bit / 2GHz
■ ~630 Gflops
– ½ Xbox One
– 2x Intel Iris 5100
■ ~200 Giops

SLIDE 12

[Chart: Compute Time per Audio Frame. Time axis: 0.1 ms, 1.0 ms, 10.0 ms. Other axis: 32, 64, 128, 256, 512, 1024. Series: 95% GeForce, 95% Quadro, 99.9% GeForce, 99.9% Quadro. Reference markers: 48kHz, 96kHz.]
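
To read a chart like this, it helps to know the time budget for one audio frame: a frame of N samples at sample rate fs must be computed within N / fs seconds, or playback underruns. The sketch below works through that arithmetic and compares a measured percentile against the budget. It assumes that the values 32 to 1024 are frame sizes in samples and that "95%" and "99.9%" are percentiles of measured compute time; the slides do not state either explicitly. The function names are hypothetical.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Time budget in milliseconds for one frame of `frameSize` samples
// at `sampleRateHz`: the frame has to be ready before the previous
// one has finished playing.
double frameBudgetMs(std::size_t frameSize, double sampleRateHz) {
    return 1000.0 * static_cast<double>(frameSize) / sampleRateHz;
}

// Nearest-rank percentile of measured compute times, in milliseconds.
// Assumption: the chart's "95%" and "99.9%" series are percentiles.
// Returns 0.0 for an empty input.
double percentileMs(std::vector<double> timesMs, double pct) {
    if (timesMs.empty()) return 0.0;
    std::sort(timesMs.begin(), timesMs.end());
    const std::size_t n = timesMs.size();
    std::size_t rank = static_cast<std::size_t>(
        std::ceil(pct / 100.0 * static_cast<double>(n)));
    rank = std::max<std::size_t>(1, std::min(rank, n));
    return timesMs[rank - 1];
}

// True if the chosen percentile of compute time fits inside the budget.
bool meetsDeadline(const std::vector<double>& timesMs, double pct,
                   std::size_t frameSize, double sampleRateHz) {
    return percentileMs(timesMs, pct) <= frameBudgetMs(frameSize, sampleRateHz);
}
```

Under these assumptions, a 256-sample frame at 48 kHz leaves about 5.33 ms and a 512-sample frame about 10.67 ms, which lines up with the 5~10ms range the talk lists for audio processing. At 96 kHz every budget halves.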

SLIDE 13

DEMO TIME!

SLIDE 14

THANK YOU!

And go do something awesome with that GPU of yours!

SLIDE 15

Image Sources

■ Screenshot: Marble Madness, c. Atari 1984
■ Nvidia Control Panel: https://www.howtogeek.com/437761/how-to-enable-ultra-low-latency-mode-for-nvidia-graphics/
■ FM Music Video: Savaged Regime