Determinism of GPU solutions for AO real-time computing

Oct 12, 2023

E-ELT
● AO RTC Architecture
  – Hard real-time system (~1 kHz)
  – Big computation (5 TFLOPs)
  – Low latency
  – Maximum jitter: ~10%
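The requirements above can be turned into a rough timing budget. The sketch below assumes the ~10% jitter limit is expressed relative to the frame period; the slides do not state the reference, so this interpretation and the function name are illustrative only.

```python
def jitter_budget_us(loop_rate_hz: float, max_jitter_fraction: float) -> float:
    """Return the allowed jitter in microseconds.

    Assumption: the jitter limit is a fraction of the frame period.
    """
    frame_period_us = 1e6 / loop_rate_hz  # 1 kHz -> 1000 us per frame
    return frame_period_us * max_jitter_fraction


# Values from the slide: ~1 kHz loop rate, ~10% maximum jitter.
budget_us = jitter_budget_us(1000.0, 0.10)
print(f"Frame period: {1e6 / 1000.0:.0f} us, jitter budget: {budget_us:.0f} us")
```

Under that assumption, a 1 kHz loop leaves on the order of 100 µs of tolerated variation per frame.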
Jitter
● Where is the jitter?
  – Data transfer
  – Computation
● Jitter with standard transfer and computation

Case                               Pipeline          Time (jitter) (ms)
64x64 pixels, 8x8 subpupils        copy only         33 (35)
                                   copy + compute    96 (63)
240x240 pixels, 40x40 subpupils    copy only         204 (37)
                                   copy + compute    576 (57)
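The slides report a time and a jitter value per pipeline but do not define how jitter is computed. As a minimal sketch, the function below summarizes a series of per-frame timings by mean, peak, and peak-to-peak spread. The peak-to-peak definition and the sample values are assumptions made for illustration, not measurements from the presentation.

```python
from statistics import mean


def summarize_latencies(samples_us):
    """Summarize per-iteration timings (in microseconds).

    Assumption: jitter is reported as the peak-to-peak spread;
    other definitions (e.g. standard deviation) are equally plausible.
    """
    peak = max(samples_us)
    return {
        "mean_us": mean(samples_us),
        "peak_us": peak,
        "peak_to_peak_us": peak - min(samples_us),
    }


# Hypothetical timings with one outlier, not data from the slides.
samples = [6.0, 6.5, 6.0, 31.0, 6.5, 7.0]
stats = summarize_latencies(samples)
print(stats)
```

A single slow iteration barely moves the mean but dominates the peak, which is why a hard real-time system has to look at the worst case rather than the average.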
Data transfer
● Normal way
  – Main memory is used as a buffer
  – 2 copies per communication
  – CPU manages the communication
● GPUDirect RDMA (Remote Direct Memory Access)
  – No unnecessary copy
  – CPU only used for launching kernels
CREDIT: NVIDIA
Transfer result
● GPUDirect
  – Reduces jitter during transfer to almost 0
  – Reduces the transfer time by a factor of about 2
● But jitter still occurs during computations...
Computation
● Normal way
  – High jitter
  – Depends on CPU
  – Needs a real-time OS

[Figure: Time (in µs) for 8k empty kernel calls (average: ~6.5 µs, peak: ~31 µs)]

● Jitter with RDMA transfer and standard computation

Case                               Pipeline          Time (jitter) (ms)
64x64 pixels, 8x8 subpupils        copy only         12 (12)
                                   copy + compute    69 (59)
240x240 pixels, 40x40 subpupils    copy only         112 (10)
                                   copy + compute    475 (50)
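To compare the two tables (standard transfer versus RDMA transfer), the short sketch below computes the ratio of time and of jitter for each case. The numbers are transcribed from the slides; the dictionary layout and function name are illustrative choices.

```python
# (case, pipeline) -> (time, jitter), in the units given on the slides.
standard = {
    ("64x64", "copy only"): (33, 35),
    ("64x64", "copy + compute"): (96, 63),
    ("240x240", "copy only"): (204, 37),
    ("240x240", "copy + compute"): (576, 57),
}
rdma = {
    ("64x64", "copy only"): (12, 12),
    ("64x64", "copy + compute"): (69, 59),
    ("240x240", "copy only"): (112, 10),
    ("240x240", "copy + compute"): (475, 50),
}


def reduction_factors(before, after):
    """Return (time ratio, jitter ratio) of before/after for each case."""
    return {
        key: (before[key][0] / after[key][0], before[key][1] / after[key][1])
        for key in before
    }


factors = reduction_factors(standard, rdma)
for (case, pipeline), (time_ratio, jitter_ratio) in factors.items():
    print(f"{case:8s} {pipeline:15s} time x{time_ratio:.2f}  jitter x{jitter_ratio:.2f}")
```

With these figures, the copy-only time drops by a factor of roughly 1.8 to 2.8, while the jitter of the copy + compute pipeline improves by less than 15%, which matches the observation that the remaining jitter comes from the computation.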
Perpetual kernel
● Pros
  – No scheduler
  – No additional cost
  – New features
    ● Reduce computation
    ● New synchronization features
● Cons
  – More complex implementation, test and debugging
  – Hardware dependent
  – Can't use any existing library

[Figure: Timeline for standard kernel call, showing copy (Cpy) and compute (Comp) steps]
[Figure: Timeline for perpetual kernel call, showing copy (Cpy) and compute (Comp) steps]
[Figure: Clock cycle count for 8k iterations]
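The idea of a perpetual kernel (launched once, then polling for new data instead of being relaunched for every frame) can be illustrated with a CPU-side analogy. The sketch below uses a Python thread in place of a GPU kernel; it is not the authors' implementation, and the class name, counters, and polling scheme are all hypothetical.

```python
import threading
import time


class PerpetualWorker:
    """CPU-thread analogy of a perpetual kernel: started once, then polls for work."""

    def __init__(self, compute):
        self._compute = compute
        self._frame = None
        self._result = None
        self._posted = 0   # frames posted by the producer
        self._done = 0     # frames completed by the worker
        self._stop = False
        self._thread = threading.Thread(target=self._loop, daemon=True)
        self._thread.start()  # the only "launch" for the whole run

    def _loop(self):
        # Spin on a counter instead of being rescheduled for every frame.
        while not self._stop:
            if self._posted > self._done:
                self._result = self._compute(self._frame)
                self._done += 1  # signal completion to the producer
            else:
                time.sleep(0)    # yield so the producer can run

    def process(self, frame):
        self._frame = frame      # "copy" step: publish the new frame
        self._posted += 1        # then raise the flag
        while self._done < self._posted:
            time.sleep(0)        # wait for the worker to finish
        return self._result

    def shutdown(self):
        self._stop = True
        self._thread.join()


worker = PerpetualWorker(sum)
results = [worker.process([i, i + 1]) for i in range(3)]
worker.shutdown()
print(results)
```

On a GPU the same pattern would rely on device-visible flags and memory fences rather than Python attributes, which is where the added implementation and debugging complexity listed under the cons comes from.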
What's next?
● Implementation of RTC with perpetual kernel
● Integration with frame grabber
  – Test with pixel generator
  – Integration on the optical bench
  – Full loop profiling
● Study on floating point precision to reduce the number of GPUs
