AI WEBINAR
Date/Time: Tuesday, June 23 | 9 am PST
How Orange Successfully Deploys GPU Infrastructure for AI AI - - PowerPoint PPT Presentation
How Orange Successfully Deploys GPU Infrastructure for AI AI WEBINAR Date/Time: Tuesday, June 23 | 9 am PST Whats next in technology and innovation? How Orange Successfully Deploys GPU Infrastructure for AI AI WEBINAR Presenter: Your
Date/Time: Tuesday, June 23 | 9 am PST
Presenter: Stéphane Maillan
Orange AI Infrastructure
Your Host: Tom Leyden VP Marketing AI WEBINAR
What’s next in technology and innovation?
S.Maillan
2 Interne Orange
qAbout me
qGPU
qAI PHASES
qFIRST CONSIDERATION
3 Interne Orange
4 Interne Orange
qGPU : Very High parallel processing capability (limited memory)
qCPU : High parallel processing capability (2TB memory)
qFPGA : Very High parallel processing capability (programmable)
qASIC/AI chips : Extreme parallel processing capability
5 Interne Orange
qTRAINING : DATA + + + + +
qINFERENCE : DATA + + / very low - Real Time response time
q(ANALYTIC) : DATA + + + + + + +
6 Interne Orange
7 Interne Orange
CODE
qDATA
qCOMPUTING RESSOURCES
8 Interne Orange
Efficiently sharing GPU
qDedicated : local GPU machines
qShared : Single server
qDistributed : Cluster
9 Interne Orange
Parallel processing
10 Interne Orange
Architecture
11 Interne Orange
PCI Fabric NVSWITCH Fabric RDMA Fabric
12 Interne Orange
NVSWITCH Fabric
sharing / slicing capability !
13 Interne Orange
NVSWITCH Fabric
Last DGX A100
14 Interne Orange
PCI Fabric
15 Interne Orange
RDMA Fabric
16 Interne Orange
RDMA
17 Interne Orange
GPU DISAGREGATION
18 Interne Orange
Architecture
19 Interne Orange
Data Disagregation : Low latency Software Defined Storage
20 Interne Orange
Commodity Hardware No CPU/RAM Boottleneck Progressive cost Full Scale up/out
Promises
Software Defined Storage
21
NVME
CPU GPU FPGA PMEM NVME CPU GPU FPGA PMEM NVME
RDMA High Bandwidth Low Latency SHARP FPGA TLS Offload
Security
GPUDirect IPSEC Offload NVMe over Fabrics
Fabriq Interconnect
MPI R/CUDA
OFFLOAD
RDMA
In-Network Computing Key for Efficiency
In Network Computing
22 Interne Orange
q GPU Scheduler is a key of efficiency
23 Interne Orange
Interresting GPU Scheduler
qRun.ai
qslurm
24 Interne Orange
feature
qGPU Réservation and Quota
qGPU Job migration
25 Interne Orange
26 Interne Orange
RDMA Fabric
27 Interne Orange
PCIe ROCE Distributed
HW SW
28 Interne Orange
rCUDA - http://www.rcuda.net/
+
29 Interne Orange
GPU DISAGREGATION
30 Interne Orange
In Network Computing
31 Interne Orange
32 Interne Orange
GPU Direct Storage Imbetable performance Imbetable performance API Transport: + 5µs Protection Levels Flexibility Financial Efficiency Scale up/out RAID 0 / 1 / 10 / Erasure coding Volume Latency 40µs-300µs RDDA : 0% CPU sur les server de Stockage …. !!! RDMA & TCP Disk based Licensing Model
33 Interne Orange