Scalable Data Processing at Network transfer rates with nCorium - - PowerPoint PPT Presentation

scalable data processing at network transfer rates with
SMART_READER_LITE
LIVE PREVIEW

Scalable Data Processing at Network transfer rates with nCorium - - PowerPoint PPT Presentation

Scalable Data Processing at Network transfer rates with nCorium Compute in Memory Modules Suresh Devalapalli, Brett Neuman, Arvindh Lalam 11/18/2019 1 OSS Layer Bottlenecks in HPC Facilities 10,000 000s s server ers 10s of Peta


slide-1
SLIDE 1

Scalable Data Processing at Network transfer rates with nCorium Compute in Memory Modules

Suresh Devalapalli, Brett Neuman, Arvindh Lalam

11/18/2019 1

slide-2
SLIDE 2

OSS Layer Bottlenecks in HPC Facilities

11/18/2019 2

Compu pute L e Layer er Storage L e Layer er 10,000’ 000’s s server ers OSS SS

  • 10s of Peta Bytes of data moved in and out of

Storage layer

  • Challenges:
  • 1. Data processing on CPU unable to keep up with

increase in network and storage speeds

  • 2. I/O throughput per node is limited
slide-3
SLIDE 3

nCorium Compute in Memory Module

11/18/2019 3

  • DIMM form-factor
  • Reconfigurable compute cores on the DIMM
  • Off-load functions via libraries:
  • Compression
  • Erasure
  • Encryption
  • Video Transcoding
  • ….etc
  • Data Stream based processing

nCIMM Array in the system

slide-4
SLIDE 4

nCIMMs + nC-Grid = Scalable nCorium-OSS

11/18/2019 4

  • nCIMM to off-load data processing
  • nC-Grid to open wider I/O
  • Scalable Architecture:
  • Number of nCIMMs
  • Functions to support

EDR

nCST (Storage Target)

nCIMM nCIMM

1st Gen Server With 5 nCSTs

EDR

nCST (Storage Target)

nCIMM nCIMM EDR

nCST (Storage Target)

nCIMM nCIMM EDR

nCST (Storage Target)

nCIMM nCIMM EDR

nCST (Storage Target)

nCIMM nCIMM

Management Network Data Network

SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD SSD

CPU

slide-5
SLIDE 5

11 10.5 20.7 30.8 41.1 50.0 9 7.5 15.0 22.5 28.6 34.3 7.5 5.5 10.8 15.9 21.1 26.3 7.2 4.9 9.8 14.6 18.4 22.2 10 20 30 40 50 60

  • Ref. Platform

1-nc STs 2-nc STs 3-nc STs 4-nc STs 5-nc STs

Througbput (GBps) Number of nC-STs

RDMA RDMA + Encode RDMA + SSD RDMA + Encode + SSD

Performance of Gen-1 nCorium OSS: Q’1 19

11/18/2019 5

RDMA + Erasure

3.4X

5x for RDMA 3.4x for RDMA+ Erasure

Performance compared with Intel Gold CPU based system

Gen-1 released in Q1’19

slide-6
SLIDE 6

1.0 1.0 1.0 1.0 4.5 3.8 3.7 4.7 7.0 7.5 7.2 9.9 0.0 2.0 4.0 6.0 8.0 10.0 12.0 RDMA RDMA+ENC RDMA+ENC+SSD RDMA+COMP+ENC+SSD

Normalized to Ref. Platform

  • Ref. Platform vs nCorium G1 and nCorium G2

Reference nCorium G1 nCorium G2

Performance of Gen-2 nCorium OSS

11/18/2019 6

  • Improved reconfigurable cores
  • More off-load functionality
  • Higher Power efficiency

~10x faster

RDMA RDMA+ENC RDMA+ENC+SSD RDMA+COMP+ENC+SSD Reference 11 9 7.2 5 nCorium G1 50 34.3 26.3 23.5 nCorium G2 76.5 67.5 52 49.4

Gen-2 now available

Numbers extrapolated from initial data

slide-7
SLIDE 7

Ongoing Work

11/18/2019 7

  • Other Applications
  • Key-Value Storage
  • Video Transcoding
  • Inferencing
  • Graph Analytics
  • Cyber Security
  • Content Distribution
  • Parallel file system integration
  • Lustre
  • BeeGFS
slide-8
SLIDE 8

Demos / Exhibits

11/18/2019 8

StarLight: Booth 993 EchoStreams: Booth 1490

Thank you!