

SLIDE 1

Chanwoo Chungǂ, Jinhyung Koo, Junsu Im, Arvindǂ, and Sungjin Lee

DGIST and MITǂ

DATA-INTENSIVE COMPUTING SYSTEMS LABORATORY

NVRAMOS ‘19 2019.10.24

SLIDE 2

[Diagram: Computation – application servers; Storage – storage nodes 0..N; connected by a datacenter network (e.g., Ethernet, InfiniBand, …); each storage node has Xeon CPUs, GBs of DRAM, and a disk array w/ RAID]

It is not mere storage – it is another high-end server!

▪ High-end Xeon CPUs, several GBs of DRAM, an array of SSDs, large form factor, …
▪ Power hungry (e.g., 1700 W)
▪ Expensive (e.g., $2~40,000 w/o SSDs)
▪ Large volume (e.g., 2-4U)
▪ High TCO (e.g., cooling)

SLIDE 3

▪ HDD is slow – requires large DRAM and an array of disks
  ▪ 10 ms latency & 100~300 MB/s throughput
▪ HDD is dumb – the host system makes it smarter
  ▪ Xeon CPUs with advanced algorithms

[Diagram: storage host with Xeon CPUs, GBs of DRAM, and an HDD array w/ RAID (300 MB/s per disk); host software stack: protocol translation (e.g., NFS, CIFS, …), local file system (e.g., EXT4, WAFL, …), prefetching, caching/buffering, parity mgmt., dedup/compression; 4x 40GbE ports, aggregate network throughput = 20 GB/s]

SLIDE 4

▪ Now with an SSD array w/ RAID (1~10 GB/s per SSD) in place of the HDD array

SSDs are not a bottleneck → Network/CPU are the new bottlenecks
  • Aggr. SSD throughput = 10~100 GB/s (with 10 SSDs)
  • Aggr. network throughput = 20 GB/s (4x 40GbE) ← Bottleneck!

SLIDE 5

※ Aggr. SSD throughput was estimated assuming each SSD offers 1 GB/s throughput

                     EMC XtremIO      NetApp SolidFire  HPE 3PAR         Hynix AFA
SSD Array Capacity   36~144 TB        46 TB             750 TB           522 TB
# of SSDs            18~72            12                120              576
Aggr. Throughput*    18~72 GB/s       12 GB/s           120 GB/s         576 GB/s
Network Ports        4~8x 10Gb iSCSI  2x 25Gb iSCSI     4~12x 16Gb FC    3x Gen3 PCIe
Aggr. Throughput     5~10 GB/s        6.25 GB/s         8~24 GB/s        48 GB/s

▪ Supported by the latest works
  ▪ K. Kourtis et al., "Reaping the performance of fast NVM storage with uDepot," USENIX FAST '19
  ▪ J. Kim et al., "Alleviating Garbage Collection Interference through Spatial Separation in All Flash Arrays," USENIX ATC '19


SLIDE 7

[Diagram: the same SSD-based storage host, now showing a controller (Ctrl) inside each SSD; the 4x 40GbE ports remain the bottleneck]

SSDs are not a bottleneck → Network/CPU are the new bottlenecks
SSDs are smart enough, supporting many features → Duplicate storage management hurts performance

SLIDE 8

▪ 4 embedded ARM CPUs running at 700 MHz to 1.4 GHz and 1~16 GB of DRAM – roughly the resources a desktop PC had 10 years ago
▪ Those resources are required for running the firmware (i.e., the FTL)

[Diagram: SSD internals – host-to-PCIe controller on a 1~10 GB/s PCIe interface, >4 GB DRAM, four ARM CPUs (max 1.4 GHz), and an array of NAND chips; firmware tasks: block-I/O-to-flash interfacing, cleaning, compression, deduplication, parity mgmt./RAID, wear-leveling, remapping]

SLIDE 9

[Diagram: application servers connect over the datacenter network to storage nodes 0..N]

Let's assume this storage node has 72 8-TB SSDs (EMC XtremIO):
▪ # of ARM cores: 4 cores x 72 = 288 ARM cores
▪ Aggregate DRAM: 8 GB x 72 = 576 GB of DRAM
…just for managing NAND flash!

Q: Is this a storage node or a low-power microserver?

SLIDE 10

▪ Use a simple SSD?
  ▪ Software Defined Flash (ASPLOS '14)
  ▪ Application-managed Flash (USENIX FAST '16)
  ▪ LightNVM (USENIX FAST '17)
  → Network/CPU are still the bottleneck
▪ Use a better SSD organization?
  ▪ SWAN (HotStorage '16; USENIX ATC '19)
  → Still relies on a power-hungry and expensive host
▪ Any other solution?

SLIDE 11

▪ Motivation
▪ Basic Idea
▪ LightStore Software
▪ LightStore Controller
▪ LightStore Adapters
▪ Experimental Results
▪ Conclusion

SLIDE 12

▪ Get rid of a space-consuming, expensive, power-hungry host server
▪ Put and run everything in SSDs
▪ Attach SSDs to a datacenter network
▪ Let application servers directly talk to SSDs

[Diagram: application servers talk directly over the datacenter network to network-attached SSDs, each with its own controller; the host-side stack – protocol translation (e.g., NFS, CIFS, …), local file system (e.g., EXT4, WAFL, …), prefetching, caching/buffering, parity mgmt. – is eliminated]


SLIDE 14

[Diagram: inside each network-attached SSD – a host-to-PCIe controller, 2~4 GB of DRAM, and NAND chips w/ RAID; the SSD itself now runs low-level flash management, high-level flash management, and host protocol translation]

SLIDE 15

[Diagram: same as the previous slide, with an Ethernet controller added so each SSD attaches directly to the datacenter network]

SLIDE 16


Deliver Flash’s low latency & high throughput to network ports!

SLIDE 17

An x86 storage server with N SSDs is replaced with just N SSDs:
▪ Low power (e.g., 100 W / 10 SSDs)
▪ Cheap (e.g., zero server cost)
▪ Small volume (e.g., less than 1U)
▪ Low TCO (e.g., less cooling)
▪ Scalability (no network bottleneck)

SLIDE 18

▪ Can we run complicated server software on wimpy ARM cores?
▪ How can we provide the same interface to application servers?
▪ How can we manage unreliable NAND without more ARM cores?

SLIDE 19

[Diagram: a LightStore cluster of drive-sized embedded nodes, each with a NIC, KV-store software, and flash, plus an expansion-card network; application servers run adapters (YCSB adapter, FS adapter, Blk adapter) that translate INSERT/fwrite()/read() into KV requests (GET, SET, DELETE, …); each node runs the KV protocol server and LSM-tree algorithm (LightStore software) atop a hardware FTL and flash controller (LightStore controller) managing the NAND]

KV requests are hashed to different nodes by the adapters w/ consistent hashing.

▪ Run a simple KV store (LSM-tree) which exposes a flexible KV interface
▪ Run adapters on application servers that translate XX-to-KV
▪ Implement the FTL in hardware, since the LSM-tree is append-only
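The consistent hashing used by the adapters can be sketched as follows. This is an illustrative model only: the node names, virtual-node count, and MD5-based hash are assumptions, not LightStore's actual implementation.

```python
import bisect
import hashlib

def _h(s: str) -> int:
    # Stable 64-bit hash of a string (first 8 bytes of MD5).
    return int.from_bytes(hashlib.md5(s.encode()).digest()[:8], "big")

class HashRing:
    """Consistent-hash ring with virtual nodes: adding or removing one
    LightStore node remaps only ~1/N of the keys."""
    def __init__(self, nodes, vnodes=64):
        self._ring = sorted((_h(f"{n}#{v}"), n) for n in nodes for v in range(vnodes))
        self._keys = [k for k, _ in self._ring]

    def node_for(self, key: str) -> str:
        # First vnode clockwise of the key's hash owns the key.
        i = bisect.bisect(self._keys, _h(key)) % len(self._ring)
        return self._ring[i][1]

ring = HashRing(["node0", "node1", "node2"])
target = ring.node_for("user:1234")   # every adapter computes the same target
```

Because each adapter computes the ring independently from the node list, no central directory is needed; that matches the slide's point that the adapters, not the storage side, route KV requests.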

SLIDE 20


① LightStore Software ② LightStore Controller ③ LightStore Adapter

SLIDE 21

▪ Motivation
▪ Basic Idea
▪ LightStore Software
▪ LightStore Controller
▪ LightStore Adapters
▪ Experimental Results
▪ Conclusion

SLIDE 22

▪ Hash-based KVS
  ▪ Simple implementation
  ▪ Unordered keys → limited RANGE & SCAN
  ▪ Random == sequential access
  ▪ Unbounded tail latency
  ▪ KV-SSDs (mounted on host): Samsung KV-SSD; KAML [Jin et al., HPCA 2017]; BlueCache [Xu et al., VLDB 2016]
▪ LSM-tree-based KVS ← Our choice!
  ▪ Multi-level search tree
  ▪ Sorted keys → RANGE & SCAN supported
  ▪ Fast sequential access → adapter-friendly
  ▪ Bounded tail latency
  ▪ Append-only batched writes → flash-friendly

SLIDE 23

▪ LightStore software is implemented using the LSM-tree algorithm
  ▪ A popular algorithm for implementing a key-value store (KVS)
  ▪ Suitable for NAND flash since it is append-only
▪ How about using existing popular KV software (e.g., RocksDB)?
  ▪ It is quite heavy to run on ARM cores
  ▪ RocksDB on a 4-core ARM + Samsung 960 PRO SSD failed to deliver raw flash throughput to a network port
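To make the append-only behavior concrete, here is a toy LSM-tree sketch: a mutable in-memory memtable plus sorted, immutable runs that are only ever appended. The structure and sizes are simplified far beyond the real engine; this only illustrates why all flash writes end up sequential.

```python
from bisect import bisect_left

class TinyLSM:
    """Toy LSM-tree: a dict memtable plus sorted, append-only runs.
    Nothing is updated in place, so the on-flash layout stays sequential."""
    def __init__(self, memtable_limit=4):
        self.memtable, self.runs, self.limit = {}, [], memtable_limit

    def set(self, k, v):
        self.memtable[k] = v
        if len(self.memtable) >= self.limit:
            # Flush: write one sorted run sequentially; newest run first.
            self.runs.insert(0, sorted(self.memtable.items()))
            self.memtable = {}

    def get(self, k):
        if k in self.memtable:
            return self.memtable[k]
        for run in self.runs:                      # search newest to oldest
            i = bisect_left(run, (k,))
            if i < len(run) and run[i][0] == k:
                return run[i][1]
        return None

    def compact(self):
        # Merge all runs into one, keeping the newest value per key;
        # this is the step that replaces the FTL's garbage collection.
        merged = {}
        for run in reversed(self.runs):            # apply oldest first
            merged.update(run)
        self.runs = [sorted(merged.items())]
```

Note that compaction rewrites whole runs sequentially rather than patching pages, which is exactly the property the later slides exploit to replace garbage collection with a segment-mapped hardware FTL.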

SLIDE 24

▪ Three main bottlenecks in running RocksDB on ARM
  1. Excessive memory-copy overhead
     • memcpy() calls account for up to 30% of the total CPU cycles
     • Partially due to compaction
  2. High context-switch overhead
     • Spawns more than 20 threads for simultaneously processing user requests, flushes, and compaction
     • Only 4 cores are available in the SSD controller
  3. Deep and sophisticated software stack
     • Runs atop kernel layers, such as the page cache, a file system, and the block I/O layer
▪ Solutions?
  1. Implement a KVS from scratch so that it runs efficiently on ARM
  2. Rebuild a lightweight storage stack

SLIDE 25

▪ Platform Library
  • Does not rely on the kernel too much
  • Zero-copy memory allocator: uses mmap() to directly transfer data between DRAM and devices
  • Direct-IO engine: uses memory-mapped registers and polling to control the hardware

[Diagram: a user-space platform library (zero-copy memory allocator, direct-IO engine, poller thread #5, memory mapper) sits over a thin kernel device driver (interrupt handler, mmap(), poll()) that controls the LightStore controller and LPDRAM]
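The zero-copy idea can be illustrated in miniature: one shared mapping, with components exchanging (offset, length) descriptors instead of the bytes themselves. This is only an analogy for what lsn_malloc()/mmap() achieve; a real allocator would map DMA-visible device memory, not an anonymous region.

```python
import mmap

# One shared mapping stands in for a DMA-visible region used by the
# NIC, the KV server, and the flash DMA engine: components hand each
# other (offset, length) descriptors instead of copying payloads.
buf = mmap.mmap(-1, 1 << 20)          # 1 MiB anonymous mapping
view = memoryview(buf)

def produce(offset: int, payload: bytes) -> tuple:
    view[offset:offset + len(payload)] = payload   # single write into region
    return (offset, len(payload))                  # pass a descriptor, not data

def consume(desc: tuple) -> memoryview:
    off, length = desc
    return view[off:off + length]                  # zero-copy slice

desc = produce(4096, b"value-bytes")
assert consume(desc).tobytes() == b"value-bytes"
```

The point of the design is that the payload is written once (by whichever device produced it) and every later stage reads the same physical memory, which is how the platform library avoids memcpy() across layers.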

SLIDE 26

▪ KV Protocol Server
  • A simple socket server that handles KV requests
  • Uses the zero-copy allocator to avoid data copies between the NIC and DRAM

[Diagram: a KV request handler (thread #1) and a KV reply handler (thread #2) are added on top of the platform library; requests arrive from the datacenter network]
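A KV protocol server mainly frames and dispatches requests. The deck does not specify LightStore's wire format, so the header layout below (opcode, key length, value length) is entirely a hypothetical stand-in used to show the framing logic.

```python
import struct

# Hypothetical wire format (not LightStore's actual protocol): a fixed
# header <opcode:1, key_len:2, val_len:4>, then the key and value bytes.
HDR = struct.Struct("!BHI")
OPS = {0: "GET", 1: "SET", 2: "DELETE"}

def encode(op: str, key: bytes, value: bytes = b"") -> bytes:
    opcode = {v: k for k, v in OPS.items()}[op]
    return HDR.pack(opcode, len(key), len(value)) + key + value

def decode(buf: bytes):
    """Yield complete (op, key, value) requests from a byte stream; a real
    server would keep the unconsumed tail for the next recv()."""
    off = 0
    while off + HDR.size <= len(buf):
        opcode, klen, vlen = HDR.unpack_from(buf, off)
        end = off + HDR.size + klen + vlen
        if end > len(buf):
            break                      # partial request: wait for more bytes
        key = buf[off + HDR.size:off + HDR.size + klen]
        val = buf[off + HDR.size + klen:end]
        yield OPS[opcode], key, val
        off = end

stream = encode("SET", b"k1", b"v1") + encode("GET", b"k1")
reqs = list(decode(stream))
```

Length-prefixed framing like this is what lets the request handler slice keys and values out of the NIC buffer by offset, which is exactly where the zero-copy allocator pays off.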

SLIDE 27

▪ LSM-Tree Engine
  • An implementation of the LSM-tree algorithm optimized for ARM: (1) key-value decoupling, (2) key-table caching, …
  • Uses the direct-IO engine to control the LightStore controller
  • Just forwards pointers to allocated memory chunks to the LightStore controller

[Diagram: an LSM-tree manager (thread #3) and a writer & compaction thread (thread #4) are added, glued to the other threads via lock-free queues; the memtable lives in LPDRAM, and the engine calls lsn_malloc()/lsn_free()/lsn_read()/lsn_write()]

SLIDE 28

❶ Less context-switch overhead
  • The number of threads is limited to five
  • Threads are glued via lock-free queues
❷ No memory copies across all layers
  • Including the KV server, the LSM-tree engine, and the platform library
❸ Less intervention by the deep I/O stack
  • No block layer, no file system, …
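The fixed-thread, queue-glued structure can be sketched like this. Python's queue.Queue stands in for the lock-free queues (it locks internally), and the request tuples are invented for illustration; the point is that a small, fixed set of threads drains queues instead of spawning a thread per request.

```python
import queue
import threading

# Two of the five-thread pipeline stages, glued by queues. queue.Queue
# is a stand-in for LightStore's lock-free queues (illustration only).
requests, replies = queue.Queue(), queue.Queue()
store = {}

def lsm_worker():
    # Stand-in for the LSM-tree manager thread: drain requests, apply
    # them to the store, push replies. No per-request thread creation.
    while True:
        op, key, val = requests.get()
        if op == "stop":
            break
        if op == "set":
            store[key] = val
            replies.put("ok")
        else:                      # "get"
            replies.put(store.get(key))

t = threading.Thread(target=lsm_worker)
t.start()
requests.put(("set", "k", 42))
requests.put(("get", "k", None))
requests.put(("stop", None, None))
t.join()
```

With one long-lived thread per stage, the scheduler only ever juggles five threads on four cores, which is the deck's answer to RocksDB's 20+ threads.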

SLIDE 29

▪ Motivation
▪ Basic Idea
▪ LightStore Software
▪ LightStore Controller
▪ LightStore Adapters
▪ Experimental Results
▪ Conclusion

SLIDE 30

▪ The LSM-tree writes all data sequentially, all the time
▪ Example: I/O access patterns of RocksDB (LSM-tree based)

[Figure: write offsets over time – the log always appends data; LSM-tree compaction appears as additional sequential writes]

SLIDE 31

▪ The append-only behavior of the LSM-tree simplifies the FTL design
  ▪ No fine-grained mapping (e.g., page-level mapping)
  ▪ No garbage collection (the LSM-tree's compaction replaces it)
▪ The FTL is completely implemented in HW
  ▪ No ARM CPU is needed for the FTL, which frees the ARM cores to run software
  ▪ Faster than a SW FTL: 700 ns for address translation

[Diagram: the LightStore controller – software interface & DMA engines; a lightweight flash translation layer (segment mapping, wear-leveling, bad-block mgmt.); a flash chip manager (NAND control, ECC, I/O scheduling); and an expansion-card manager, all on the system bus (e.g., AXI) with an ARM core (e.g., Cortex-A53), block RAM, and a built-in battery; NAND flash array cards attach via FMC, and LightStore expansion cards via serial links]
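A software model of the segment-mapped translation that the hardware FTL performs (in ~700 ns, per the slide). The segment geometry below is an assumption for illustration, not the real device's layout.

```python
# Software model of segment-mapped address translation. The real
# LightStore FTL does this in hardware; sizes here are illustrative.
PAGES_PER_SEG = 256                   # assumed pages per segment

class SegmentFTL:
    def __init__(self, num_segments):
        self.free = list(range(num_segments - 1, -1, -1))  # free phys segments
        self.map = {}                 # logical segment -> physical segment

    def translate(self, lpn: int) -> int:
        """Logical page -> physical page: one table lookup plus the
        unchanged in-segment offset. No per-page map, no GC, because
        the LSM-tree only ever appends within a segment."""
        seg, off = divmod(lpn, PAGES_PER_SEG)
        if seg not in self.map:       # append-only: first touch allocates
            self.map[seg] = self.free.pop()
        return self.map[seg] * PAGES_PER_SEG + off

ftl = SegmentFTL(num_segments=1024)
ppn = ftl.translate(0)
```

Because the mapping table has one entry per segment rather than per page, it is small enough to sit in on-chip block RAM, which is what makes a pure-hardware FTL feasible.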

SLIDE 32

▪ Motivation
▪ Basic Idea
▪ LightStore Software
▪ LightStore Controller
▪ LightStore Adapters
▪ Experimental Results
▪ Conclusion

SLIDE 33

▪ The LightStore adapter is responsible for translating traditional I/O commands into KV pairs
▪ It runs on the application-server side as FUSE, BUSE, or a library

[Diagram – example file-to-KV adapter: a user application issues POSIX file I/O (e.g., fwrite()) through the virtual file system and the FUSE kernel module to a user-space file-to-KV adapter (a FUSE module), which sends KV pairs over a socket, through the network driver, to the LightStore cluster]

SLIDE 34

▪ The flexibility of the KV interface makes it possible to support various traditional protocols
▪ Four protocols are supported
  1. Native KV interface: GET/PUT, … – LightStore supports a KV interface natively
  2. YCSB interface: Read/Insert/Scan, … – each YCSB command directly corresponds to a specific KV operation, except for multiple fields, which can be supported with MGET/MSET
  3. Block interface: Read/Write/Trim – a key corresponds to an LBA; a value corresponds to 4 KB of fixed-size data
  4. File interface: fread()/fwrite(), … – a file can be handled as a key-value object; currently, a file system runs atop the block interface
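The block interface's key = LBA, value = 4 KB rule makes the Blk adapter almost trivial. A sketch, with a plain dict standing in for the KV cluster client; zero-filling reads of unwritten blocks is an added assumption, not stated in the deck.

```python
BLOCK = 4096   # the deck's block interface: key = LBA, value = 4 KB of data

class BlockToKVAdapter:
    """Sketch of the Blk adapter: block reads/writes/trims become KV
    GET/SET/DELETE. `kv` is any mapping-like KV client; here a dict
    stands in for the LightStore cluster."""
    def __init__(self, kv):
        self.kv = kv

    def write(self, lba: int, data: bytes):
        assert len(data) == BLOCK
        self.kv[lba] = data                        # SET lba -> 4 KB value

    def read(self, lba: int) -> bytes:
        # Assumption: unwritten blocks read back as zeros.
        return self.kv.get(lba, b"\x00" * BLOCK)   # GET lba

    def trim(self, lba: int):
        self.kv.pop(lba, None)                     # DELETE drops the mapping

disk = BlockToKVAdapter({})
disk.write(7, b"A" * BLOCK)
```

Fixing the value size at the flash page size means every block write maps to exactly one flash-page-sized KV value, so the adapter adds no buffering or read-modify-write of its own.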

SLIDE 35

▪ Motivation
▪ Basic Idea
▪ LightStore Software
▪ LightStore Controller
▪ LightStore Adapters
▪ Experimental Results
▪ Conclusion

SLIDE 36

▪ Each LightStore prototype node is implemented using a Xilinx ZCU102 evaluation board (with a Cortex-A53 CPU) and a custom flash card

SLIDE 37

▪ Clients and storage nodes are connected to the same 10GbE switch

                x86-based storage system              LightStore
CPU             Xeon E5-2640 (20 cores @ 2.4 GHz)     ARM Cortex-A53 (4 cores @ 1.2 GHz)
DRAM            32 GB                                 4 GB
SSD or flash    Samsung 960 PRO 512 GB SSD            Custom 512 GB NAND flash
                (firmware: FTL, buffers, …)           (raw flash)
  Throughput    3.21 GB/s / 1.38 GB/s                 1.2 GB/s / 430 MB/s
  Latency       80 us / 120 us                        120 us / 480 us
KVS             RocksDB v5.8                          Our LSM-tree engine
Client Ifc      ARDB                                  Our KV protocol server
Network         10 Gbit Ethernet (* up to 1.20 GB/s)  10 Gbit Ethernet (* up to 620 MB/s)
OS              Ubuntu 16.04 (Linux 4.9.0)

SLIDE 38

▪ 5 synthetic workloads to evaluate KVS performance
▪ A value size of 8 KB was used to match the flash page size
  • The latest version has been improved to support various key/value sizes

Synthetic workloads:
  S-SET    Sequential Write
  S-GET    Sequential Read
  R-SET    Random Write
  R-GET    Random Read
  R-Mixed  Random, R:W = 9:1

SLIDE 39

▪ Except for write workloads, LightStore fully saturates flash bandwidth

[Figure: single-node throughput for S-SET (sequential set), S-GET (sequential get), R-SET (random set), R-GET (random get), and R-Mixed (random mixed) – sequential I/O fully saturates NAND bandwidth; write workloads show compaction overheads; random reads show search & memory overheads]
SLIDE 40

▪ Except for write workloads, LightStore fully saturates network bandwidth

[Figure: network-attached single-node throughput for the same five workloads – sequential I/O fully saturates NAND bandwidth]

SLIDE 41

▪ x86-RocksDB performs better thanks to the high speed of the Samsung 960 PRO
▪ LightStore outperforms x86 under random writes (e.g., R-SET and R-Mixed)
▪ x86-ARDB suffers from non-trivial software-stack overheads

[Figure: LightStore vs. x86 – flash is the bottleneck for writes; the network is the bottleneck for reads]

SLIDE 42

▪ LightStore scales linearly with the number of SSDs added to a cluster

SLIDE 43

▪ Assume that x86-ARDB scales with up to 4 SSDs: 4 times the performance seen previously
▪ Peak power: x86-ARDB – 400 W; LightStore prototype – 25 W

LightStore IOPS/W gain:
  S-SET  S-GET  R-SET  R-GET  R/W mix
  1.8x   2.5x   7.4x   2.8x   5.7x
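A quick sanity check on these gains: with the stated peak powers, an IOPS/W gain g implies LightStore delivered g / (400 W / 25 W) = g/16 of the 4-SSD x86 setup's throughput. This is a back-of-the-envelope model derived from the slide's numbers, not a reported measurement.

```python
# Back-of-the-envelope check of the efficiency numbers from the slide:
# gain = (iops_ls / P_ls) / (iops_x86 / P_x86)
#      = (iops_ls / iops_x86) * (P_x86 / P_ls)
P_X86, P_LS = 400, 25
gains = {"S-SET": 1.8, "S-GET": 2.5, "R-SET": 7.4, "R-GET": 2.8, "R/W mix": 5.7}

power_ratio = P_X86 / P_LS                        # 16x power advantage
# Implied LightStore/x86 throughput ratio per workload:
throughput_ratio = {w: g / power_ratio for w, g in gains.items()}
```

For example, the 7.4x R-SET gain only requires LightStore to reach about 46% of the x86 cluster's random-write throughput, consistent with the comparable-throughput claim in the conclusion.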

SLIDE 44

SLIDE 45

▪ HW FTL > lightweight SW FTL > full SW FTL
  ▪ Full SW FTL: page mapping plus garbage-collection copying overhead
  ▪ Read: 7-10% degradation; Write: 28-50% degradation (the compaction thread is very active, adding more SW FTL work)

→ Without the FPGA (or HW FTL), we would need an extra set of cores (a trade-off between cost and design effort)

SLIDE 46

▪ Network-attached single-node performance and scalability w/ multiple nodes

[Figures: YCSB performance, block I/O performance, and file I/O performance – LightStore reaches the maximum network or maximum NAND bandwidth in each case; Ceph is inefficient for handling small data]
SLIDE 47

▪ This work was motivated by two observations in distributed storage
  1. The existing storage architecture does not scale well
  2. Applications fail to exploit the full performance of SSDs over the network
▪ LightStore is a lean, drive-sized, high-speed KV node that plugs directly into a network port
  1. Lightweight KV storage engine → delivers full NAND speed to network ports
  2. Hardware FTL → minimizes resource requirements
  3. XX-to-KV adapters → support various applications w/ no modification
▪ A four-node cluster showed throughput comparable to the AFA with four SSDs and achieved up to 7.4x better ops/J

slide-48
SLIDE 48

Thank you!

https://datalab.dgist.ac.kr