CS 744: DATACENTER AS A COMPUTER Shivaram Venkataraman Fall 2020 - - PowerPoint PPT Presentation

cs 744 datacenter as a computer
SMART_READER_LITE
LIVE PREVIEW

CS 744: DATACENTER AS A COMPUTER Shivaram Venkataraman Fall 2020 - - PowerPoint PPT Presentation

CS 744: DATACENTER AS A COMPUTER Shivaram Venkataraman Fall 2020 ANNOUNCEMENTS - Assignments - Assignment zero is due! - Form groups for Assignment 1 on Piazza - Class format - Review - Lecture - Discussion Applications Machine


slide-1
SLIDE 1

CS 744: DATACENTER AS A COMPUTER

Shivaram Venkataraman Fall 2020

slide-2
SLIDE 2

ANNOUNCEMENTS

  • Assignments
  • Assignment zero is due!
  • Form groups for Assignment 1 on Piazza
  • Class format
  • Review
  • Lecture
  • Discussion
slide-3
SLIDE 3

Scalable Storage Systems Datacenter Architecture Resource Management Computational Engines Machine Learning SQL Streaming Graph Applications

slide-4
SLIDE 4

OUTLINE

  • Hardware Trends
  • Datacenter design
  • WSC workloads
  • Discussion
slide-5
SLIDE 5

Why is One Machine Not Enough?

slide-6
SLIDE 6

What’s in a Machine?

Interconnected compute and storage Newer Hardware

  • GPUs, FPGAs
  • RDMA, NVlink

Memory Bus Ethernet SATA PCIe v4

slide-7
SLIDE 7

Scale Up: Make More Powerful Machines

Moore’s law – Stated 52 years ago by Intel founder Gordon Moore – Number of transistors on microchip double every 2 years – Today “closer to 2.5 years” Intel CEO Brian Krzanich

slide-8
SLIDE 8

Dennard Scaling is the Problem

Suggested that power requirements are proportional to the area for transistors – Both voltage and current being proportional to length – Stated in 1974 by Robert H. Dennard (DRAM inventor) Broken since 2005

“Adapting to Thrive in a New Economy of Memory Abundance,” Bresniker et al

slide-9
SLIDE 9

Dennard Scaling is the Problem

Performance per-core is stalled Number of cores is increasing

“Adapting to Thrive in a New Economy of Memory Abundance,” Bresniker et al

slide-10
SLIDE 10

Memory TRENDS

slide-11
SLIDE 11

MEMORY TAKEAWAY

Growing +15% per year

Data access from memory is getting more expensive !

slide-12
SLIDE 12

HDD CAPACITY

slide-13
SLIDE 13

HDD BANDWIDTH

Disk bandwidth is not growing

slide-14
SLIDE 14

SSDs

Performance: – Reads: 25us latency – Write: 200us latency – Erase: 1,5 ms Steady state, when SSD full – One erase every 64 or 128 reads (depending on page size) Lifetime: 100,000-1 million writes per page

slide-15
SLIDE 15

SSD VS HDD COST

slide-16
SLIDE 16

Ethernet Bandwidth

1998 1995 2002 2017

Growing 33-40% per year !

slide-17
SLIDE 17

AMAZON EC2 (2019)

slide-18
SLIDE 18

TRENDS SUMMARY

CPU speed per core is flat Memory bandwidth growing slower than capacity SSD, NVMe replacing HDDs Ethernet bandwidth growing

slide-19
SLIDE 19

DATACENTER ARCHITECHTURE

Memory Bus Ethernet SATA PCIe

Server Server

slide-20
SLIDE 20

STORAGE HIERARCHY (DC AS A COMPUTER v2)

slide-21
SLIDE 21

Warehouse-Scale Computers

Single organization Homogeneity (to some extent) Cost efficiency at scale – Multiplexing across applications and services – Rent it out! Many concerns – Infrastructure – Networking – Storage – Software – Power/Energy – Failure/Recovery – …

slide-22
SLIDE 22

SOFTWARE IMPLICATIONS

Workload Diversity Reliability Single organization Storage Hierarchy

slide-23
SLIDE 23

BigData

WORKLOAD: Partition-Aggregate

Top-level Aggregator Mid-level Aggregators Workers

slide-24
SLIDE 24

WORKLOAD: SCHOLAR SIMILARITY

Reduce Stage Map Stage

slide-25
SLIDE 25

VIDEO ENCODING

slide-26
SLIDE 26

MACHINE LEARNING

slide-27
SLIDE 27

DISCUSSION

https://forms.gle/CrrrhCPYHerwXNEt5

slide-28
SLIDE 28

Discussion

Scale-up vs Scale-out

slide-29
SLIDE 29

DISCUSSION

Microsoft Word vs. online document editor like Google Docs

slide-30
SLIDE 30

DISCUSSION

slide-31
SLIDE 31

NEXT STEPS

Next class: Storage Systems Assignment 1 out Thursday. Submit groups before that!