Nice to meet you! The Network Matters Cloud-based applications - PowerPoint PPT Presentation

Competitive Clustering of Stochastic Communication Patterns on the Ring Chen Avin Louis Cohen Stefan Schmid Nice to meet you!

The Network Matters ❏ Cloud-based applications generate significant network traffic ❏ E.g., scale-out databases, streaming, batch processing applications ❏ E.g., Hadoop Terrasort job: Shuffle phase

Example: VM Placememt ❏ Virtual machine placement affects bandwidth costs ❏ Example: map reduce in Clos datacenter

Example: VM Placememt ❏ Virtual machine placement affects bandwidth costs ❏ Example: map reduce in Clos datacenter reducers reducers mappers mappers tenant 1 tenant 2 tenant 2 tenant 1

Example: VM Placememt ❏ Virtual machine placement affects bandwidth costs ❏ Example: map reduce in Clos datacenter Distributed across pods: costly shuffling! reducers reducers mappers mappers tenant 1 tenant 2 tenant 2 tenant 1

Example: VM Placememt ❏ Virtual machine placement affects bandwidth costs ❏ Example: map reduce in Clos datacenter Locally clustered within a rack or pod: efficient! mappers reducers reducers mappers tenant 2 tenant 2 tenant 1 tenant 1

Example: VM Placememt ❏ Virtual machine placement affects bandwidth costs ❏ Example: map reduce in Clos datacenter Communication patterns are often clustered (but can change Locally clustered over time). within a rack or pod: efficient! mappers reducers reducers mappers tenant 2 tenant 2 tenant 1 tenant 1

How to support local communication?

How to support local communication? Option 1: Change the topology (?!)

How to support local communication? Option 1: Change the topology (?!) ❏ Theory of demand-aware networks ❏ Prototypes emerging: e.g., ProjectToR (SIGCOMM 2016) ❏ Based on lasers and mirrors

How to support local communication? Option 1: Change the topology (?!) ❏ Theory of demand-aware networks ❏ We are working on Prototypes emerging: e.g., ProjectToR (SIGCOMM 2016) it! E.g., „ SplayNets @ ❏ TON 2016“. Based on lasers and mirrors But not today!

How to support local communication? Option 1: Change the topology (?!) Option 2: Cluster the nodes ❏ ❏ Theory of demand-aware networks Migrate frequently ❏ communicating nodes closer Prototypes emerging: e.g., ProjectToR together (SIGCOMM 2016) ❏ Based on lasers and mirrors

How to support local communication? Option 1: Change the topology (?!) Option 2: Cluster the nodes ❏ ❏ Theory of demand-aware networks Migrate frequently Today! ❏ communicating nodes closer Prototypes emerging: e.g., ProjectToR together (SIGCOMM 2016) ❏ Based on lasers and mirrors

How to support local communication? Option 1: Change the topology (?!) Option 2: Cluster the nodes ❏ ❏ Theory of demand-aware networks Migrate frequently ❏ communicating nodes closer Prototypes emerging: e.g., ProjectToR together (SIGCOMM 2016) ❏ Based on lasers and mirrors ❏ Challenges of communication pattern clustering: ❏ Communication patterns are not known ahead of time… ❏ … and may even change over time!

How to support local communication? Option 1: Change the topology (?!) Option 2: Cluster the nodes ❏ ❏ Theory of demand-aware networks Migrate frequently ❏ communicating nodes closer Prototypes emerging: e.g., ProjectToR together (SIGCOMM 2016) ❏ Based on lasers and mirrors ❏ Challenges of communication pattern clustering: ❏ Communication patterns are not known ahead of time… Thus: Need to repartition ❏ … and may even change over time! clusters in an online manner, depending on demand!

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4 How to cluster?

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4 How to cluster? Thickness of line = amount of communication

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4 Most communication within cluster (intra- cluster)… … little inter-cluster communication.

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4 1 2 3 4 5 6 ❏ Now assume: changes in communication pattern! ❏ E.g., more communication (1,3),(3,4),(2,5) but less (5,6)

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4 1 1 2 3 4 5 5 6 ❏ Now assume: changes in communication pattern! ❏ E.g., more communication (1,3),(3,4),(2,5) but less (5,6)

Example: A Re Partitioning Problem ❏ Example: 4 clusters of size 4 1 1 2 3 4 5 5 6 Nodes 1 and 5 ❏ Now assume: changes in communication pattern! change clusters! ❏ E.g., more communication (1,3),(3,4),(2,5) but less (5,6)

Online Re Partitioning A simple and fundamental model (e.g., a rack): size k („# slots “) servers („ clusters “)

Online Re Partitioning A simple and fundamental model (e.g., a rack): … maximize size k („# slots “) intra-cluster communication! Minimize inter-cluster servers („ clusters “) communication …

Online Re Partitioning A simple and fundamental model (e.g., a rack): … maximize size k („# slots “) intra-cluster Also: minimize communication! migrations (=swap)! Minimize inter-cluster servers („ clusters “) communication …

Online Re Partitioning A simple and fundamental model: In practice: k << (many more servers than VM slots per server)! … maximize size k („# slots “) intra-cluster Also: minimize communication! migrations (=swap)! Minimize inter-cluster servers („ clusters “) communication …

Online Re Partitioning Problem inputs: k, , Communication pattern over time

Online Re Partitioning Problem inputs: k, , α 0 Costs: 1 Objective:

Online Re Partitioning Problem inputs: k, , Two flavors: (1) online (worst-case) pattern (2) learning: from a fixed (unkown) distribution α 0 Costs: 1 Objective:

The Crux: Algorithmic Challenges A) Serve remotely or migrate (“rent or buy”)? When to migrate? If a communication pattern is short-lived, it may not be worthwhile to collocate the nodes: the migration cost cannot be amortized.

The Crux: Algorithmic Challenges A) Serve remotely or migrate (“rent or buy”)? When to migrate? If a communication pattern is short-lived, it may not be worthwhile to collocate the nodes: the migration cost cannot be amortized. B) Where to migrate, and what? If nodes should be collocated, the question becomes where. Should the first node be migrated to the cluster of the second or vice versa? Or shall both be moved together to a new cluster? Moreover, an algorithm may be required to pro-actively migrate (resp. swap) additional nodes.

The Crux: Algorithmic Challenges A) Serve remotely or migrate (“rent or buy”)? When to migrate? If a communication pattern is short-lived, it may not be worthwhile to collocate the nodes: the migration cost cannot be amortized. B) Where to migrate, and what? If nodes should be collocated, the question becomes where. Should the first node be migrated to the cluster of the second or vice versa? Or shall both be moved together to a new cluster? Moreover, an algorithm may be required to pro-actively migrate (resp. swap) additional nodes. C) Which nodes to evict? There may not exist sufficient space in the desired destination cluster. In this case, the algorithm needs to decide which nodes to evict, to free up space.

Online Variant: Competitive Ratio and Augmentation ❏ Goal: minimize competitive ratio

Online Variant: Competitive Ratio and Augmentation ❏ Goal: minimize competitive ratio ❏ Two flavors: without and with augmentation

Let’s first look at special case: k =2

Let’s first look at special case: k =2 Need to find pairs!

Let’s first look at special case: k =2 Need to find pairs! Clusters of size 2: A new type of online matching problem!

Special Cases: =2

Special Cases: =2 2 Clusters: A generalization of online caching!

Special Cases: =2 (“Online Caching”) ❏ For 2 clusters: can emulate Models disk online caching! Models cache ❏ k items, cache size k -1 cache disk

Special Cases: =2 (“Online Caching”) ❏ For 2 clusters: can emulate … plus some online caching! dummy item ❏ k items, cache size k -1 d k -1 Cache… cache disk

Special Cases: =2 (“Online Caching”) ❏ For 2 clusters: can emulate online caching! ❏ k items, cache size k -1 d i ❏ When item i is requested in original caching problem: ❏ Introduce many requests k -1 between d and i : forces i to cache (if it is not yet) cache disk

Special Cases: =2 (“Online Caching”) ❏ For 2 clusters: can emulate online caching! ❏ k items, cache size k -1 d i ❏ When item i is requested in original caching problem: ❏ Introduce many requests k -1 between d and i : forces i to cache (if it is not yet) ❏ Which one to evict? Caching problem! cache disk

Nice to meet you! The Network Matters Cloud-based applications - PowerPoint PPT Presentation

Competitive Clustering of Stochastic Communication Patterns on the Ring Chen Avin Louis Cohen Stefan Schmid Nice to meet you! The Network Matters Cloud-based applications generate significant network traffic E.g., scale-out

S et the Bar Low. Be a WINNER every time. Public Power Matters Public Power Matters Innovation

PORTUGAL Nice wheather, Nice people Nice country! POR NSO Anbal Marianito Lausanne

Checki king in and Treating High-Achievi ving Students Meet Meet you your r Doctor Doctor

Rational Phosphorus Rational Phosphorus Management in Biosolids Management in Biosolids

All Things Nice Company Presentation Company Profile All Things Nice (ATN) is a platform to

All Things Nice Company Presentation Company Profile All Things Nice (ATN) is a platform to

CWB Network Information exChange Environment (NICE) Mark Cheng Central Weather Bureau, Taiwan,

20 January 2017 1 Purpose Where Every Child Matters, Every Staff Matters Parents to know

Titus 1:5-9 It matters who you are as a leader and it matters what you do as a leader. God is

Head of Secondary Mr Simon Oakley Y7 Meet the Tutor Evening Y7 Meet the Tutor Year Group

How can you use NICE resources to help adopt evidence into practice? Jane Moore Implementation

What your Team needs to know and do at a CARA Meet PRIOR TO THE MEET 1. OBTAIN MEET SCHEDULE

Curriculum matters Mark Phillips Senior HMI, London Monday 3 July 2017 Curriculum matters - 3

When (Low ) Pow er Really Matters When (Low ) Pow er Really Matters When (Low ) Pow er Really

Questions? Questions? Questions? Questions? Questions? Questions? Questions? Questions?

Presentation for County Management and Risk Conference If You Cant Say Something Nice, What

A Spectral Algorithm for Learning Class-Based n -gram Models of Natural Language Karl Stratos

On the Approximability of Information Theoretic Clustering Ferdiando Cicalese, U. Verona Eduardo

Machine learning theory Theory of clustering Hamid Beigy Sharif university of technology June

Percolation Theory Percolation Theory Jie Gao Computer Science Department Stony Brook

Cluster algebras and applications Bernhard Keller Universit Paris Diderot Paris 7 DMV

Cluster Production in pBUU - Past and Future Pawel Danielewicz National Superconducting

Completion of Discrete Cluster Categories of type A . Emine Yldrm, joint with Ba Nguyen and

Nature-based Solutions in Practice: Examples from the Moore Foundations Conservation and