Related Work: Performance Evaluation in Clusters - PowerPoint PPT Presentation

  1. The imagination driving Australia’s ICT future.
     Comprehensive Throughput Evaluation of LANs in Clusters of PCs with Switchbench
     or: How to Bring Your Switch to Its Knees
     Felix Rauch, National ICT Australia, felix.rauch@nicta.com.au

  2. Clusters of PCs
     Harness the power of many compute nodes coupled together.
     [Figures: rack-mounted compute cluster; network of workstations]
     Successful because:
     • Commodity off-the-shelf components (PCs, LAN)
     • Often do-it-yourself approach
     • Cost-effective high-performance computing
     Switchbench: How to Bring Your Switch to Its Knees

  3. Understanding Performance in Clusters of Commodity PCs
     [Figure: cluster of PC nodes]
     Switchbench measures the overall network performance.

  7. Overview
     • Introduction
     • Network Performance
     • Evaluation principles
     • Switchbench microbenchmarks with evaluation examples
     • Conclusions

  10. Network Performance in Clusters of PCs
      Supercomputers:
      • Balanced
      • Full bisection
      • Remote deposit
      ➜ Built by design
      Commodity clusters:
      • Cheap (commodity) parts
      • One-fits-all (LAN)
      • Sometimes hacks to improve performance
      ➜ Built by shopping
      Problems when choosing commodity components (they are all different!):
      • Make sure products adhere to specifications (not all do!)
      • Know performance characteristics (they differ widely!)
      ➜ Need benchmark tools for comprehensive evaluation.

  11. Related Work: Performance Evaluation in Clusters
      Analytic models:
      • LogP (Culler 1993)
      • LogGP (Alexandrov 1995)
      Overall benchmark for parallel machines:
      • High-Performance Linpack (Dongarra 1979)
      Point-to-point network benchmarks:
      • Netperf (Jones)
      • NetPIPE (Turner)
      • TTCP (PCAUSA)
      Distributed network benchmark framework:
      • IPbench (Wienand 2004)

  12. Bandwidth vs. Latency
      How to evaluate networks / switches?
      Latency vs. bandwidth:
      • Latency is mostly “given by nature” and is addressed with latency-hiding techniques.
      • One can purchase (additional) bandwidth. There are more interesting cost/performance tradeoffs for additional bandwidth than for lower latency.
      ➜ Focus on bandwidth
      How to measure the bandwidth of entire networks?

  13. Network Limitations
      Three main limitations:
      • End nodes
        – Hardware: network interface controller, CPU, memory, I/O bus.
        – Software: communication protocol stack.
      • Switches
        – Processing limit (number of packets per second).
        – Internal bandwidth limitation.
      • Bisection bandwidth
        – Network architecture (topology).
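
To make the switch "processing limit (number of packets per second)" concrete, the following minimal sketch estimates how many frames per second a single link can deliver at line rate, and therefore how many a switch would have to forward if every port were saturated. It assumes standard Ethernet framing overhead (8 bytes of preamble and start-of-frame delimiter plus a 12-byte inter-frame gap); the function names are hypothetical and not part of Switchbench.

```python
# Assumed standard Ethernet per-frame overhead on the wire.
PREAMBLE_AND_SFD_BYTES = 8
INTERFRAME_GAP_BYTES = 12


def max_frames_per_second(link_bits_per_second: float, frame_bytes: int) -> float:
    """Upper bound on frames per second in one direction of one link."""
    wire_bytes = frame_bytes + PREAMBLE_AND_SFD_BYTES + INTERFRAME_GAP_BYTES
    return link_bits_per_second / (wire_bytes * 8)


def switch_forwarding_load(ports: int, link_bits_per_second: float,
                           frame_bytes: int) -> float:
    """Frames per second a switch must forward if all ports receive at line rate."""
    return ports * max_frames_per_second(link_bits_per_second, frame_bytes)


if __name__ == "__main__":
    for size in (64, 1518):  # minimum and maximum untagged Ethernet frame sizes
        pps = max_frames_per_second(100e6, size)
        print(f"{size:5d}-byte frames on Fast Ethernet: {pps:,.0f} frames/s per port")
```

Small frames stress the packet-processing limit (about 148,810 frames/s per Fast Ethernet port at 64 bytes), while large frames stress the internal bandwidth instead.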

  18. Full Bisection Bandwidth
      A network with N nodes has full bisection bandwidth if the sum of the link bandwidths between any two halves of the network is N/2 times the bandwidth of a single link.
      ⇔ Nodes of any two halves can communicate at full speed with each other.
      Important for programs with global communication patterns.
      Important communication pattern requiring full bisection:
      • All-to-all personalised communication (AAPC): every node exchanges some data with every other node.
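
Why AAPC needs full bisection bandwidth can be illustrated with a simple shift schedule: in round r, node i sends to node (i + r) mod N. This is only a sketch of one common way to schedule AAPC, not necessarily what Switchbench does; the function names are hypothetical. Counting the flows that cross from the lower half of the nodes to the upper half shows that one round pushes N/2 simultaneous flows across the bisection.

```python
from typing import List, Tuple

Pair = Tuple[int, int]  # (sender, receiver)


def aapc_rounds(n: int) -> List[List[Pair]]:
    """Return n-1 rounds; in round r, node i sends to node (i + r) % n."""
    return [[(i, (i + r) % n) for i in range(n)] for r in range(1, n)]


def flows_crossing_bisection(round_pairs: List[Pair], n: int) -> int:
    """Count flows going from the lower half [0, n/2) to the upper half [n/2, n)."""
    half = n // 2
    return sum(1 for src, dst in round_pairs if src < half <= dst)


if __name__ == "__main__":
    n = 8
    for r, pairs in enumerate(aapc_rounds(n), start=1):
        print(f"round {r}: {flows_crossing_bisection(pairs, n)} flows cross the bisection")
```

In round r = N/2, every node in one half sends to a node in the other half, so the network must carry N/2 link bandwidths across the cut in each direction to keep all nodes at full speed.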

  19. Implementation
      • Based on earlier work done at ETH Zurich, together with C. Kurmann & T. Stricker.
      • GNU General Public License (GPL).
      • Core functionality in two small C programs.
      • Shell scripts support:
        – starting programs on many nodes (by ssh)
        – specifying node ranges
        – reordering of virtual node numbers to match physical layout
      • Results in human-readable text file.
      • Implemented and tested on GNU/Linux.
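
The two scripting conveniences listed above, node ranges and reordering of virtual node numbers, can be sketched as follows. The range syntax ("1-4,7,9-10") and both function names are assumptions made for illustration only; they are not taken from the Switchbench scripts, which are shell scripts rather than Python.

```python
from typing import List


def expand_node_ranges(spec: str) -> List[int]:
    """Expand a hypothetical range spec such as "1-4,7,9-10" into node numbers."""
    nodes: List[int] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            first, last = (int(x) for x in part.split("-", 1))
            nodes.extend(range(first, last + 1))
        else:
            nodes.append(int(part))
    return nodes


def order_by_layout(nodes: List[int], physical_order: List[int]) -> List[int]:
    """Reorder the selected nodes so they follow the physical cabling order."""
    position = {node: idx for idx, node in enumerate(physical_order)}
    return sorted(nodes, key=lambda node: position[node])


if __name__ == "__main__":
    selected = expand_node_ranges("1-4,7,9-10")
    print(selected)
    print(order_by_layout([1, 2, 3, 4], physical_order=[3, 1, 4, 2]))
```

Matching virtual node numbers to the physical layout matters because neighbouring nodes in a benchmark should correspond to known switch ports when results are interpreted.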

  21. Benchmark: Daisy Chain
      Virtual TCP daisy chain through an increasing number of nodes.
      [Figure: PC nodes connected in a chain]
      ✓ Next-neighbour communication
      ✗ Bisection bandwidth not tested
      ✓ Full-speed duplex connections on all ports
      ✓ Limited by switch performance
      ✓ Increase load to find switch’s limit
      Result: Bandwidth of TCP chain.
      Taken from Dolly partition-casting tool (disk cloning):
      • Successfully used to install large clusters
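
The idea of a virtual TCP daisy chain can be sketched with a few relay processes: each node receives a byte stream from its predecessor and forwards it to its successor, and the last node acts as a sink. The sketch below runs all relays as threads on localhost purely for illustration; on a real cluster each relay would run on its own node. It is not the Switchbench or Dolly code, and all names (`relay`, `run_chain`, `CHUNK`) are hypothetical.

```python
import socket
import threading
import time
from typing import List, Optional, Tuple

CHUNK = 64 * 1024  # assumed transfer block size


def relay(listener: socket.socket, next_addr: Optional[Tuple[str, int]]) -> int:
    """Accept one upstream connection and forward its bytes downstream.

    If next_addr is None, this node is the end of the chain and only counts bytes.
    """
    upstream, _ = listener.accept()
    downstream = socket.create_connection(next_addr) if next_addr else None
    total = 0
    with upstream:
        while True:
            data = upstream.recv(CHUNK)
            if not data:  # predecessor closed the stream
                break
            total += len(data)
            if downstream:
                downstream.sendall(data)
    if downstream:
        downstream.close()
    listener.close()
    return total


def run_chain(num_relays: int, payload_bytes: int) -> Tuple[int, float]:
    """Send payload_bytes through num_relays nodes; return (bytes at sink, seconds)."""
    listeners: List[socket.socket] = []
    for _ in range(num_relays):
        s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        s.bind(("127.0.0.1", 0))  # ephemeral port on localhost
        s.listen(1)
        listeners.append(s)
    addrs = [s.getsockname() for s in listeners]
    received = [0] * num_relays

    def worker(k: int) -> None:
        next_addr = addrs[k + 1] if k + 1 < num_relays else None
        received[k] = relay(listeners[k], next_addr)

    threads = [threading.Thread(target=worker, args=(k,)) for k in range(num_relays)]
    for t in threads:
        t.start()
    block = b"\0" * CHUNK
    start = time.perf_counter()
    with socket.create_connection(addrs[0]) as head:
        sent = 0
        while sent < payload_bytes:
            n = min(CHUNK, payload_bytes - sent)
            head.sendall(block[:n])
            sent += n
    for t in threads:
        t.join()
    return received[-1], time.perf_counter() - start


if __name__ == "__main__":
    size = 50 * 1024 * 1024
    for length in (1, 2, 4, 8):
        got, secs = run_chain(length, size)
        print(f"chain of {length}: {got * 8 / secs / 1e6:.0f} Mbit/s")
```

Because every node both receives and sends at the same time, lengthening the chain adds full-duplex load on more switch ports without involving more than next-neighbour traffic.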

  22. Daisy-Chain Benchmark: Example Evaluation Platform
      Cluster with 16 nodes:
      • 2 Intel Pentium III, 1 GHz
      • 512 MByte RAM
      • Intel Ethernet Pro 100, Fast Ethernet adapter
      • Packet Engines G-NIC II, Gigabit Ethernet adapter
      Experiments to compare performance characteristics of 3 different switches:
      • Cisco 2900 XL Fast Ethernet switch (24 ports)
      • ATI FS724I Fast Ethernet switch (24 ports)
      • Cabletron SSR8600 Gigabit Ethernet switch (16 ports configured)
