toward understanding heterogeneity in computing
play

Toward Understanding Heterogeneity in Computing Arnold L. Rosenberg - PowerPoint PPT Presentation

Toward Understanding Heterogeneity in Computing Arnold L. Rosenberg Ron C. Chiang Electrical & Computer Engineering Colorado State University Fort Collins, CO, 80523, USA Heterogeneity in Computing One encounters HETEROGENEITY in virtually


  1. Toward Understanding Heterogeneity in Computing Arnold L. Rosenberg Ron C. Chiang Electrical & Computer Engineering Colorado State University Fort Collins, CO, 80523, USA

  2. Heterogeneity in Computing One encounters HETEROGENEITY in virtually all modern computing systems

  3. Heterogeneity in Computing One encounters heterogeneity in virtually all modern computing systems • Computers in clusters/grids differ in power ( NODE-HETEROGENEITY ).

  4. Heterogeneity in Computing One encounters heterogeneity in virtually all modern computing systems • Computers in clusters/grids differ in power ( node-heterogeneity ). • Computers intercommunicate across varied networks ( LINK-HETEROGENEITY ).

  5. Heterogeneity in Computing One encounters heterogeneity in virtually all modern computing systems • Computers in clusters/grids differ in power ( node-heterogeneity ). • Computers intercommunicate across varied networks ( link-heterogeneity ). WE FOCUS ON NODE-HETEROGENEITY .

  6. “Big” Questions about Heterogeneity Heterogeneity complicates the efficient use of multicomputer platforms

  7. “Big” Questions about Heterogeneity Heterogeneity complicates the efficient use of multicomputer platforms — BUT CAN IT ENHANCE THEIR PERFORMANCE?

  8. “Big” Questions about Heterogeneity Heterogeneity complicates the efficient use of multicomputer platforms — but can it enhance their performance? HOW DOES ONE STUDY THIS QUESTION RIGOROUSLY?

  9. Detailed Questions about Heterogeneity • WHAT MAKES ONE CLUSTER MORE POWERFUL THAN ANOTHER?

  10. Detailed Questions about Heterogeneity • What makes one cluster more powerful than another? • ARE YOU BETTER OFF . . . — WITH ONE SUPER-FAST COMPUTER AND MANY “AVERAGE” ONES?

  11. Detailed Questions about Heterogeneity • What makes one cluster more powerful than another? • ARE YOU BETTER OFF . . . — WITH ONE SUPER-FAST COMPUTER AND MANY “AVERAGE” ONES? — WITH ALL COMPUTERS “MODERATELY” FAST?

  12. Detailed Questions about Heterogeneity • What makes one cluster more powerful than another? • Are you better off with — one super-fast computer and many “average” ones — or with all computers “moderately” fast? • IF YOU COULD “SPEED UP” JUST ONE COMPUTER . . . WHICH ONE WOULD YOU CHOOSE?

  13. Detailed Questions about Heterogeneity • What makes one cluster more powerful than another? • Are you better off with — one super-fast computer and many “average” ones — or with all computers “moderately” fast? • IF YOU COULD “SPEED UP” JUST ONE COMPUTER . . . WHICH ONE WOULD YOU CHOOSE? — THE FASTEST ONE?

  14. Detailed Questions about Heterogeneity • What makes one cluster more powerful than another? • Are you better off with — one super-fast computer and many “average” ones — or with all computers “moderately” fast? • IF YOU COULD “SPEED UP” JUST ONE COMPUTER . . . WHICH ONE WOULD YOU CHOOSE? — THE FASTEST ONE? — THE SLOWEST ONE?

  15. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n

  16. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units.

  17. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units. C ’s heterogeneity profile: P C = � ρ 1 , ρ 2 , . . . , ρ n �

  18. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units. C ’s heterogeneity profile: P C = � ρ 1 , ρ 2 , . . . , ρ n � One finds in M. Adler, Y. Gong, A.L. Rosenberg (2008): On “exploiting” node-heterogeneous clusters optimally. Theory of Computing Systems 42 , 465–487 a solution to the CLUSTER-EXPLOITATION PROBLEM . . . — a search for a schedule that maximizes C ’s rate of completing work

  19. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units. C ’s heterogeneity profile: P C = � ρ 1 , ρ 2 , . . . , ρ n � One finds in M. Adler, Y. Gong, A.L. Rosenberg (2008): On “exploiting” node-heterogeneous clusters optimally. Theory of Computing Systems 42 , 465–487 a solution to the CLUSTER-EXPLOITATION PROBLEM THE OPTIMAL SCHEDULE FOR C DEPENDS ONLY ON P C

  20. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units. C ’s heterogeneity profile: P C = � ρ 1 , ρ 2 , . . . , ρ n � One finds in M. Adler, Y. Gong, A.L. Rosenberg (2008): On “exploiting” node-heterogeneous clusters optimally. Theory of Computing Systems 42 , 465–487 a solution the CLUSTER-EXPLOITATION PROBLEM The optimal schedule for C depends only on P C THE WORK COMPLETED UNDER THIS SCHEDULE IS OUR MEASURE OF C ’s “POWER”

  21. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units. C ’s heterogeneity profile: P C = � ρ 1 , ρ 2 , . . . , ρ n � C ’s “power”: the work completed by the optimal solution to the CLUSTER-EXPLOITATION PROBLEM The expression for this work is complicated . . . — so we also measure C ’s “power” by its HECR: Homogeneous Equivalent Computing Rate

  22. A Formal Framework for Studying the Questions Cluster C has computers C 1 , C 2 , . . . , C n C i completes one unit of work in ρ i time units. C ’s heterogeneity profile: P C = � ρ 1 , ρ 2 , . . . , ρ n � C ’s HECR ( Homogeneous Equivalent Computing Rate ) . . . the computing rate ρ ( C ) such that the HOMOgeneous cluster with profile � ρ ( C ) , ρ ( C ) , . . . , ρ ( C ) � completes work at the same rate as C .

  23. ON TO OUR QUESTIONS!

  24. Which ONE Computer Should You Speed UP?

  25. Which Computer to Speed Up: Additive Speedup Speeding up computer C i additively by the amount ϕ . . . replaces profile P C = � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � by profile P C = � ρ 1 , . . . , ρ i − 1 , ρ i − ϕ , ρ i +1 , . . . , ρ n � Say that 0 < ϕ < min i { ρ i } , so every C i can be sped up.

  26. Which Computer to Speed Up: Additive Speedup Speeding up computer C i additively by the amount ϕ : � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � − → � ρ 1 , . . . , ρ i − 1 , ρ i − ϕ , ρ i +1 , . . . , ρ n � Theorem . Under the additive-speedup scenario, the most advantageous single computer to speed up is C ’s fastest computer.

  27. Which Computer to Speed Up: Additive Speedup Speeding up computer C i additively by the amount ϕ : � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � − → � ρ 1 , . . . , ρ i − 1 , ρ i − ϕ , ρ i +1 , . . . , ρ n � Theorem . Under the additive-speedup scenario, the most advantageous single computer to speed up is C ’s fastest computer. Initial profile: � 1 , 1 / 2 , 1 / 3 , 1 / 4 � Speedup amount: ϕ = 1 / 16 Speed up Work ratio i computer C i OLD ÷ NEW 1 � 15 / 16 , 1 / 2 , 1 / 3 , 1 / 4 � 1 . 008 2 � 1 , 7 / 16 , 1 / 3 , 1 / 4 � 1 . 014 3 � 1 , 1 / 2 , 13 / 48 , 1 / 4 � 1 . 034 4 � 1 , 1 / 2 , 1 / 3 , 3 / 16 � 1 . 159

  28. Which Computer to Speed Up: Additive Speedup Speeding up computer C i additively by the amount ϕ : � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � − → � ρ 1 , . . . , ρ i − 1 , ρ i − ϕ , ρ i +1 , . . . , ρ n � Theorem . Under the additive-speedup scenario, the most advantageous single computer to speed up is C ’s fastest computer. Speed up Work ratio i computer C i OLD ÷ NEW 1 � 15 / 16 , 1 / 2 , 1 / 3 , 1 / 4 � 1 . 008 2 � 1 , 7 / 16 , 1 / 3 , 1 / 4 � 1 . 014 3 � 1 , 1 / 2 , 13 / 48 , 1 / 4 � 1 . 034 4 � 1 , 1 / 2 , 1 / 3 , 3 / 16 � 1 . 159 INTUITION: MORE BANG FOR THE BUCK

  29. Which Computer to Speed Up: Multiplicative Speedup Speeding up computer C i multiplicatively by factor ψ . . . replaces profile P C = � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � by profile P C = � ρ 1 , . . . , ρ i − 1 , ψρ i , ρ i +1 , . . . , ρ n � Say that 0 < ψ < 1 , so every C i can be sped up.

  30. Which Computer to Speed Up: Multiplicative Speedup Speeding up computer C i multiplicatively by factor ψ : � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � − → � ρ 1 , . . . , ρ i − 1 , ψρ i , ρ i +1 , . . . , ρ n � Say that 0 < ψ < 1 , so every C i can be sped up finitely. “ Theorem .” Under the multiplicative-speedup scenario: The most advantageous single computer to speed up is C ’s fastest computer . . .

  31. Which Computer to Speed Up: Multiplicative Speedup Speeding up computer C i multiplicatively by factor ψ : � ρ 1 , . . . , ρ i − 1 , ρ i , ρ i +1 , . . . , ρ n � − → � ρ 1 , . . . , ρ i − 1 , ψρ i , ρ i +1 , . . . , ρ n � Say that 0 < ψ < 1 , so every C i can be sped up finitely. “ Theorem .” Under the multiplicative-speedup scenario: The most advantageous single computer to speed up is C ’s fastest computer . . . — UNLESS

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend