Dynamic Load Balancing in Charm++
Abhinav S Bhatele Parallel Programming Lab, UIUC
Outline
Dynamic Load Balancing framework in Charm++
Measurement Based Load Balancing
Examples:
[Figure: Memory usage (MB) versus number of objects (128K, 256K, 512K, 1M), for 32K processors and 64K processors.]
The lb_test benchmark is a parameterized program that creates a specified number of communicating objects in a 2D mesh.
The simulation was performed with the BigSim performance simulator.
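To make the communication pattern concrete, here is a minimal sketch of objects arranged in a 2D mesh. It assumes row-major numbering, four nearest neighbors, and no wraparound at the edges; the slides do not give these details, and `mesh_neighbors` and `build_mesh` are hypothetical helper names, not part of lb_test.

```python
def mesh_neighbors(index, rows, cols):
    """Return the in-bounds mesh neighbors of object `index`.

    Assumptions (not from the slides): row-major numbering,
    four-point stencil, no wraparound at the mesh edges.
    """
    r, c = divmod(index, cols)
    candidates = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
    return [
        nr * cols + nc
        for nr, nc in candidates
        if 0 <= nr < rows and 0 <= nc < cols
    ]


def build_mesh(rows, cols):
    """Map every object index to the list of objects it communicates with."""
    return {i: mesh_neighbors(i, rows, cols) for i in range(rows * cols)}
```

The resulting dictionary is a simple object communication graph: corner objects have two neighbors, edge objects three, and interior objects four.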
[Figure: Execution time (in seconds) versus number of objects (128K, 256K, 512K, 1M) for GreedyLB, GreedyCommLB, and RefineLB.]
Execution time of load balancing algorithms on a 64K processor simulation
[Figure: 64K processor hierarchical tree with levels labeled Level 0, Level 1, and Level 2. Processors 0 … 65535 are arranged in groups of 1024 (0 … 1023, 1024 … 2047, …, 63488 … 64511, 64512 … 65535), giving 64 groups.]
Apply different strategies at each level
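The grouping in the tree can be expressed with simple integer arithmetic. The sketch below assumes that the first processor of each 1024-processor group acts as the group's representative at the next level; the function names are hypothetical and do not come from the Charm++ API.

```python
GROUP_SIZE = 1024  # processors per group, taken from the tree labels


def group_leader(rank, group_size=GROUP_SIZE):
    """Return the first rank of the group containing `rank` (assumed leader)."""
    return (rank // group_size) * group_size


def group_members(leader, group_size=GROUP_SIZE):
    """Return the ranks that belong to the group starting at `leader`."""
    return range(leader, leader + group_size)


def group_leaders(num_procs, group_size=GROUP_SIZE):
    """Return the leaders of all groups for a machine of `num_procs` ranks."""
    return list(range(0, num_procs, group_size))
```

For 65536 processors this yields 64 leaders (0, 1024, …, 63488, 64512), which matches the node labels in the tree.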
[Figure: the 64K processor hierarchical tree annotated with the load balancing flow. Labels: Load Data (OCG); Refinement-based Load balancing; Greedy-based Load balancing; Load Data token.]
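The two strategy families named above can be sketched generically. This is an illustration of the general greedy and refinement ideas under simple assumptions (positive per-object loads, no communication costs, a 5% tolerance over the average); it is not the code of any Charm++ load balancer, and all names are hypothetical.

```python
import heapq


def greedy_assign(object_loads, num_procs):
    """Greedy: place the heaviest remaining object on the least-loaded processor."""
    heap = [(0.0, p) for p in range(num_procs)]  # (current load, processor)
    heapq.heapify(heap)
    assignment = {}
    for obj, load in sorted(object_loads.items(), key=lambda kv: -kv[1]):
        proc_load, proc = heapq.heappop(heap)
        assignment[obj] = proc
        heapq.heappush(heap, (proc_load + load, proc))
    return assignment


def processor_loads(object_loads, assignment, num_procs):
    """Sum the object loads assigned to each processor."""
    loads = [0.0] * num_procs
    for obj, proc in assignment.items():
        loads[proc] += object_loads[obj]
    return loads


def refine(object_loads, assignment, num_procs, tolerance=1.05):
    """Refinement: keep the existing mapping and move only a few objects
    from the most loaded processor to the least loaded one, until the
    maximum load is within `tolerance` times the average (when possible)."""
    assignment = dict(assignment)  # do not modify the caller's mapping
    loads = processor_loads(object_loads, assignment, num_procs)
    limit = tolerance * sum(loads) / num_procs
    while True:
        src = max(range(num_procs), key=loads.__getitem__)
        if loads[src] <= limit:
            break  # balanced within tolerance
        dst = min(range(num_procs), key=loads.__getitem__)
        # Objects on src that fit on dst without pushing dst over the limit.
        candidates = [
            o for o, p in assignment.items()
            if p == src and object_loads[o] > 0
            and loads[dst] + object_loads[o] <= limit
        ]
        if not candidates:
            break  # no legal move left; stop with the best effort so far
        obj = max(candidates, key=object_loads.__getitem__)
        assignment[obj] = dst
        loads[src] -= object_loads[obj]
        loads[dst] += object_loads[obj]
    return assignment
```

The greedy sketch rebuilds the whole mapping from scratch, which can migrate many objects. The refinement sketch starts from the current mapping and moves only as many objects as it needs to.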
[Figure: Memory usage (MB) versus number of objects (256K, 512K, 1M) for CentralLB and HybridLB.]
Simulation of lb_test for 64K processors
[Figure: Time (s) versus number of objects (256K, 512K, 1M) for GreedyCommLB and HybridLB(GreedyCommLB).]
Simulation of lb_test for 64K processors
lb_test benchmark's actual run on BG/L at IBM (512K objects)
N procs    4096     8192      16384
Memory     6.8MB    22.57MB   22.63MB
[Figure: Maximum predicted load (seconds) versus number of objects (256K, 512K, 1M) for GreedyCommLB and HybridLB.]
Simulation of lb_test for 64K processors