Scaling Challenges in NAMD: Past and Future
NAMD Team: Abhinav Bhatele Eric Bohm Sameer Kumar David Kunzman Chao Mei Chee Wai Lee Kumaresh P. James Phillips Gengbin Zheng Laxmikant Kale Klaus Schulten
Scaling Challenges in NAMD: Past and Future Outline NAMD: An - - PowerPoint PPT Presentation
NAMD Team: Chee Wai Lee Abhinav Bhatele Kumaresh P. Eric Bohm James Phillips Sameer Kumar Gengbin Zheng David Kunzman Laxmikant Kale Chao Mei Klaus Schulten Scaling Challenges in NAMD: Past and Future Outline NAMD: An Introduction
NAMD Team: Abhinav Bhatele Eric Bohm Sameer Kumar David Kunzman Chao Mei Chee Wai Lee Kumaresh P. James Phillips Gengbin Zheng Laxmikant Kale Klaus Schulten
2
– Conflicting Adaptive Runtime Techniques – PME Computation – Memory Requirements
– Load Balancing – Parallel I/O – Fine-grained Parallelization
3
4
5
6
7
Bonded Computes Non-bonded Computes Patch Integration Patch Integration Reductions Multicast Point to Point Point to Point PME
Non-bonded Work Bonded Work Integration PME Communication
8
9
10
11
12
13
14
15
16
17
18
19
20
Memory Reduction
0.01 0.1 1 10 100 1000
IAPP DHFR Lysozyme ApoA1 F1- ATPase STMV Bar Domain Ribosome
Benchmark Memory Usage (MB) Original New
< 0.5 MB < 0.5 MB
21
1 million atom simulation on 64K processors (LLNL BG/L)
22
5570 atom simulation on 512 processors at 1.2 ms/step
23
Time for ApoA1 (ms/step)
24
3.46 4.2 5.52 7.48 11.42 19.59 NAMD CO mode (No MTS) 2.04 2.71 3.78 5.8 9.73 16.83 NAMD CO mode (1 pe/node) 2.09 3.14 5.39 9.97 18.95 38.42 Blue Matter (2 pes/node)
5.3 5.62 9.99 11.99 NAMD VN mode (No MTS) 2.11 2.29 3.06 4.06 6.26 9.82 NAMD VN mode (2 pes/node) 16384 8192 4096 2048 1024 512 Number of Nodes
25
Time (ms/step) for Desmond on 2.4 GHz Opterons and NAMD on 2.6 GHz Xeons
26
1.5 2.0 7.1 9.4 256 1.1 1.4 4.2 5.2 512 1.0
3.0 1024
6.3 11.5 21.0 41.4 Desmond DHFR 2.0 18.2 33.5 64.3 126.8 256.8 Desmond ApoA1 2.4 4.3 8.09 14.9 27.3 NAMD DHFR 1.9 13.4 26.5 50.7 104.9 199.3 NAMD ApoA1 2048 128 64 32 16 8 Number of Cores
27
28
29