Thomas Jefferson National Accelerator Facility
NP SciDAC Project: JLab Site Report
Bálint Joó Jefferson Lab, Oct 18, 2013
Thursday, October 17, 2013
JLab Year 1 Tasks
T1: Extend the Just-In-Time (JIT) based version of QDP++ to use
Good Mem B/W region shoulder
Figure: Trajectory time in seconds (2000 to 10000) vs. nodes (400 to 1600), CPU (all MPI) vs. GPU (JITPTX+QUDA). V=40x40x40x256, mπ ~ 230 MeV, anisotropic clover. Annotations: 0.65x execution time = 1.53x speedup; 0.5x execution time = 2x speedup.
Start of shoulder region (~14^4 sites/node)
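The speedup annotations and the shoulder estimate on this slide are simple arithmetic, which can be sketched as follows. This is a minimal illustration and not code from the project: the function names are hypothetical, and it assumes the shoulder annotation reads as roughly 14^4 local sites per node on the V=40x40x40x256 lattice.

```python
from math import prod


def speedup(time_ratio: float) -> float:
    """Speedup implied by an execution-time ratio (new time / old time)."""
    if time_ratio <= 0:
        raise ValueError("time_ratio must be positive")
    return 1.0 / time_ratio


def sites_per_node(dims, nodes: int) -> float:
    """Average number of lattice sites per node for a global lattice."""
    return prod(dims) / nodes


def nodes_for_local_volume(dims, local_sites: int) -> float:
    """Node count at which each node holds `local_sites` sites."""
    return prod(dims) / local_sites


GLOBAL_LATTICE = (40, 40, 40, 256)  # V quoted on the plot
SHOULDER_LOCAL_SITES = 14 ** 4      # assumed reading of "~14^4 sites/node"

if __name__ == "__main__":
    # 0.5x the execution time is a 2x speedup; 0.65x is a little over 1.5x.
    print(speedup(0.5), speedup(0.65))
    # Node count where the local volume drops to about 14^4 sites.
    print(nodes_for_local_volume(GLOBAL_LATTICE, SHOULDER_LOCAL_SITES))
```

Under these assumptions the local volume falls to about 14^4 sites at a little over 400 nodes, which is where the node axis of the plot begins.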
Vaidyanathan primarily,
Figure: Wilson Dslash (axis 0 to 300). Bars: Uncompressed and Compressed for each of Intel Xeon E5-2680 (SNB-EP), Intel Xeon Phi (KNC) 5110P, Intel Xeon Phi (KNC) B1PRQ-7110P, and NVIDIA Kepler K20m. Volumes: V=24x24x24x128, V=32x32x32x128, V=40x40x40x96, V=48x48x24x64, V=32x40x24x96.
From: B. Joo, D. D. Kalamkar, K. Vaidyanathan, M. Smelyanskiy, K. Pamnani, V. W. Lee, P. Dubey, W. Watson III, “Lattice QCD on Intel(R) Xeon Phi(tm) Coprocessors”, Proceedings of ISCʼ13 (Leipzig), Lecture Notes in Computer Science Vol. 7905 (to appear).
Figure: two panels, Wilson Dslash and Wilson CG. x-axis: Number of Xeon Phi units (2, 4, 8, 16, 32). Series: V=32x32x32x256 and V=48x48x48x256.
From: B. Joo, D. D. Kalamkar, K. Vaidyanathan, M. Smelyanskiy, K. Pamnani, V. W. Lee, P. Dubey, W. Watson III, “Lattice QCD on Intel(R) Xeon Phi(tm) Coprocessors”, Proceedings of ISCʼ13 (Leipzig), Lecture Notes in Computer Science Vol. 7905 (to appear).
to multiple nodes
introducing second comms direction
progress rather than attainable bandwidth or latency
to 2D comms. More likely due to B/W constraints...
nodes (CG to 16.8 TF)
Figure: Wilson Dslash, weak scaling, 48x48x24x64 sites per node, single precision. x-axis: number of nodes (1 to 128); y-axis: GFLOPS per node (50 to 300). Series: without proxy and with CML proxy. Regions: communication in 1 dimension and communication in 2 dimensions. Annotation: 31 TF.
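The relation between the per-node rate on the y-axis and an aggregate figure such as the 31 TF annotation can be sketched as below. This is an illustrative calculation only: the helper names are hypothetical, and it assumes the 31 TF annotation refers to the largest node count on the axis (128), which the slide text does not state explicitly.

```python
from math import prod


def aggregate_tflops(gflops_per_node: float, nodes: int) -> float:
    """Total rate in TFLOPS when every node sustains the same per-node rate."""
    return gflops_per_node * nodes / 1000.0


def per_node_gflops(total_tflops: float, nodes: int) -> float:
    """Per-node rate in GFLOPS implied by an aggregate rate in TFLOPS."""
    return total_tflops * 1000.0 / nodes


def global_sites(local_dims, nodes: int) -> int:
    """Total lattice sites in a weak-scaling run (fixed local volume)."""
    return prod(local_dims) * nodes


LOCAL_LATTICE = (48, 48, 24, 64)  # sites per node, from the plot title
ASSUMED_NODES = 128               # assumption: 31 TF is the 128-node point

if __name__ == "__main__":
    # Per-node rate implied by 31 TF spread over the assumed node count.
    print(per_node_gflops(31.0, ASSUMED_NODES))
    # Total problem size at that node count under weak scaling.
    print(global_sites(LOCAL_LATTICE, ASSUMED_NODES))
```

Under that assumption the implied per-node rate is roughly 242 GFLOPS, which lies inside the 50 to 300 GFLOPS range of the plot's y-axis.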
ISCʼ13 paper
the performance difference but
to different virtual topology on 16 nodes
and 3.6TF reached in CG on Stampede
Figure: 48x48x48x256 sites, strong scaling, single precision, using CML proxy. x-axis: number of nodes (4 to 64); y-axis: GFLOPS (1000 to 6000). Series: Dslash Endeavor, Dslash Stampede, CG Endeavor, CG Stampede.