SLIDE 12
for i ← 0 : N_C : M − 1
  for j ← 0 : N_C : N − 1
    for k ← 0 : N_C : L − 1
      for i′ ← i : M_U : i + N_C − 1
        for j′ ← j : N_U : j + N_C − 1
          for k′ ← k : L_U : k + N_C − 1
            for k″ ← k′ : 1 : k′ + L_U − 1
              for i″ ← i′ : 1 : i′ + M_U − 1
                for j″ ← j′ : 1 : j′ + N_U − 1
                  D_{i″,j″} = D_{i″,j″} + B_{i″,k″} × C_{k″,j″}
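The loop nest above can be sketched in C roughly as follows. This is a minimal sketch under stated assumptions, not the slide's own code: the tile sizes `NC`, `MU`, `NU`, and `LU` are hypothetical values picked for illustration, the matrices are assumed to be stored row-major, and every dimension is assumed to be an exact multiple of the tile sizes so that no cleanup loops are needed.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical tile sizes, chosen only for illustration. */
enum { NC = 8, MU = 2, NU = 2, LU = 4 };

/* D (M x N) += B (M x L) * C (L x N), all row-major.
 * Assumes M, N, L are multiples of NC, and NC is a multiple of MU, NU, LU. */
void mmm_blocked(size_t M, size_t N, size_t L,
                 double *D, const double *B, const double *C)
{
    assert(M % NC == 0 && N % NC == 0 && L % NC == 0);
    assert(NC % MU == 0 && NC % NU == 0 && NC % LU == 0);

    /* First level: NC x NC blocks (intended to fit in cache). */
    for (size_t i = 0; i < M; i += NC)
    for (size_t j = 0; j < N; j += NC)
    for (size_t k = 0; k < L; k += NC)
        /* Second level: MU x NU tiles of D inside the current block. */
        for (size_t i1 = i; i1 < i + NC; i1 += MU)
        for (size_t j1 = j; j1 < j + NC; j1 += NU)
        for (size_t k1 = k; k1 < k + NC; k1 += LU)
            /* Innermost nest: unit-stride loops over one tile. */
            for (size_t k2 = k1; k2 < k1 + LU; k2++)
            for (size_t i2 = i1; i2 < i1 + MU; i2++)
            for (size_t j2 = j1; j2 < j1 + NU; j2++)
                D[i2 * N + j2] += B[i2 * L + k2] * C[k2 * N + j2];
}
```

The three innermost loops have small, fixed trip counts, which is what makes them candidates for full unrolling.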
2nd Level Blocking
$M_U + N_U + M_U \times N_U \le N_R$
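As a worked check of this register constraint, the sketch below tests candidate tile shapes. It assumes the usual reading of the inequality: an M_U × N_U tile of accumulators plus M_U operands from one input and N_U from the other must fit in N_R registers. The function names `tile_fits` and `best_tile` are hypothetical helpers, not part of the slide.

```c
#include <assert.h>

/* Returns 1 if an mu x nu register tile satisfies
 * mu + nu + mu*nu <= nr, and 0 otherwise. */
int tile_fits(int mu, int nu, int nr)
{
    assert(mu > 0 && nu > 0 && nr > 0);
    return mu + nu + mu * nu <= nr;
}

/* Exhaustive search for the feasible tile with the most
 * multiply-adds per iteration (largest mu*nu).
 * Returns that product, or 0 if no tile fits; *best_mu and
 * *best_nu are written only when a feasible tile is found. */
int best_tile(int nr, int *best_mu, int *best_nu)
{
    int best = 0;
    for (int mu = 1; mu <= nr; mu++) {
        for (int nu = 1; nu <= nr; nu++) {
            if (tile_fits(mu, nu, nr) && mu * nu > best) {
                best = mu * nu;
                *best_mu = mu;
                *best_nu = nu;
            }
        }
    }
    return best;
}
```

For example, with 16 registers a 2 × 4 tile fits (2 + 4 + 8 = 14), a 4 × 4 tile does not (4 + 4 + 16 = 24), and the search settles on 3 × 3 (3 + 3 + 9 = 15).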
[Figure: block diagrams of the tiled matrix product, each of the form tile = tile × tile]
Graphic from “How To Write Fast Numerical Code: A Small Introduction,” Srinivas Chellappa, Franz Franchetti, and Markus Püschel
Unroll Loop