SLIDE 36 Conc Conclus usion ion
■ Performance analysis of SpGEMM on Intel KNL and multicore architectures
– Optimizing implementation for these architectures
■ Identify t the b bottlenecks
– Evaluation in various use cases
■ Clarify w which S SpGEMM a algorithm w works w well
– Highlighting the benefit o
leaving m matrices u unsorted – Empirical r recipe for selecting the best-performing algorithm for a specific application scenario
35 35
Source code is publicly available at https://bitbucket.org/YusukeNagasaka/mtspgemmlib