Satisfying Dataflow Programs Constraints on Multicore Architectures
Citi lab PhD day 2014 Manuel Selva Supervisor: Lionel Morel Director: Stéphane Frénot Bull: Frédéric Soinne 27th March 2014
1 / 14
Satisfying Dataflow Programs Constraints on Multicore Architectures - - PowerPoint PPT Presentation
Satisfying Dataflow Programs Constraints on Multicore Architectures Citi lab PhD day 2014 Manuel Selva Supervisor: Lionel Morel Director: Stphane Frnot Bull: Frdric Soinne 27th March 2014 1 / 14 More and more parallelism Nehalem
1 / 14
(source: www.intel.com)
(source: www.bit-tech.net)
(source: www.cultofmac.com)
2 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
3 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
3 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
3 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
3 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
3 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
256 256 256 768
3 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler
4 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler
C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
Core1 Core2 Core3 Core4 RAM 1 Core5 Core6 Core7 Core8 DF Mapper RAM 2 Dual socket processor
T1 T2 T5 T6 T3 T4 T7 T8 T9 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
4 / 14
5 / 14
6 / 14
th1 th2 th3 th4 th5 th6 th7 th8 th9
6 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display 25f/s
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler
7 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display 25f/s
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler
Task 8 Merger;
Merger instrumented to measure throughput
7 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display 25f/s
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
Tasks instrumented to measure actors execution times
7 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display 25f/s
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler Core1 Core2 Core3 Core4 RAM 1 Core5 Core6 Core7 Core8 DF Mapper RAM 2 Dual socket processor
T1 T2 T5 T6 T3 T4 T7 T8 T9 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
RAM 1 RAM 2 Memory monitoring using PMU: RAM controllers load QPI traffic
7 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display 25f/s
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler Core1 Core2 Core3 Core4 RAM 1 Core5 Core6 Core7 Core8 DF Mapper RAM 2 Dual socket processor
T1 T2 T5 T6 T3 T4 T7 T8 T9 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
RAM 1 RAM 2 Memory monitoring using PMU: RAM controllers load QPI traffic
7 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler Core1 Core2 Core3 Core4 RAM 1 Core5 Core6 Core7 Core8 DF Mapper RAM 2 Dual socket processor
T1 T2 T5 T6 T3 T4 T7 T8 T9 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
8 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler Core1 Core2 Core3 Core4 RAM 1 Core5 Core6 Core7 Core8 DF Mapper RAM 2 Dual socket processor
T1 T2 T5 T6 T3 T4 T7 T8 T9 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
8 / 14
Parser Text.Y Text.U Text.V Mot.Y Mot.U Mot.V Merger Display
Task 5 Mot.Y; Task 4 Text.V; Task 3 Text.U; Task 2 Text.Y; Task 1 Parser; Task 6 Mot.U; Task 7 Mot.V; Task 8 Merger; Task 9 Display;
DF Compiler Core1 Core2 Core3 Core4 RAM 1 Core5 Core6 Core7 Core8 DF Mapper RAM 2 Dual socket processor
T1 T2 T5 T6 T3 T4 T7 T8 T9 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10
8 / 14
9 / 14
10 / 14
11 / 14
12 / 14
13 / 14
14 / 14