Data-loading (for ML applications) using TDFs
Stefan Wunsch stefan.wunsch@cern.ch 2018-02-22
Motivation
Most of the data analysis in high-level HEP analyses happens in the Python domain (frameworks of analysis groups on top of flat
◮ Efficient selection of data (Filter)
◮ Define new variables (Define)
◮ Other fancy operations (ForEach)
◮ ...
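The chaining of the operations listed above can be illustrated with a small toy sketch in plain Python. This is not the TDF API: the `ToyFrame` class, its lambda-based arguments, and the `pt`/`eta` column names are hypothetical and only mimic the idea that Filter and Define are recorded first and executed in a single pass over the data once ForEach runs.

```python
# Toy stand-in for a lazily evaluated data frame; NOT the ROOT TDF API.
from typing import Callable, Dict, Iterable, Iterator

Row = Dict[str, float]


class ToyFrame:
    """Hypothetical helper that records operations and runs them in one pass."""

    def __init__(self, rows: Iterable[Row], steps=()):
        self._rows = rows
        self._steps = tuple(steps)

    def Filter(self, predicate: Callable[[Row], bool]) -> "ToyFrame":
        # Only record the selection; nothing is evaluated yet.
        return ToyFrame(self._rows, self._steps + (("filter", predicate),))

    def Define(self, name: str, expression: Callable[[Row], float]) -> "ToyFrame":
        # Only record the new variable; nothing is evaluated yet.
        return ToyFrame(self._rows, self._steps + (("define", (name, expression)),))

    def _events(self) -> Iterator[Row]:
        # Single loop over the input, applying the recorded steps in order.
        for row in self._rows:
            row = dict(row)  # copy so the input rows stay untouched
            keep = True
            for kind, payload in self._steps:
                if kind == "filter":
                    if not payload(row):
                        keep = False
                        break
                else:
                    name, expression = payload
                    row[name] = expression(row)
            if keep:
                yield row

    def ForEach(self, action: Callable[[Row], None]) -> None:
        # Triggers the loop and calls the action for every surviving row.
        for row in self._events():
            action(row)


# Made-up example events for illustration only.
events = [
    {"pt": 12.0, "eta": 0.5},
    {"pt": 35.0, "eta": -1.2},
    {"pt": 50.0, "eta": 2.0},
]

selected = []
(ToyFrame(events)
    .Filter(lambda e: e["pt"] > 20.0)
    .Define("abs_eta", lambda e: abs(e["eta"]))
    .ForEach(lambda e: selected.append((e["pt"], e["abs_eta"]))))

print(selected)  # [(35.0, 1.2), (50.0, 2.0)]
```

In this sketch the new variable is computed only for events that pass the selection, because the steps run in the order they were chained.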
[Figure: elapsed time in seconds versus size of data in MB, for TDF with 1, 2, 3 and 4 threads and for root_numpy]
◮ Input argument handling: Interpreted as float*, shape
◮ Return value handling: Not supported (?)