Convolutional neural network for centrality in fixed target - PowerPoint PPT Presentation



SLIDE 1

Convolutional neural network for centrality in fixed target experiments

Denis Uzhva, 6 June 2019

Saint Petersburg University, Laboratory of Ultra High Energy Physics

WPCF2019

SLIDE 2

Table of contents

  • 1. Introduction
  • 2. Machine learning in HEP
  • 3. Results and comparison
  • 4. Conclusions

SLIDE 3

Introduction

SLIDE 4

The critical point of the QGP to hadronic matter transition

Quark matter phase diagram

SLIDE 5

Fluctuations of centrality

The critical point can be found (if it exists) by analyzing the fluctuations of centrality

Types of fluctuations

SLIDE 6

The scheme of NA61/SHINE

Centrality is measured using only the forward energy from the Projectile Spectator Detector (PSD)

SLIDE 7

Energy cloud

SHIELD MC + GEANT4 model of the PSD (Li7 + Be9). We have a dataset of 80000 minimum-bias events.

Histogram of the events

SLIDE 8

The reality behind measurements

What we measure vs. what we want to measure

The problems stem from energy leakage, the sandwich structure of the calorimeter, the finite resolution of the electronics, and the matter between the target and the PSD

SLIDE 9

Cut-based analysis

Let's select the 15.8% most central events (by both Etrue and Emeas). The accuracy ε is calculated as ε = (TP + TN)/(TP + TN + FP + FN)

ε = 93.1%
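The accuracy defined above is straightforward to compute from confusion-matrix counts; a minimal sketch (the counts below are hypothetical, as the slide only reports the final 93.1% figure):

```python
# Accuracy of a binary (central / non-central) selection from
# confusion-matrix counts: eps = (TP + TN) / (TP + TN + FP + FN)
def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    return (tp + tn) / (tp + tn + fp + fn)

# Hypothetical counts for 20000 validation events
print(round(accuracy(tp=2800, tn=15820, fp=690, fn=690), 3))  # 0.931
```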

SLIDE 11

NA61/SHINE’s PSD data as pictures

In fact, data from the PSD can be treated as 3D images, so we can try to use convolutional neural networks for the analysis

SLIDE 12

Machine learning in HEP

SLIDE 13

What is it all about...

A modern, general-purpose method for solving a wide variety of problems

SLIDE 14

The tasks for ML

Image processing

SLIDE 15

The tasks for ML

Curves separate two classes

SLIDE 16

Convolutional Neural Networks

The concept of a CNN is motivated by the way a real eye works

Cat-Dog classification with CNN (source: https://sourcedexter.com/quickly-setup-tensorflow-image-recognition/)

SLIDE 17

Convolutional Neural Networks

Convolution explained
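The convolution the figure illustrates can be sketched in a few lines of numpy; this is a generic "valid" 2D cross-correlation (no padding, stride 1), not the experiment's actual code:

```python
import numpy as np

def conv2d_valid(image: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Slide the kernel over the image and take a weighted sum
    at every position ('valid' mode: no padding, stride 1)."""
    kh, kw = kernel.shape
    oh = image.shape[0] - kh + 1
    ow = image.shape[1] - kw + 1
    out = np.empty((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# A 3x3 vertical-edge kernel applied to a toy 4x4 "energy image"
image = np.array([[0., 0., 1., 1.],
                  [0., 0., 1., 1.],
                  [0., 0., 1., 1.],
                  [0., 0., 1., 1.]])
kernel = np.array([[-1., 0., 1.],
                   [-1., 0., 1.],
                   [-1., 0., 1.]])
# Every 3x3 window here straddles the vertical edge, so all responses are 3
print(conv2d_valid(image, kernel))
```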

SLIDE 18

Machine learning... in HEP?

ML takes care of Big Data

JETP seminar “First Oscillation Results from NOvA”, 2018

SLIDE 19

Results and comparison

SLIDE 20

The task

Basically, we want to distinguish two centrality classes: a) the 15.8% most central events, b) all others. The dataset of 80000 minimum-bias events is obtained with the SHIELD MC + GEANT4 model of the PSD (Li7 + Be9); 60k events are used for training and 20k for validation.
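The 60k/20k train/validation split described above can be sketched as follows (the array contents are placeholders; the real events are 3D energy depositions, not single numbers):

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for the 80000 simulated minimum-bias events
events = rng.random(80000)

# Shuffle, then take 60k events for training and 20k for validation
idx = rng.permutation(len(events))
train, val = events[idx[:60000]], events[idx[60000:]]
print(len(train), len(val))  # 60000 20000
```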

The modules we choose

Only the central "+"-shaped set of PSD modules is of interest, as in the experiment

SLIDE 22

Imperfection of the simulations

  • No matter between target and PSD :(
  • The electronics are not simulated :((

SLIDE 23

Definition of centrality

Therefore, two CNN models were trained: CNNe (centrality defined by forward energy) and CNNn (centrality defined by the number of spectators)

SLIDE 24

Histogram analysis (by energy)

Cut-based: ε = 93.0%

SLIDE 25

CNN separation (1st class, CNNe)

The events the CNN considered to be from the 1st class

SLIDE 26

CNN separation (2nd class, CNNe)

The events the CNN considered to be from the 2nd class

SLIDE 27

Histogram analysis (by spectators)

Cut-based: ε = 86.7%

SLIDE 28

CNN separation (1st class, CNNn)

The events the CNN considered to be from the 1st class

SLIDE 29

CNN separation (2nd class, CNNn)

The events the CNN considered to be from the 2nd class

SLIDE 30

Accuracy of the CNN

The CNN shows better accuracy, especially in the Nspec classification task

            Forward energy   Nspec
Cut-based   93.0%            86.7%
CNN         93.7%            92.8%

SLIDE 31

Average multiplicities and variances

The N and ω values were calculated for the events from the 1st centrality class. Here centrality = forward energy

                 N       ω
Forward energy   19.59   6.07
Cut-based        18.56   7.02
CNNe             18.69   6.82

By forward energy
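The slide does not define ω; assuming it is the scaled variance of the multiplicity distribution, ω = Var(N)/⟨N⟩ (a common fluctuation measure, but an assumption here), it could be computed as:

```python
import numpy as np

def scaled_variance(multiplicities: np.ndarray) -> float:
    # omega = Var(N) / <N>; whether the slide's omega is exactly
    # this quantity is an assumption
    n = np.asarray(multiplicities, dtype=float)
    return float(n.var() / n.mean())

# Toy multiplicity sample (not the real data)
sample = np.array([18, 20, 19, 25, 14, 22, 17, 21])
print(scaled_variance(sample))  # 0.5 for this toy sample
```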

SLIDE 32

Average multiplicities and variances

The N and ω values were calculated for the events from the 1st centrality class; centrality = number of spectators

            N       ω
Nspec       15.69   7.58
Cut-based   18.56   7.02
CNNn        16.36   7.35

By number of spectators

SLIDE 33

Conclusions

SLIDE 34

Further ideas

  • I. Cross-validation on different MC
  • II. Modifications of the CNN
  • III. Application to real data

SLIDE 35

Application to other experiments!

Moreover, such a CNN can be used in other experiments, e.g. at NICA or FAIR, since they have quite similar calorimeters

SLIDE 36

Special thanks

The work was supported by the Russian Science Foundation, grant number 17-72-20045

SLIDE 39

A simple neural net

The most popular way to create A.I. today is to develop a sufficiently clever artificial neural network. Here is an example of one.

A very simple ANN; ELU and ReLU functions
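The two activation functions named in the caption can be written down directly; a minimal sketch:

```python
import math

def relu(x: float) -> float:
    # ReLU: zero for negative inputs, identity for positive
    return max(0.0, x)

def elu(x: float, alpha: float = 1.0) -> float:
    # ELU: identity for positive inputs, smooth exponential
    # saturation toward -alpha for negative inputs
    return x if x > 0 else alpha * (math.exp(x) - 1.0)

print(relu(-2.0), relu(3.0))  # 0.0 3.0
print(elu(-2.0), elu(3.0))    # negative but > -1, then 3.0
```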

SLIDE 40

CNN architecture

We vary the parameters of the neural network in order to achieve the best accuracy.

CNN for centrality classification

SLIDE 41

CNN architecture, but much simpler

In order to understand the concept of training, consider a simplified model. The pair X and z is the input data and labels respectively, ŵ is the weight multitensor, x is a prediction, SCE stands for "sigmoid cross-entropy", and Adam is the optimizer

SLIDE 42

Sigmoid cross-entropy

In binary classification, the loss function can be calculated as: L(x, z) = −z · log σ(x) − (1 − z) · log(1 − σ(x)), where σ(x) = 1/(1 + exp(−x)). Here x is the prediction (x = x(ŵ, X), a function of the weights ŵ and the input data X) and z is the label
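The loss above translates directly into code; a minimal, non-stabilized sketch (production implementations use a numerically safer form):

```python
import math

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

def sigmoid_cross_entropy(x: float, z: float) -> float:
    # L(x, z) = -z*log(sigma(x)) - (1 - z)*log(1 - sigma(x))
    s = sigmoid(x)
    return -z * math.log(s) - (1.0 - z) * math.log(1.0 - s)

# A confident correct prediction gives a small loss,
# a confident wrong prediction a large one
print(sigmoid_cross_entropy(4.0, 1.0))  # small
print(sigmoid_cross_entropy(4.0, 0.0))  # large
```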

SLIDE 43

Adam optimizer

The parameters update iteratively as follows:

t := t + 1;
l_t := l_{t−1} · √(1 − β₂^t) / (1 − β₁^t);
m̂_t := β₁ · m̂_{t−1} + (1 − β₁) · ĝ_{t−1};
v̂_t := β₂ · v̂_{t−1} + (1 − β₂) · ĝ²_{t−1};
ŵ_t := ŵ_{t−1} − l_t · m̂_t / (√v̂_t + ε);

where t is the epoch number, β₁ and β₂ are momenta, l_t is the learning rate, m̂_t is the "moving average" of the gradient, v̂_t is the "moving average" of the squared gradient, ŵ_t is a weight, and ĝ_{t−1} = dL(x, z)/dŵ, evaluated at x = x(ŵ_{t−1}, X_{t−1}) and z = z_{t−1}, with respect to all the weights
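The update rule can be illustrated for a single scalar weight; a minimal sketch minimizing a toy quadratic loss, not the experiment's actual training code:

```python
import math

def adam_step(w, m, v, grad, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar weight: moving averages of the
    gradient and its square, with bias-corrected learning rate."""
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad * grad
    lr_t = lr * math.sqrt(1 - beta2 ** t) / (1 - beta1 ** t)
    w = w - lr_t * m / (math.sqrt(v) + eps)
    return w, m, v

# Minimize L(w) = (w - 3)^2, whose gradient is dL/dw = 2*(w - 3)
w, m, v = 0.0, 0.0, 0.0
for t in range(1, 2001):
    grad = 2.0 * (w - 3.0)
    w, m, v = adam_step(w, m, v, grad, t, lr=0.05)
print(w)  # close to 3.0
```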

SLIDE 50

Data and CNN parameters

  • Two classes: 0-3 and 4-7 spectators (15.8% centrality), 80000 events
  • The best performance was obtained with the dropout rate parameter set to 0.1 (only 10% of FC neurons remain unzeroed)
  • 1 conv layer with 128 features (3x3x5)
  • 1 max pool (2x2)
  • 1 FC layer with 1024 neurons
  • Learning rate 5e-4
  • Batch size 100

SLIDE 51

Accuracy and loss

Two classes: 0-3 and 4-7 spectators (15.8% centrality), 80000 events

Accuracy (max 93.3% at epoch 53); Loss

SLIDE 52

ROC curve and comparison with other ML methods

Measuring the area under a ROC curve (AUC) is another way of quantifying accuracy.

Comparison of ROC curves for different ML methods
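The area under a ROC curve can be computed without explicitly tracing the curve, as the probability that a randomly chosen positive event outscores a randomly chosen negative one; a minimal sketch on toy data:

```python
import numpy as np

def roc_auc(scores: np.ndarray, labels: np.ndarray) -> float:
    """AUC as the probability that a random positive scores higher
    than a random negative (ties count as 1/2)."""
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (len(pos) * len(neg))

# Toy classifier scores: higher score = "more central"
scores = np.array([0.9, 0.8, 0.7, 0.4, 0.3, 0.1])
labels = np.array([1,   1,   0,   1,   0,   0  ])
print(roc_auc(scores, labels))  # 8/9 for this toy sample
```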
