Update on Sparse CNNs for Particle ID in ProtoDUNE Carlos Sarasty - - PowerPoint PPT Presentation

update on sparse cnns for particle id in protodune
SMART_READER_LITE
LIVE PREVIEW

Update on Sparse CNNs for Particle ID in ProtoDUNE Carlos Sarasty - - PowerPoint PPT Presentation

Update on Sparse CNNs for Particle ID in ProtoDUNE Carlos Sarasty Segura 1st April 2020 DRA meeting Outline Definition of the ground truth Training using 2D samples Training using 3D samples Summary 2 Semantic Segmentation


slide-1
SLIDE 1

Update on Sparse CNNs for Particle ID in ProtoDUNE

Carlos Sarasty Segura 1st April 2020 DRA meeting

slide-2
SLIDE 2

Outline

  • Definition of the ground truth
  • Training using 2D samples
  • Training using 3D samples
  • Summary

2

slide-3
SLIDE 3

Semantic Segmentation

  • Goal: Apply sparse CNNs for the task of semantic

segmentation at a pixel level in ProtoDUNE

3

Wire Id Time tick

slide-4
SLIDE 4

Ground Truth - First Version

  • Classify each pixel into 7 different classes for supervised learning
  • MIP → Two classes → muons & pions
  • HIP → Protons, kaons & nuclei
  • Showers → Induced by electromagnetic particles such as e-and

e+

  • Michel electrons → From the decay of muons
  • Electromagnetic activity → Electrons from hard scattering, and

low energy e-

  • Neutrons
  • Record the fraction of energy deposited by each class per pixel
  • https://indico.fnal.gov/event/20144/session/17/contribution/93/material/slides/0.pdf

4

slide-5
SLIDE 5

Supervised Learning

  • The dataset consist of about 100.000 2D image samples of up

to 6000 px split into 95% and 5% for train and test respectively

  • 1 feature → Integrated cargue

5

slide-6
SLIDE 6

Event Display Example

6

True Predicted

Wire Id Time tick

slide-7
SLIDE 7

Muon-Pion Separation

7

ROC curve

slide-8
SLIDE 8

Moving to 3D

  • Ground truth:
  • Modify the ground truth definition to separate kaons from the hip

class.

  • Merge neutrons & EM activity into 1 class
  • Features:
  • Increase the number of features from 1 to 7 (3 coordinates per hit,

integrated charge per plane per voxel, number of hits per voxel)

  • Issues:
  • Low statistics for the kaon class → only 5% of files contain kaons

8

slide-9
SLIDE 9

Supervised Learning

  • The dataset with kaons consist of 3943 3D images split

into 95% and 5% for train and test

9

slide-10
SLIDE 10

Muon-Pion Separation

10

ROC curve

slide-11
SLIDE 11

Second case

  • Ground truth:
  • Merge kaons back in with protons into hip class
  • Dataset: Consist of 70k 3D images

11

slide-12
SLIDE 12

Muon-Pion Separation

12

ROC curve

slide-13
SLIDE 13

Event display - True

13

slide-14
SLIDE 14

Event display - Predicted

14

slide-15
SLIDE 15

TO DO:

  • Modify the ground truth:
  • Include Delta rays as a separate class.
  • Separate electron and photon showers
  • Retrain and test the model for electron and photon

separation.

15

slide-16
SLIDE 16

Summary

  • We have trained the network using different definitions
  • f the ground truth and different datasets
  • The performance of the network using 3D samples is

significantly better than the 2D case

  • A training using kaons as a separate class can be

possible with a bigger dataset

  • Comments and suggestions are more than welcome
  • Thanks! :)

16

slide-17
SLIDE 17

Backup slides

17

slide-18
SLIDE 18

Ground Truth

  • The first approach to distinguish the different classes of particles is

based on the pdg and track Id information

  • Geant4 also provides valuable information of the physical

process of a simulated particle and its parent. This information is useful to characterize Michel electrons

  • Non-primary electron
  • Electron’s parent is a muon
  • Same with positrons.
  • Neutrons
  • Check the process → n-capture , neutron Inelastic scattering

18

slide-19
SLIDE 19

Ground Truth

  • EM showers and EM activity
  • In the MC Truth the information of secondary and tertiary

particles from showers is thrown away → shower daughters are tagged with the negative track ID of the parent particle

  • Identify all particles that belong to the same track ID
  • Set a threshold in the number of hits → nhits > 10 ~ 5cm
  • Any other e+/- will be labeled as EM activity

19

slide-20
SLIDE 20

Ground Truth

20

Drift Volume 1

slide-21
SLIDE 21

Ground Truth

21

Event display - True