Modeling detector digitization and read-out with adversarial - PowerPoint PPT Presentation

Modeling detector digitization and read-out with adversarial networks ACAT, Seattle, 2017-08-21 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 1 Yandex, NRU Higher School of Economics 2 University of California Irvine 3 Yale University

Illustrations from the book ”We have no idea” by D. Whiteson, J. Cham.

Cosmic RAYs Found In Smartphones Experiment › mobile phone’s camera as cosmic rays Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 CORSIKA simulation, by J. Oehlschläger and R. Engel. Illustration of an intensive air shower produced by iron ion, 1 PeV, air shower detector. › cluster of mobile phones as intensive detector; › distributed world-wide observatory; CRAYFIS experiment proposes usage (UHECR): Rays Cosmic Energy Ultra-High of private mobile phones for observing 3 › high energies: > 10 18 eV;

CRAYFIS in a nutshell Illustrations from the book ”We have no idea” by D. Whiteson, J. Cham. Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 4

Challenges Physics: › low signal event rate is expected: › background cosmic rays ( ≈ 10 13 eV): > 1000 per second per 𝑙𝑛 2 ; › UHECR ( ≈ 10 18 eV): less than once per year per 𝑙𝑛 2 ; › an intensive air shower from UHECR occurs in less than microseconds; Data processing: › Getting realistic muon track images: › how muons interact with smartphone cameras (no ground-truth)? › Tracking muons using smart phones: › shortage of computational power and storage space (mobile phones); › high frame rate processing is required ( ∼ 10 Hz); › limited throughput for selected images (end-user Wi-Fi < 1 Mbit/s); Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 5

Getting realistic images of muons (Parti-GAN)

The problem A simulation with simplified geometry and without readout process is relatively simple (GEANT). But there is no CMOS sensors details, precise enough for reliable simulation of particle-sensor interaction: › various type of sensors; › impossible to tune for every phone; › muons are difficult to find & to prove. Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 7

Let’s see if GANs can solve it › simulated by GEANT images of energy deposition; › real images from radioactive source. no labels; › simple toy model adjusted to real data; › approximate ratio of signal/background: 0.001 . › Classical GAN - doesn’t deal with images as input and converges poorly; › Cycle-GAN - should help finding 1:1 mapping, but does not solve all problems alone; › Energy-Based - should help convergence, but still doesn’t work. What if we add physics-based insights into the training of the generator? Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 8 › Dataset:

Overwhelming amount of noise Trick 1: importance sampling (real batch) › increase number of events with higher signal probability in the batch reduce variance of the gradient; › events are reweighted to keep original signal/noise ratio: 𝑗 𝑥 𝑗 𝑚 real (𝑧 𝑗 ) where: › we can use image brightness of the image as a proxy for 𝑞 𝑗 . Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 9 ℒ real = 1 𝑜 ∑ › 𝑥 𝑗 ∼ 1 𝑞 𝑗 - weight to compensate for change in sample distribution; › 𝑞 𝑗 - sampling probability for 𝑗 -th sample;

Structure of a batch 8× 32× 4× 4× 4× original 𝜇 = 0.2 𝜇 = 0.5 𝜇 = 1.5

Helping GAN to learn the signal 𝑘=1 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 - redistribution term. 𝑙! 𝜇 𝑙 𝑓 −𝜇 𝑛 𝑙 1 𝑘 )) 𝑚 pseudo (𝐻(𝑦 𝑙 ∑ 𝑛 𝑙 𝑥 𝑙 𝑙 › apply reweighting to redistribute number of events to match Poisson(𝜇) : Trick 2: introduce 𝜇 as a GAN parameter to optimize; › signal event rate 𝜇 in real observations is not known exactly; interaction); 11 › simulation does not account for signal event rate (just provides examples of › generate 𝑛 𝑙 samples with 𝑙 events up to a large 𝑙 ; ℒ pseudo = ∑ where: 𝑥 𝑙 =

Technical details. Overview › Cycle-GAN to ensure bijection mapping between GEANT and generated samples; › Batch reweighting to ensure convergence: › importance sampling for real images (decrease variance of gradient estimations); › physics process parameter 𝜇 for generated samples. Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 12 › Energy-Based GAN for [fast] convergence;

Constructing generator. Assumptions › various kinds of noise; › particle-sensor interaction; › Track and noise energies can be added to each other: › simulation results can be added to various noise; › The simulated process is local: › restricted perception field of generator ( 3 × 3 ); › CMOS pixel brightness is of functional dependency on pixel energy deposit (adjacent pixels). Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 13 › Observed samples are result of two independent ’processes’:

Generator structure

Cycle-GAN architecture (Parti-GAN is the same)

Parti-GAN Loss function 𝑀 𝑌 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 › 𝑍 are weighted by importance sampling coefficients. › 𝑌 are weighted by physical sampling coefficients; 𝐷 , 𝑀 𝑍 › 𝑀 𝑌 𝐸 , 𝑀 𝑍 › 𝑀 𝑌 𝐻(𝑌)) ; ̃ 𝐻(𝐻(𝑌)) , 𝑍 ′′ ∶ 𝐻( ̃ › 𝑌′′ ∶ 𝐻(𝑍 ) in the batch; ̃ › 𝑍 ′ - ’GEANT’ images generated from real images 𝑍 , › 𝑌′ - ’real’ images generated from 𝑌 , 𝐻(𝑌) in the batch; › 𝑌 - bunch of GEANT images, 𝑍 - bunch of real images; 𝐷 (𝑍 , 𝑍 ′′) 𝐷 (𝑌, 𝑌′′) + 𝑀 𝑍 𝐸 (𝑍 , 𝑍 ′) + 𝑀 𝑌 𝐸 (𝑌, 𝑌′) + 𝑀 𝑍 16 𝐸 - EBGAN loss functions; 𝐷 - cycle loss functions (MSE);

Parti-GAN results

Results cross-check: pixel intensity

Summary Parti-GAN learns actual physical process ( 𝜇 ) + preprocessing algorithm: › corrections on electron drift; › thermal noise, readout noise; › physical/hardware readout systems. Parti-GAN matches simulation to real observations (”unpaired image translation”) Parti-GAN is based on CycleGAN with EBGAN loss function + importance sampling It can generate realistic images of muon tracks for any phone model! Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 19

Thank you for attention!

Backup

CRAYFIS collaboration › Brandon Buonacarsi; › Danielle Cranmer; Unaffiliated: › Jianrong Deng; Observatory of China: National Astronomical › Mikhail Usvyatsov; › Maxim Borisyak; › Andrey Ustyuzhanin; Analysis: Yandex School of Data › Kyle Cranmer; New-York University: › Dustin Burns; University California Irvine: › Michael Mulhearn; University of California Davis: › Chase Shimmin; Yale University: › Jeff Swaney; › Zichao Ziliver Yuan; › Emma He; › Eric Albin; › Rob Porter; › Kyle Brodie; › Jay Karimi; › Homer Strong; › Daniel Whiteson; › Jodi Goddard.

Modeling detector digitization and read-out with adversarial - PowerPoint PPT Presentation

Modeling detector digitization and read-out with adversarial networks ACAT, Seattle, 2017-08-21 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 1 Yandex, NRU Higher School of Economics 2 University of California Irvine 3

APFM Conference Lisbon 19. Oktober 2018 DFMs digitization Project 1. How does digitization

DIGITIZATION POSSIBLE Investor Relations Presentation 3rd quarter 2018 We make digitization

Image Analysis for cultural Heritage Digitization INTRODUCTION Naomi Langer - Digitization

Strategic Plan for Detector R&D at Fermilab Petra Merkel Fermilab Detector R&D

Km3 neutrino detector workshop January 23 20.30-22.00 KM3 Cerenkov neutrino detector, the

A Medium Size Detector for the ILC ... what used to be the TESLA or LD detector concept Ties

DUNE Optical Detector Digitization and Backtracking Update. Jason Stock & Juergen

Mass Digitization on Demand Automation and Terrible Metadata We Digitize for Remote Requests

CSI: Miami Forensic Library Digitization Project Marketing Plan Presentation Product Development

The impact of digitization. The case of Greece. Soft skills our compass to future Skills

Front End Digitization of tRPCs in the ATLAS Muon Spectrometer Phase-I Upgrade X.T. Meng 1,2 1

DOCUMENT DIGITIZATION Rethinking it with Machine Learning Nischal Harohalli Padmanabha QConAI

Commission: Out of touch, out of date, out of pocket April 2017 Commission: Out of touch, out of

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

Virtual Reality Modeling Virtual Reality Modeling from http://www.okino.com/ Modeling Modeling

Kindergarten Reading Getting Ready for Kindergarten Oregon Trail School District Read, Read

An Overview of the b-Tagging Algorithms in the CMS Offline Software Christophe Saout CERN,

Section 3.2: Recursively Defined Functions and Procedures Function: Has inputs (arguments,

Introduction to the LLVM Compiler System Chris Lattner llvm.org Architect November 4, 2008

653 ELECTRONIC SYSYTEMS WING OVERVIEW I n t e g r i t y - S e r v i c e - E x c e l l e n c e

Full Compressed Affix Tree Representations L.I.R.M.M. Universit e de Montpellier Institut

Persistent Homology in Text Mining ACAT Meeting, Bremen Hubert Wagner (Jagiellonian University)

formatting text with face gestures Alice Strunkmann-Meister, Rodrigo Blsquez Interaction

96 Chapter 6 P olymorphism The is one of the most p o w erful mec hanisms pro

Sambuz

Useful Links

Newsletter

Mail Us

Modeling detector digitization and read-out with adversarial - PowerPoint PPT Presentation

Modeling detector digitization and read-out with adversarial networks ACAT, Seattle, 2017-08-21 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 1 Yandex, NRU Higher School of Economics 2 University of California Irvine 3

APFM Conference Lisbon 19. Oktober 2018 DFMs digitization Project 1. How does digitization

DIGITIZATION POSSIBLE Investor Relations Presentation 3rd quarter 2018 We make digitization

Image Analysis for cultural Heritage Digitization INTRODUCTION Naomi Langer - Digitization

Strategic Plan for Detector R&amp;D at Fermilab Petra Merkel Fermilab Detector R&amp;D

Km3 neutrino detector workshop January 23 20.30-22.00 KM3 Cerenkov neutrino detector, the

A Medium Size Detector for the ILC ... what used to be the TESLA or LD detector concept Ties

DUNE Optical Detector Digitization and Backtracking Update. Jason Stock &amp; Juergen

Mass Digitization on Demand Automation and Terrible Metadata We Digitize for Remote Requests

CSI: Miami Forensic Library Digitization Project Marketing Plan Presentation Product Development

The impact of digitization. The case of Greece. Soft skills our compass to future Skills

Front End Digitization of tRPCs in the ATLAS Muon Spectrometer Phase-I Upgrade X.T. Meng 1,2 1

DOCUMENT DIGITIZATION Rethinking it with Machine Learning Nischal Harohalli Padmanabha QConAI

Commission: Out of touch, out of date, out of pocket April 2017 Commission: Out of touch, out of

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

Virtual Reality Modeling Virtual Reality Modeling from http://www.okino.com/ Modeling Modeling

Kindergarten Reading Getting Ready for Kindergarten Oregon Trail School District Read, Read

An Overview of the b-Tagging Algorithms in the CMS Offline Software Christophe Saout CERN,

Section 3.2: Recursively Defined Functions and Procedures Function: Has inputs (arguments,

Introduction to the LLVM Compiler System Chris Lattner llvm.org Architect November 4, 2008

653 ELECTRONIC SYSYTEMS WING OVERVIEW I n t e g r i t y - S e r v i c e - E x c e l l e n c e

Full Compressed Affix Tree Representations L.I.R.M.M. Universit e de Montpellier Institut

Persistent Homology in Text Mining ACAT Meeting, Bremen Hubert Wagner (Jagiellonian University)

formatting text with face gestures Alice Strunkmann-Meister, Rodrigo Blsquez Interaction

96 Chapter 6 P olymorphism The is one of the most p o w erful mec hanisms pro

Sambuz

Useful Links

Newsletter

Mail Us

Strategic Plan for Detector R&D at Fermilab Petra Merkel Fermilab Detector R&D

DUNE Optical Detector Digitization and Backtracking Update. Jason Stock & Juergen