modeling detector digitization and read out with
play

Modeling detector digitization and read-out with adversarial - PowerPoint PPT Presentation

Modeling detector digitization and read-out with adversarial networks ACAT, Seattle, 2017-08-21 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 1 Yandex, NRU Higher School of Economics 2 University of California Irvine 3


  1. Modeling detector digitization and read-out with adversarial networks ACAT, Seattle, 2017-08-21 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 1 Yandex, NRU Higher School of Economics 2 University of California Irvine 3 Yale University

  2. Illustrations from the book ”We have no idea” by D. Whiteson, J. Cham.

  3. Cosmic RAYs Found In Smartphones Experiment › mobile phone’s camera as cosmic rays Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 CORSIKA simulation, by J. Oehlschläger and R. Engel. Illustration of an intensive air shower produced by iron ion, 1 PeV, air shower detector. › cluster of mobile phones as intensive detector; › distributed world-wide observatory; CRAYFIS experiment proposes usage (UHECR): Rays Cosmic Energy Ultra-High of private mobile phones for observing 3 › high energies: > 10 18 eV;

  4. CRAYFIS in a nutshell Illustrations from the book ”We have no idea” by D. Whiteson, J. Cham. Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 4

  5. Challenges Physics: › low signal event rate is expected: › background cosmic rays ( ≈ 10 13 eV): > 1000 per second per 𝑙𝑛 2 ; › UHECR ( ≈ 10 18 eV): less than once per year per 𝑙𝑛 2 ; › an intensive air shower from UHECR occurs in less than microseconds; Data processing: › Getting realistic muon track images: › how muons interact with smartphone cameras (no ground-truth)? › Tracking muons using smart phones: › shortage of computational power and storage space (mobile phones); › high frame rate processing is required ( ∼ 10 Hz); › limited throughput for selected images (end-user Wi-Fi < 1 Mbit/s); Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 5

  6. Getting realistic images of muons (Parti-GAN)

  7. The problem A simulation with simplified geometry and without readout process is relatively simple (GEANT). But there is no CMOS sensors details, precise enough for reliable simulation of particle-sensor interaction: › various type of sensors; › impossible to tune for every phone; › muons are difficult to find & to prove. Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 7

  8. Let’s see if GANs can solve it › simulated by GEANT images of energy deposition; › real images from radioactive source. no labels; › simple toy model adjusted to real data; › approximate ratio of signal/background: 0.001 . › Classical GAN - doesn’t deal with images as input and converges poorly; › Cycle-GAN - should help finding 1:1 mapping, but does not solve all problems alone; › Energy-Based - should help convergence, but still doesn’t work. What if we add physics-based insights into the training of the generator? Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 8 › Dataset:

  9. Overwhelming amount of noise Trick 1: importance sampling (real batch) › increase number of events with higher signal probability in the batch reduce variance of the gradient; › events are reweighted to keep original signal/noise ratio: 𝑗 𝑥 𝑗 𝑚 real (𝑧 𝑗 ) where: › we can use image brightness of the image as a proxy for 𝑞 𝑗 . Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 9 ℒ real = 1 𝑜 ∑ › 𝑥 𝑗 ∼ 1 𝑞 𝑗 - weight to compensate for change in sample distribution; › 𝑞 𝑗 - sampling probability for 𝑗 -th sample;

  10. Structure of a batch 8× 32× 4× 4× 4× original 𝜇 = 0.2 𝜇 = 0.5 𝜇 = 1.5

  11. Helping GAN to learn the signal 𝑘=1 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 - redistribution term. 𝑙! 𝜇 𝑙 𝑓 −𝜇 𝑛 𝑙 1 𝑘 )) 𝑚 pseudo (𝐻(𝑦 𝑙 ∑ 𝑛 𝑙 𝑥 𝑙 𝑙 › apply reweighting to redistribute number of events to match Poisson(𝜇) : Trick 2: introduce 𝜇 as a GAN parameter to optimize; › signal event rate 𝜇 in real observations is not known exactly; interaction); 11 › simulation does not account for signal event rate (just provides examples of › generate 𝑛 𝑙 samples with 𝑙 events up to a large 𝑙 ; ℒ pseudo = ∑ where: 𝑥 𝑙 =

  12. Technical details. Overview › Cycle-GAN to ensure bijection mapping between GEANT and generated samples; › Batch reweighting to ensure convergence: › importance sampling for real images (decrease variance of gradient estimations); › physics process parameter 𝜇 for generated samples. Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 12 › Energy-Based GAN for [fast] convergence;

  13. Constructing generator. Assumptions › various kinds of noise; › particle-sensor interaction; › Track and noise energies can be added to each other: › simulation results can be added to various noise; › The simulated process is local: › restricted perception field of generator ( 3 × 3 ); › CMOS pixel brightness is of functional dependency on pixel energy deposit (adjacent pixels). Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 13 › Observed samples are result of two independent ’processes’:

  14. Generator structure

  15. Cycle-GAN architecture (Parti-GAN is the same)

  16. Parti-GAN Loss function 𝑀 𝑌 Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 › 𝑍 are weighted by importance sampling coefficients. › 𝑌 are weighted by physical sampling coefficients; 𝐷 , 𝑀 𝑍 › 𝑀 𝑌 𝐸 , 𝑀 𝑍 › 𝑀 𝑌 𝐻(𝑌)) ; ̃ 𝐻(𝐻(𝑌)) , 𝑍 ′′ ∶ 𝐻( ̃ › 𝑌′′ ∶ 𝐻(𝑍 ) in the batch; ̃ › 𝑍 ′ - ’GEANT’ images generated from real images 𝑍 , › 𝑌′ - ’real’ images generated from 𝑌 , 𝐻(𝑌) in the batch; › 𝑌 - bunch of GEANT images, 𝑍 - bunch of real images; 𝐷 (𝑍 , 𝑍 ′′) 𝐷 (𝑌, 𝑌′′) + 𝑀 𝑍 𝐸 (𝑍 , 𝑍 ′) + 𝑀 𝑌 𝐸 (𝑌, 𝑌′) + 𝑀 𝑍 16 𝐸 - EBGAN loss functions; 𝐷 - cycle loss functions (MSE);

  17. Parti-GAN results

  18. Results cross-check: pixel intensity

  19. Summary Parti-GAN learns actual physical process ( 𝜇 ) + preprocessing algorithm: › corrections on electron drift; › thermal noise, readout noise; › physical/hardware readout systems. Parti-GAN matches simulation to real observations (”unpaired image translation”) Parti-GAN is based on CycleGAN with EBGAN loss function + importance sampling It can generate realistic images of muon tracks for any phone model! Maxim Borisyak 1 , Chase Shimmin 3 , Andy Nelson 2 , Andrey Ustyuzhanin 1 19

  20. Thank you for attention!

  21. Backup

  22. CRAYFIS collaboration › Brandon Buonacarsi; › Danielle Cranmer; Unaffiliated: › Jianrong Deng; Observatory of China: National Astronomical › Mikhail Usvyatsov; › Maxim Borisyak; › Andrey Ustyuzhanin; Analysis: Yandex School of Data › Kyle Cranmer; New-York University: › Dustin Burns; University California Irvine: › Michael Mulhearn; University of California Davis: › Chase Shimmin; Yale University: › Jeff Swaney; › Zichao Ziliver Yuan; › Emma He; › Eric Albin; › Rob Porter; › Kyle Brodie; › Jay Karimi; › Homer Strong; › Daniel Whiteson; › Jodi Goddard.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend