OASIS: Better simulated events to allow for fewer simulated events - PowerPoint PPT Presentation

OASIS: “Better” simulated events to allow for fewer simulated events Prasanth Shyamsundar University of Florida based on [arXiv:2006.16972] “OASIS: Optimal Analysis-Specifjc Importance Sampling for event generation” Konstantin T. Matchev, Prasanth Shyamsundar LPC Physics Forum, Fermilab July 30, 2020

Require fewer simulated events? Motivation ▶ Simulations in HEP are computationally expensive. • Detector simulation is the most resource intensive part of the pipeline. • Projected HL-LHC computational requirements may not be met. “Billion dollar problem” • Need to speed up the simulation pipeline. ATLAS CMS J. Albrecht et al. [HEP Software Foundation], “A Roadmap for HEP Software and Computing R&D for the 2020s,” Comput. Softw. Big Sci. 3 , no.1, 7 (2019) [arXiv:1712.06982 [physics.comp-ph]]. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 1/27 [Go to the end]

Motivation ▶ Simulations in HEP are computationally expensive. • Detector simulation is the most resource intensive part of the pipeline. • Projected HL-LHC computational requirements may not be met. “Billion dollar problem” • Need to speed up the simulation pipeline. Require fewer simulated events? ATLAS CMS J. Albrecht et al. [HEP Software Foundation], “A Roadmap for HEP Software and Computing R&D for the 2020s,” Comput. Softw. Big Sci. 3 , no.1, 7 (2019) [arXiv:1712.06982 [physics.comp-ph]]. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 1/27 [Go to the end]

Importance Sampling ▶ The simulation pipeline starts with the parton level hard scattering. ▶ At the parton level, we can compute the probability density of a given event. (under a given theory/set of param values) ▶ Ingredients: • Matrix element • Parton distribution functions ▶ Given an oracle for a distribution, how do we sample Image from the Sherpa Team events as per the distribution? Answer: Importance Sampling Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 2/27 [Go to the end]

Importance Sampling ▶ f = distribution to sample from g = distribution we can sample from (both unnormalized) f ▶ Throw darts uniformly at random into the “box”. Unnormalized distribution g Or sample events according to g . ▶ Option 1: Unweighting • Accept the events that fall under f . Or accept event i with probability f ( x i ) / g ( x i ) . ▶ Option 2: Weighted events • Accept all events, but weight them x w i = f ( x i ) / g ( x i ) ▶ The “box” g doesn’t have to be a rectangle. Just needs to be something we can sample from. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 3/27 [Go to the end]

Importance Sampling ▶ f = distribution to sample from g = distribution we can sample from (both unnormalized) f selected ▶ Throw darts uniformly at random into the “box”. Unnormalized distribution g rejected Or sample events according to g . ▶ Option 1: Unweighting • Accept the events that fall under f . Or accept event i with probability f ( x i ) / g ( x i ) . ▶ Option 2: Weighted events • Accept all events, but weight them x w i = f ( x i ) / g ( x i ) ▶ The “box” g doesn’t have to be a rectangle. Just needs to be something we can sample from. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 3/27 [Go to the end]

Importance Sampling Current philosophy: Try to make g close to f Rationale 1: Unweighting effjciency... circular argument f selected Unnormalized distribution g rejected We want unweighted events ⇓ g → f / F reduces wastage (lesser fraction of events thrown out) g → f / F is ideal x ⇓ We should unweight events at the parton level before moving onto the rest of the (computationally expensive) simulation pipeline Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 4/27 [Go to the end]

Importance Sampling Current philosophy: Try to make g close to f Rationale 2: Cross-section estimation f selected dx g ( x ) f ( x ) ∫ ∫ Unnormalized distribution g rejected F ≡ dx f ( x ) = g ( x ) ( g is normalized ) = E g [ w ] N s F = 1 ⇒ ˆ ∑ w i N s i = 1 [ ˆ ] = var [ w ] ( g → f / F reduces variance ) var F x N s Estimation of F is related to counting experiments But... HEP analyses have come a long way from counting experiments! Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 4/27 [Go to the end]

Weighted events = Yet unexplored degree of freedom OASIS abondons the notion that g → f / F is the best strategy ▶ Nature: • Produces unweighted events • Constrained to be distributed as per f / F ▶ Weighted simulations: • Not constrained... Sampling distribution g can be whatever we want! • OASIS exploits this freedom to an unprecedented degree ▶ Current usage examples of weighted events: — Oversampling tails: Extract the sensitivity from the tails without wasting resources on the bulk — (Also reweighting events, combining difgerent processes) ▶ Why would we want to deviate from f / F on purpose? • Focus on the regions of phase space important to the analysis. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 5/27 [Go to the end]

An example: Top mass measurement A. M. Sirunyan et al. [CMS], “Measurement of the top quark mass in the dileptonic t ¯ t decay channel using the mass observables M b ℓ , M T 2 , and M b ℓ ν in pp collisions at √ s = 8 TeV,” Phys. Rev. D 96 , no.3, 032002 (2017) [arXiv:1704.06142 [hep-ex]]. ▶ Difgerent regions of the phase-space are sensitive to the value of a parameter (or presence of a signal) to difgerent extents. ▶ More simulated events → smaller theory error bars ▶ Reducing the theory error bars everywhere (maintaining the same ratios between error bars) is not the optimal strategy! Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 6/27 [Go to the end]

OASIS elevator pitch Optimal Analysis-Specifjc Importance Sampling ▶ Choose the sampling distribution optimally to maximize the sensitivity of the analysis at hand, for a given computational budget. ▶ Reach the target sensitivity with fewer simulated events. ▶ Piggyback on existing importance sampling techniques. (FOAM, VEGAS, machine-learning-based, etc) ▶ Save, in computational budget, Hundreds of Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 7/27 [Go to the end]

OASIS for parton level analysis ▶ To pick a good sampling distribution g , we need to understand the relationship between the sampling distribution and the sensitivity of the analysis. ▶ Let θ be a parameter we want to measure by analyzing the parton level events { x i } . Let L be the integrated luminosity. ▶ Fisher Information: ] 2 1 [ ∂ f ( x ; θ ) ∫ I ( θ ) = L dx f ( x ; θ ) ∂θ [ ˆ ] ≥ 1 θ ( Data ) ; θ 0 var I ( θ 0 ) ▶ The lower bound is achievable in the asymptotic limit by the maximum likelihood fjt or minimum- χ 2 fjt (fjne binning). Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 8/27 [Go to the end]

Fisher Information for simulation based analyses ] 2 [ ∂ f ( x ; θ ) 1 ∫ I ( θ ) = L dx f ( x ; θ ) ∂θ ▶ Note that there’s no g in the expression. This is for analyses based on the functional form of f ( x ; θ ) . ▶ What about analyses based on simulations? ( N s events distributed as per g ) ] 2 [ L ∂ f ( x ; θ ) ] 2 [ L ∂ f ( x ; θ ) 1 ∫ ∂θ ∫ I ( θ ) = dx I MC ( θ ) = dx [ L L f ( x ; θ ) ∂θ ] 2 L f ( x ; θ ) + N s g ( x ) w ( x ; θ ) s 2 s 2 compare to ∑ or ∑ N s i i σ 2 n i σ 2 i , real stat → σ 2 i , real stat + σ 2 i ∈ x bins i ∈ x bins i , real stat i , sim stat “ s ” ∼ difgerence between expected counts for θ and θ + δθ Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 9/27 [Go to the end]

Fisher Information for simulation based analyses ] 2 [ L ∂ f ( x ; θ ) ∂θ ∫ I MC ( θ ) = dx [ L ] 2 L f ( x ; θ ) + N s g ( x ) w ( x ) N s ] 2 [ f ( x ; θ ) ∂ θ [ ln f ( x ; θ )] ⇒ I MC ( θ ) ∫ = dx 1 + L L w ( x ; θ ) N s f ( x ) u 2 ( x ) where u ( x ) ≡ ∂ θ [ ln f ( x ; θ )] = 1 ∂ f ∫ ≡ dx 1 + L f ∂θ w ( x ) N s u ( x ) is a per-event score that captures the sensitivity of event to θ . Can be computed using the matrix element oracle. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 10/27 [Go to the end]

OASIS: Better simulated events to allow for fewer simulated events - PowerPoint PPT Presentation

OASIS: Better simulated events to allow for fewer simulated events Prasanth Shyamsundar University of Florida based on [arXiv:2006.16972] OASIS: Optimal Analysis-Specifjc Importance Sampling for event generation Konstantin T.

OASIS SB Program Open Season On ramp Overview How to get on OASIS OASIS Program Management Office

Simulated Annealing Simulated annealing is a probabilistic search algorithm. The

Simulated Annealing G5BAIM: Artificial Intelligence Methods Graham Kendall 15 Feb 09 1

Outline Convergence DM812 METAHEURISTICS Lecture 2 1. Simulated Annealing Simulated Annealing

OASIS Standards for OASIS Standards for eGovernment Integration eGovernment Integration Brian

Federal Acquisition Service OASIS Industry Day OASIS Program Executive Office General Supplies

Customer Webinar / OASIS Application Updates Neela Alla & Prince Philip IT Product Management

Federal Acquisition Service OASIS SB Industry Day OASIS Program Executive Office General

Network Integration Transmission Service on OASIS (NITS on OASIS) Project 2016 NETWORK OPERATING

UBL Update Jon Bosak Sun Microsystems http:// oasis- open.org / OASIS Symposium on the

Organization for the Organization for the Advancement of Structured Advancement of Structured

P1 Orientation Programme 28 December 2015 Staff of Oasis Primary School Oasis Primary School

ROCKBOX FABRIQ EDITION ITS TIME FOR FOR BETTER SOUND. BETTER DESIGN. BETTER SPECS.

1/6/2017 1 1/6/2017 Commissioning Benefits Fewer Change Orders Fewer corrective actions

Traditional news media: fewer readers lower ad revenue fewer resources less original

Distributed coloring in sparse graphs with fewer colors Marthe Bonamy with Pierre Aboulker,

Preserving the Samoan Language and Culture Tavaesina Trust Board Board of Trustees Fetu Taiala

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games Colin McMillen and Manuela Veloso

The Moroccan Perspective Main Achievements & Challenges Dr. Khammar MRABIT, Director General

5 th January 2017 Purpose & Structure of the Meeting Introduce key people

Business meeting PROFESSOR ANITA BERLIN LEAD COMMUNITY BASED MEDICAL EDUCATION 1 Creating a

Facilitating Commencing Student Success across the Lifecycle: Strategic Student Orientation

UGME Accreditation Boot Camp Part 2 Tu esd ay, A p r il 2 , 2 0 1 9 Victo r ia I n n Ho tel an

assessing & enhancing equity 9 th Central Oregon PBIS Conference Eoin Bastable

OASIS: Better simulated events to allow for fewer simulated events - PowerPoint PPT Presentation

OASIS: Better simulated events to allow for fewer simulated events Prasanth Shyamsundar University of Florida based on [arXiv:2006.16972] OASIS: Optimal Analysis-Specifjc Importance Sampling for event generation Konstantin T.

OASIS SB Program Open Season On ramp Overview How to get on OASIS OASIS Program Management Office

Simulated Annealing Simulated annealing is a probabilistic search algorithm. The

Simulated Annealing G5BAIM: Artificial Intelligence Methods Graham Kendall 15 Feb 09 1

Outline Convergence DM812 METAHEURISTICS Lecture 2 1. Simulated Annealing Simulated Annealing

OASIS Standards for OASIS Standards for eGovernment Integration eGovernment Integration Brian

Federal Acquisition Service OASIS Industry Day OASIS Program Executive Office General Supplies

Customer Webinar / OASIS Application Updates Neela Alla &amp; Prince Philip IT Product Management

Federal Acquisition Service OASIS SB Industry Day OASIS Program Executive Office General

Network Integration Transmission Service on OASIS (NITS on OASIS) Project 2016 NETWORK OPERATING

UBL Update Jon Bosak Sun Microsystems http:// oasis- open.org / OASIS Symposium on the

Organization for the Organization for the Advancement of Structured Advancement of Structured

P1 Orientation Programme 28 December 2015 Staff of Oasis Primary School Oasis Primary School

ROCKBOX FABRIQ EDITION ITS TIME FOR FOR BETTER SOUND. BETTER DESIGN. BETTER SPECS.

1/6/2017 1 1/6/2017 Commissioning Benefits Fewer Change Orders Fewer corrective actions

Traditional news media: fewer readers lower ad revenue fewer resources less original

Distributed coloring in sparse graphs with fewer colors Marthe Bonamy with Pierre Aboulker,

Preserving the Samoan Language and Culture Tavaesina Trust Board Board of Trustees Fetu Taiala

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games Colin McMillen and Manuela Veloso

The Moroccan Perspective Main Achievements &amp; Challenges Dr. Khammar MRABIT, Director General

5 th January 2017 Purpose &amp; Structure of the Meeting Introduce key people

Business meeting PROFESSOR ANITA BERLIN LEAD COMMUNITY BASED MEDICAL EDUCATION 1 Creating a

Facilitating Commencing Student Success across the Lifecycle: Strategic Student Orientation

UGME Accreditation Boot Camp Part 2 Tu esd ay, A p r il 2 , 2 0 1 9 Victo r ia I n n Ho tel an

assessing &amp; enhancing equity 9 th Central Oregon PBIS Conference Eoin Bastable

Customer Webinar / OASIS Application Updates Neela Alla & Prince Philip IT Product Management

The Moroccan Perspective Main Achievements & Challenges Dr. Khammar MRABIT, Director General

5 th January 2017 Purpose & Structure of the Meeting Introduce key people

assessing & enhancing equity 9 th Central Oregon PBIS Conference Eoin Bastable