oasis better simulated events to allow for fewer
play

OASIS: Better simulated events to allow for fewer simulated events - PowerPoint PPT Presentation

OASIS: Better simulated events to allow for fewer simulated events Prasanth Shyamsundar University of Florida based on [arXiv:2006.16972] OASIS: Optimal Analysis-Specifjc Importance Sampling for event generation Konstantin T.


  1. OASIS: “Better” simulated events to allow for fewer simulated events Prasanth Shyamsundar University of Florida based on [arXiv:2006.16972] “OASIS: Optimal Analysis-Specifjc Importance Sampling for event generation” Konstantin T. Matchev, Prasanth Shyamsundar LPC Physics Forum, Fermilab July 30, 2020

  2. Require fewer simulated events? Motivation ▶ Simulations in HEP are computationally expensive. • Detector simulation is the most resource intensive part of the pipeline. • Projected HL-LHC computational requirements may not be met. “Billion dollar problem” • Need to speed up the simulation pipeline. ATLAS CMS J. Albrecht et al. [HEP Software Foundation], “A Roadmap for HEP Software and Computing R&D for the 2020s,” Comput. Softw. Big Sci. 3 , no.1, 7 (2019) [arXiv:1712.06982 [physics.comp-ph]]. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 1/27 [Go to the end]

  3. Motivation ▶ Simulations in HEP are computationally expensive. • Detector simulation is the most resource intensive part of the pipeline. • Projected HL-LHC computational requirements may not be met. “Billion dollar problem” • Need to speed up the simulation pipeline. Require fewer simulated events? ATLAS CMS J. Albrecht et al. [HEP Software Foundation], “A Roadmap for HEP Software and Computing R&D for the 2020s,” Comput. Softw. Big Sci. 3 , no.1, 7 (2019) [arXiv:1712.06982 [physics.comp-ph]]. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 1/27 [Go to the end]

  4. Importance Sampling ▶ The simulation pipeline starts with the parton level hard scattering. ▶ At the parton level, we can compute the probability density of a given event. (under a given theory/set of param values) ▶ Ingredients: • Matrix element • Parton distribution functions ▶ Given an oracle for a distribution, how do we sample Image from the Sherpa Team events as per the distribution? Answer: Importance Sampling Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 2/27 [Go to the end]

  5. Importance Sampling ▶ f = distribution to sample from g = distribution we can sample from (both unnormalized) f ▶ Throw darts uniformly at random into the “box”. Unnormalized distribution g Or sample events according to g . ▶ Option 1: Unweighting • Accept the events that fall under f . Or accept event i with probability f ( x i ) / g ( x i ) . ▶ Option 2: Weighted events • Accept all events, but weight them x w i = f ( x i ) / g ( x i ) ▶ The “box” g doesn’t have to be a rectangle. Just needs to be something we can sample from. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 3/27 [Go to the end]

  6. Importance Sampling ▶ f = distribution to sample from g = distribution we can sample from (both unnormalized) f selected ▶ Throw darts uniformly at random into the “box”. Unnormalized distribution g rejected Or sample events according to g . ▶ Option 1: Unweighting • Accept the events that fall under f . Or accept event i with probability f ( x i ) / g ( x i ) . ▶ Option 2: Weighted events • Accept all events, but weight them x w i = f ( x i ) / g ( x i ) ▶ The “box” g doesn’t have to be a rectangle. Just needs to be something we can sample from. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 3/27 [Go to the end]

  7. Importance Sampling Current philosophy: Try to make g close to f Rationale 1: Unweighting effjciency... circular argument f selected Unnormalized distribution g rejected We want unweighted events ⇓ g → f / F reduces wastage (lesser fraction of events thrown out) g → f / F is ideal x ⇓ We should unweight events at the parton level before moving onto the rest of the (computationally expensive) simulation pipeline Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 4/27 [Go to the end]

  8. Importance Sampling Current philosophy: Try to make g close to f Rationale 1: Unweighting effjciency... circular argument f selected Unnormalized distribution g rejected We want unweighted events ⇓ g → f / F reduces wastage (lesser fraction of events thrown out) g → f / F is ideal x ⇓ We should unweight events at the parton level before moving onto the rest of the (computationally expensive) simulation pipeline Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 4/27 [Go to the end]

  9. Importance Sampling Current philosophy: Try to make g close to f Rationale 2: Cross-section estimation f selected dx g ( x ) f ( x ) ∫ ∫ Unnormalized distribution g rejected F ≡ dx f ( x ) = g ( x ) ( g is normalized ) = E g [ w ] N s F = 1 ⇒ ˆ ∑ w i N s i = 1 [ ˆ ] = var [ w ] ( g → f / F reduces variance ) var F x N s Estimation of F is related to counting experiments But... HEP analyses have come a long way from counting experiments! Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 4/27 [Go to the end]

  10. Weighted events = Yet unexplored degree of freedom OASIS abondons the notion that g → f / F is the best strategy ▶ Nature: • Produces unweighted events • Constrained to be distributed as per f / F ▶ Weighted simulations: • Not constrained... Sampling distribution g can be whatever we want! • OASIS exploits this freedom to an unprecedented degree ▶ Current usage examples of weighted events: — Oversampling tails: Extract the sensitivity from the tails without wasting resources on the bulk — (Also reweighting events, combining difgerent processes) ▶ Why would we want to deviate from f / F on purpose? • Focus on the regions of phase space important to the analysis. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 5/27 [Go to the end]

  11. An example: Top mass measurement A. M. Sirunyan et al. [CMS], “Measurement of the top quark mass in the dileptonic t ¯ t decay channel using the mass observables M b ℓ , M T 2 , and M b ℓ ν in pp collisions at √ s = 8 TeV,” Phys. Rev. D 96 , no.3, 032002 (2017) [arXiv:1704.06142 [hep-ex]]. ▶ Difgerent regions of the phase-space are sensitive to the value of a parameter (or presence of a signal) to difgerent extents. ▶ More simulated events → smaller theory error bars ▶ Reducing the theory error bars everywhere (maintaining the same ratios between error bars) is not the optimal strategy! Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 6/27 [Go to the end]

  12. OASIS elevator pitch Optimal Analysis-Specifjc Importance Sampling ▶ Choose the sampling distribution optimally to maximize the sensitivity of the analysis at hand, for a given computational budget. ▶ Reach the target sensitivity with fewer simulated events. ▶ Piggyback on existing importance sampling techniques. (FOAM, VEGAS, machine-learning-based, etc) ▶ Save, in computational budget, Hundreds of Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 7/27 [Go to the end]

  13. OASIS for parton level analysis ▶ To pick a good sampling distribution g , we need to understand the relationship between the sampling distribution and the sensitivity of the analysis. ▶ Let θ be a parameter we want to measure by analyzing the parton level events { x i } . Let L be the integrated luminosity. ▶ Fisher Information: ] 2 1 [ ∂ f ( x ; θ ) ∫ I ( θ ) = L dx f ( x ; θ ) ∂θ [ ˆ ] ≥ 1 θ ( Data ) ; θ 0 var I ( θ 0 ) ▶ The lower bound is achievable in the asymptotic limit by the maximum likelihood fjt or minimum- χ 2 fjt (fjne binning). Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 8/27 [Go to the end]

  14. Fisher Information for simulation based analyses ] 2 [ ∂ f ( x ; θ ) 1 ∫ I ( θ ) = L dx f ( x ; θ ) ∂θ ▶ Note that there’s no g in the expression. This is for analyses based on the functional form of f ( x ; θ ) . ▶ What about analyses based on simulations? ( N s events distributed as per g ) ] 2 [ L ∂ f ( x ; θ ) ] 2 [ L ∂ f ( x ; θ ) 1 ∫ ∂θ ∫ I ( θ ) = dx I MC ( θ ) = dx [ L L f ( x ; θ ) ∂θ ] 2 L f ( x ; θ ) + N s g ( x ) w ( x ; θ ) s 2 s 2 compare to ∑ or ∑ N s i i σ 2 n i σ 2 i , real stat → σ 2 i , real stat + σ 2 i ∈ x bins i ∈ x bins i , real stat i , sim stat “ s ” ∼ difgerence between expected counts for θ and θ + δθ Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 9/27 [Go to the end]

  15. Fisher Information for simulation based analyses ] 2 [ L ∂ f ( x ; θ ) ∂θ ∫ I MC ( θ ) = dx [ L ] 2 L f ( x ; θ ) + N s g ( x ) w ( x ) N s ] 2 [ f ( x ; θ ) ∂ θ [ ln f ( x ; θ )] ⇒ I MC ( θ ) ∫ = dx 1 + L L w ( x ; θ ) N s f ( x ) u 2 ( x ) where u ( x ) ≡ ∂ θ [ ln f ( x ; θ )] = 1 ∂ f ∫ ≡ dx 1 + L f ∂θ w ( x ) N s u ( x ) is a per-event score that captures the sensitivity of event to θ . Can be computed using the matrix element oracle. Konstantin T. Matchev, Prasanth Shyamsundar [arXiv:2006.16972] 10/27 [Go to the end]

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend