cwola hunting extending the bump hunt with machine
play

CWoLa Hunting: Extending the Bump Hunt with Machine Learning Based - PowerPoint PPT Presentation

CWoLa Hunting: Extending the Bump Hunt with Machine Learning Based on: Phys. Rev. Lett. 121, 241803 (2018) [1805.02664] Jack Collins, Kiel Howe, Ben Nachman 1 CWoLa Hunting Outline 1) Machine Learning 2) Model Unspecific Searches 3) CWoLa


  1. CWoLa Hunting: Extending the Bump Hunt with Machine Learning Based on: Phys. Rev. Lett. 121, 241803 (2018) [1805.02664] Jack Collins, Kiel Howe, Ben Nachman 1 CWoLa Hunting

  2. Outline 1) Machine Learning 2) Model Unspecific Searches 3) CWoLa Hunting 2 CWoLa Hunting

  3. Machine Learning Classification Classification Regression https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python- totally-from-a-beginners-perspective-be8dbaf22dd8 Generation 3 CWoLa Hunting

  4. Machine Learning Classification Regression Regression Generation https://research.nvidia.com/sites/default/files/publications/dnn_denoise_author.pdf 4 CWoLa Hunting

  5. Machine Learning Classification http://karpathy.github.io/2015/05/21/rnn-effectiveness/ Regression Generation Generation 5 CWoLa Hunting

  6. CWoLa Hunting 6

  7. CWoLa Hunting 7

  8. Machine Learning at the LHC Classification: Jets Classification Regression ArXiv: 1511.05190 L. Oliveira, M. Kagan, L. Mackey, B. Nachman, A. Schwartzman Generation 8 CWoLa Hunting

  9. Machine Learning at the LHC Classification Regression: Pileup removal Regression Generation ArXiv:1707.08600. P. T. Komiske, E. M. Metodiev, B. Nachman, M. D. Schwartz 9 CWoLa Hunting

  10. Machine Learning at the LHC Classification ArXiv 1712.10321, M. Paganini, L. de Oliveira, B. Nachamn Regression Generation: Fast simulation Generation 10 CWoLa Hunting

  11. Basic Machine Learning Primer 0) Decide objective: e.g. classify dog vs cat pictures 1) Choose a network architecture: e.g. CNN 2) Choose loss function (objective metric) 3) Train network using training data 4) Apply network on new test data https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8 11 CWoLa Hunting

  12. Basic Machine Learning Primer 0) Decide objective: e.g. classify dog vs cat pictures A NN is just a function mapping N input 1) Choose a network architecture: e.g. CNN numbers to M output numbers. 2) Choose loss function (objective metric) 3) Train network using training data Trainable internal numbers – weights 4) Apply network on new test data and biases – determine that function. https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8 12 CWoLa Hunting

  13. Basic Machine Learning Primer 0) Decide objective: e.g. classify dog vs cat pictures Naive example: 1) Choose a network architecture: e.g. CNN Fraction of correct predictions 2) Choose loss function (objective metric) 3) Train network using training data Practical example: Cross-entropy loss 4) Apply network on new test data -∑ y(x) log[NN(x)] https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8 13 CWoLa Hunting

  14. Basic Machine Learning Primer 0) Decide objective: e.g. classify dog vs cat pictures Use some iterative optimization 1) Choose a network architecture: e.g. CNN algorithm to minimize the loss 2) Choose loss function (objective metric) function on training data. 3) Train network using training data 4) Apply network on new test data Be careful in selecting training data! https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8 14 CWoLa Hunting

  15. Basic Machine Learning Primer 0) Decide objective: e.g. classify dog vs cat pictures Training: Slow 1) Choose a network architecture: e.g. CNN Testing: Fast 2) Choose loss function (objective metric) 3) Train network using training data Performance may be limited by the 4) Apply network on new test data quality / relevance of training data. https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8 15 CWoLa Hunting

  16. BSM Searches: Nothing so far 16 CWoLa Hunting

  17. BSM Searches: Nothing so far Possibilities: 1) LHC doesn’t have the answers to our questions 2) Maybe new physics is rare: have to wait for high luminosity LHC 3) Maybe it is there but we are not doing the right search (theory bias has been wrong) 17 CWoLa Hunting

  18. Are we missing something? 1) Ever-more sensitive dedicated searches for the standard culprits: – Minimal Supersymmetry – Top Partners – diboson / ttbar resonances 18 CWoLa Hunting

  19. Are we missing something? 1) Ever-more sensitive 2) General-purpose dedicated searches for the ‘model-independent’ standard culprits: searches for unexpected new – Minimal Supersymmetry physics – Top Partners – diboson / ttbar resonances 19 CWoLa Hunting

  20. Signatures vs Models E.g. 2-body resonances (pp → X → SM SM): Signatures [1610.09392] Craig, Draper, Kong, Ng, Whiteson Models 20 CWoLa Hunting

  21. Signatures vs Models E.g. 2-body resonances: pp → X → BSM BSM largely uncovered 21 CWoLa Hunting

  22. Basic Resonance Searches E.g. Dijet Search 22 CWoLa Hunting

  23. Basic Resonance Searches E.g. Dijet Search E.g. WW resonance, tt resonance, etc. Selection for signal-like events 23 CWoLa Hunting

  24. Basic Resonance Searches E.g. Dijet Search E.g. WW resonance, tt resonance, etc. Selection for signal-like events ‘Old fashioned’: Simple few-D substructure selection ‘Modern’: Deep NN classifier W-jets using ~few hundred jet constituent inputs (~300-D selection). 24 CWoLa Hunting

  25. Basic Resonance Searches E.g. Dijet Search E.g. ?? resonance Selection for signal-like events ?-jets 25 CWoLa Hunting

  26. A Traditional Dichotomy Model Inclusive Search Model Specific Search – Weak signal assumptions – Strong signal assumptions – Basic selection criteria in few – Sophisticated multivariate variables selection – Large backgrounds – Small backgrounds – Risk missing a signal under – Risk not making the ‘correct’ background signal selection How to make a search with a sophisticated multivariate selection to beat backgrounds while using weak signal assumptions (unknown specific signal model)? → Learn selection from data 26 CWoLa Hunting

  27. Why Train Machines on Data? 1) Maybe you have not simulated the correct signal model (either because you haven’t thought of it, or because it involves non-perturbative physics that prevents simulation) 27 CWoLa Hunting

  28. Why Train Machines on Data? 1) Maybe you have not simulated the correct signal model (either because you haven’t thought of it, or because it involves non-perturbative physics that prevents simulation) 2) Monte-Carlo simulation for training data may difger from real LHC data Figure taken from Ben Nachman’s talk at BOOST 2018 https://indico.cern.ch/event/649482/contributions/2993322/attachments/1688082/2715256/WeakSu pervision_BOOST2018.pdf 28 CWoLa Hunting

  29. Weak Supervision Solution for ML: P. T. Komiske, E. M. Metodiev, B. Nachman, M. D. Schwartz Train directly on data using mixed samples See also [1801.10158] A) LoLiProp (Learning from Labelled B) CWoLa (Classification Without Labels) Proportions) Train using class proportions Train to classify as mixed sample 1 or 2. [1702.00414] L. Dery, B. Nachman, F. Rubbo, A. [1708.02949] E. M. Metodiev, B. Nachman, J. Thaler Schwartzman [1706.09451] T. Cohen, M. Freytsis, B. Ostdiek 29 CWoLa Hunting

  30. CWoLa CWoLa Classifier trained to optimally discriminate mixed sample 1 from mixed sample 2 is also optimal for discriminating S from B, so long as: – Samples 1 and 2 contain difgerent fractions of S and B – S in sample 1 is drawn from the same distribution as S in sample 2 – B in sample 1 is drawn from the same distribution as B in sample 2 – Training statistics are sufgiciently large How to use this for a search where S is new physics and B is SM background? 30 CWoLa Hunting

  31. CWoLa Hunting 1. Assume signal is localized in some specific variable in which background is smooth. 31 CWoLa Hunting

  32. CWoLa Hunting 1. Assume signal is localized in some specific variable in which background is smooth. 2. Assume signal has some distinguishing characteristics within some broad set of additional observables y . 32 CWoLa Hunting

  33. CWoLa Hunting Mixed Sample 1 Mixed Sample 2 1. Assume signal is localized in some specific variable in which background is smooth. 2. Assume signal has some distinguishing characteristics within some broad set of additional observables y . 3. For some resonance mass hypothesis, split data into signal-region and sideband-region mixed samples 33 CWoLa Hunting

  34. CWoLa Hunting Mixed Sample 1 Mixed Sample 2 Selection for signal-region -like events Train classifier to discriminate samples based on variables y Note: background y distribution should not be strongly varying with the resonance variable. 34 CWoLa Hunting

  35. CWoLa Hunting Mixed Sample 1 Mixed Sample 2 Selection for signal-region -like events 1 2 35 CWoLa Hunting

  36. Overfitting and the Look Elsewhere Efgect Of course, there is going to be a large trials factor, especially if y is high-dimensional. Easy solution: Train test split (Statistical fluctuations in training and test set are uncorrelated) More sophisticated: Nested cross-training 36 CWoLa Hunting

  37. Nested Cross-Training 1) Divide entire dataset into k-folds Test set Training signal region Training sideband 37 CWoLa Hunting

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend