Detection of faulty Beam Position Monitors E. Fol, R. Tomas Garcia - PowerPoint PPT Presentation

Detection of faulty Beam Position Monitors E. Fol, R. Tomas Garcia Machine Learning Applications for Particle Accelerators, Feb. 28 – March 2, 2018, SLAC National Accelerator Laboratory

Content Introduction to optics measurements at LHC 1. Faulty BPM data 2. Cluster analysis of measured signal 3. Autoencoder for anomaly detection 4. Conclusion and future tasks 5. 3

Introduction to optics measurements at LHC • Beam position monitors (BPMs) record the turn-by-turn data measuring the oscillations of the excited beam • Reconstruction of the optics mainly concerned by the phase advances obtained from the measured data using Fourier transformation analysis • One of the main parameters is beta-beating: ratio of the measured betatron function with respect to the nominal designed function • The measured optics is compared then to the nominal model of the machine in order to compute the corrections 4

1. Faulty BPM data • 1024 BPMs per beam around the ring to measure the turn-by-turn data • Statistical analysis of the past measurements show that ~10% of BPMs are faulty Optics functions are affected by occurrences of faulty signal • • some artifacts cannot be recognized in turn-by-turn data • reasons, relevant features and their correlations are unknown time-consuming manual “cleaning” and analysis re -computation • Unphysical values still escape our current automatic cleaning tools •  ML as an alternative solution to improve the optics measurements 5

1. Faulty BPM data 6

1. Faulty BPM data • Current cleaning tools • Basic pre-defined cuts (distance between signal peaks, flat signals, etc.) and known bad BPMs are removed in the first step • Advanced techniques*: • SVD: Singular vectors represent the variation of physical modes describing the beam motion: faulty BPMs produce modes with spatial vectors consisting of localized peaks defined by a certain threshold. • Fourier transform in windows: faulty signal show a randomly populated Fourier spectrum: computing the RMS of the amplitudes of the spectral lines within one or several windows detect bad BPMs * R. Calaga and R. Tomas “Statistical analysis of RHIC beam position monitors performance” 7

2. Cluster analysis of measured signal Manual cleaning and data observation required before trying to reconstruct the optics from the signals doesn’t prevent unphysical values in obtained optics functions • • manual cleaning might produce different results • is likely that some wrong values still escape human intervention General Idea: • we do not want to replicate the currently used automatic cleaning tools, so training data set could be built only combining automatic and manual cleaning result  unsupervised learning approach is preferable • assuming most of the BPMs measure correctly, the bad BPMs should appear as an anomaly, but the relevant parameters are unknown • consider all possible parameters and separate into regions  Cluster analysis 8

2. Cluster analysis of measured signal Signals from faulty BPM can be considered as “outliers” and • isolated into a separate cluster • Data visualization is needed to choose appropriate clustering method • Data preprocessing steps are needed before applying cluster analysis • How to define algorithm parameters for diverse optics settings? • How to evaluate the results? What is the relevant parameter? 9

2. Cluster analysis of measured signal K-means* as basic clustering approach: • Starting with randomly chosen data points as cluster centers, each point is assigned to the closest center • Move centers to the centroids of the clusters, reassign the points to the nearest center • Repeat until moving the centroid gives no improvement (based on total squared distance between each point and cluster centroid) *Stuart P. Lloyd. Least squares quantization in PCM http://shabal.in/visuals/kmeans 10

2. Cluster analysis of measured signal Implementation • • sklearn.cluster.KMeans • Features: amplitude, tune, phase advance beating (obtained from harmonic analysis on turn-by-turn data) • K-means allows only 2D-Clustering: all possible combinations of features are evaluated • all features scaled to the [0,1] range to be represented in a common space • IRs and arcs data treated separately since the beam behaves differently in these regions. 11

K-means with 3 clusters: Optics measurement from LHC Machine Development Arcs IRs 12

2. Cluster analysis of measured signal • Evaluation of K-means algorithm: • Appropriate number of clusters unknown • Very sensitive to outliers • Choice of the centroids affects the clustering results • also an outlier can become a centroid • unstable results  DBSCAN: Density-Based Spatial Clustering of application with noise* * “ A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise” Ester, M., H. P. Kriegel, J. Sander, and X. Xu 13

2. Cluster analysis of measured signal DBSCAN: • relies on density-based notion of clusters of arbitrary shape Core points : build a “neighbourhood” according to • a given distance Non-core points: are inside a neighbourhood, but don’t have enough points around to build a new neighbourhood Noise : don’t have enough points in the neighbourhood and are too far away from any of core points • noise can be easily detected as data points not belonging to any of clusters https://dashee87.github.io/data%20science/general/Clustering-with-Scikit-with-GIFs/ 14

DBSCAN ε = 0.3, MinPts = 80, Optics measurement from LHC Machine Development Implementation: • Scikit Learn tune, amplitude and phase advance • beating as features rescaled to [0,1] range • Clustering in n-dimensional space is possible: no combinations of features are needed • Set of measurements: we classify a BPM as bad if faulty signal is detected in at least one measurement 15

DBSCAN ε = 0.3, MinPts = 80, Optics measurement from LHC Machine Development Arcs IRs 16

2. Cluster analysis of measured signal Conclusions: • DBSCAN is more suitable for the given problem compared to K-means DBSCAN K-means 17

2. Cluster analysis of measured signal • the cluster constellation is stable over several runs • more information might be obtained from analysing the clusters of good BPMs 18

3. Autoencoder for anomaly detection How does an autoencoder work? * Tries to reproduce in its output whatever • comes in the input  f(x) = x • Encoder: compressing the input data to lower dimensions Decoder: Reconstructing the data into original input • The difference between output and input is measured using a loss function General idea of bad BPMs detection: Since the majority of BPMs are good, the trained model will learn the behaviour of valid signal • • The loss function will measure the difference between a given point and the learned general case Loss above a defined threshold  anomaly detected! • * https://cds.cern.ch/record/2274402 19

4. Conclusions and future tasks Cluster analysis • • Cluster analysis improvements potentially possible using ensemble methods (e.g. Random Forest*) • The results of cluster analysis can be used as trainings data for a binary classificator to detect bad BPMs • Classifier model that outputs the probability for a BPM to be bad • Deep neural networks • Building a deep autoencoder no feature engineering will be needed since the hidden layers will extract and learn the relevant features  direct application on turn-by-turn data is possible Beyond anomaly detection • • Study on prediction based optics correction using neural networks http://scikit-learn.org/stable/modules/outlier_detection.html#isolation-forest 20

21 2/21/2018

Detection of faulty Beam Position Monitors E. Fol, R. Tomas Garcia - PowerPoint PPT Presentation

Detection of faulty Beam Position Monitors E. Fol, R. Tomas Garcia Machine Learning Applications for Particle Accelerators, Feb. 28 March 2, 2018, SLAC National Accelerator Laboratory Content Introduction to optics measurements at LHC 1.

Real Real- -Time Systems Time Systems Monitors Monitors Monitors: (Burns & Wellings,

COMP31212: Concurrency Topics 4.1: Concurrency Patterns - Monitors Topic 4.1: Concurrency

Linac and Booster Beam Diagnostics Proton Source Workshop December 7 and 8 2010 December 7 and 8,

DEBRA Members Weekend 2017 EB therapy research in Dundee: an update Knocking out the faulty

E-lens related beam-beam experiment Xiaofeng Gu 1 IP10 -- e-beam collision with proton only (up

Beam-beam Studies, Tool Development and Tests EIC Collaboration Meeting, Jlab, Oct. 29-Nov. 1,

Intra-Pulse Beam-Beam Scans at the NLC IP Steve Smith SLAC Nanobeams 2002 Beam-Beam Scans

Monitors & Condition Synchronisation controller DM519 Concurrent Programming 1 Monitors

Monitors Dr. Liam OConnor University of Edinburgh LFCS (and UNSW) Term 2 2020 1 Monitors

Monitors (week 4) 2 / 44 INF4140 - Models of concurrency Monitors, lecture 4 Hsten 2013 16.

Primary Proton Beam Profile Monitors -Segmented Secondary Emission Monitor- Outline Introduction

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Position Manage me nt Position Management Age nda Overview of Position Management

Optimal Learning of Joint Alignments with a Faulty Oracle Charalampos E. Tsourakakis

. Bruno Durand LIRMM CNRS Universit de Montpellier II November26 th 2011 . . . 1.

Beam Position Monitor for a Particle Accelerator Andrew Noble Mona Elkady Overview Free

Mixture Models and EM Henrik I. Christensen Robotics & Intelligent Machines @ GT Georgia

Classification method in single particle analysis Cluster Analysis Pawel A. Penczek

K -means Clustering Ke Chen Reading: [7.3, EA], [9.1, CMB] COMP24111 Machine Learning Outline

Machine Learning Lecture Notes on Clustering (IV) 2016-2017 Davide Eynard davide.eynard@usi.ch

CLUSTERING Based on Foundations of Statistical NLP, C. Manning & H. Sch utze, MIT

On learning statistical mixtures maximizing the complete likelihood The k -MLE methodology using

The impact of high dimension on clustering Gilles Celeux Inria Saclay-le-de-France, Universit

Density-based Clustering MAT 6480W / STT 6705V Guy Wolf guy.wolf@umontreal.ca Universit e de

Detection of faulty Beam Position Monitors E. Fol, R. Tomas Garcia - PowerPoint PPT Presentation

Detection of faulty Beam Position Monitors E. Fol, R. Tomas Garcia Machine Learning Applications for Particle Accelerators, Feb. 28 March 2, 2018, SLAC National Accelerator Laboratory Content Introduction to optics measurements at LHC 1.

Real Real- -Time Systems Time Systems Monitors Monitors Monitors: (Burns &amp; Wellings,

COMP31212: Concurrency Topics 4.1: Concurrency Patterns - Monitors Topic 4.1: Concurrency

Linac and Booster Beam Diagnostics Proton Source Workshop December 7 and 8 2010 December 7 and 8,

DEBRA Members Weekend 2017 EB therapy research in Dundee: an update Knocking out the faulty

E-lens related beam-beam experiment Xiaofeng Gu 1 IP10 -- e-beam collision with proton only (up

Beam-beam Studies, Tool Development and Tests EIC Collaboration Meeting, Jlab, Oct. 29-Nov. 1,

Intra-Pulse Beam-Beam Scans at the NLC IP Steve Smith SLAC Nanobeams 2002 Beam-Beam Scans

Monitors &amp; Condition Synchronisation controller DM519 Concurrent Programming 1 Monitors

Monitors Dr. Liam OConnor University of Edinburgh LFCS (and UNSW) Term 2 2020 1 Monitors

Monitors (week 4) 2 / 44 INF4140 - Models of concurrency Monitors, lecture 4 Hsten 2013 16.

Primary Proton Beam Profile Monitors -Segmented Secondary Emission Monitor- Outline Introduction

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

Position Manage me nt Position Management Age nda Overview of Position Management

Optimal Learning of Joint Alignments with a Faulty Oracle Charalampos E. Tsourakakis

. Bruno Durand LIRMM CNRS Universit de Montpellier II November26 th 2011 . . . 1.

Beam Position Monitor for a Particle Accelerator Andrew Noble Mona Elkady Overview Free

Mixture Models and EM Henrik I. Christensen Robotics &amp; Intelligent Machines @ GT Georgia

Classification method in single particle analysis Cluster Analysis Pawel A. Penczek

K -means Clustering Ke Chen Reading: [7.3, EA], [9.1, CMB] COMP24111 Machine Learning Outline

Machine Learning Lecture Notes on Clustering (IV) 2016-2017 Davide Eynard davide.eynard@usi.ch

CLUSTERING Based on Foundations of Statistical NLP, C. Manning &amp; H. Sch utze, MIT

On learning statistical mixtures maximizing the complete likelihood The k -MLE methodology using

The impact of high dimension on clustering Gilles Celeux Inria Saclay-le-de-France, Universit

Density-based Clustering MAT 6480W / STT 6705V Guy Wolf guy.wolf@umontreal.ca Universit e de

Real Real- -Time Systems Time Systems Monitors Monitors Monitors: (Burns & Wellings,

Monitors & Condition Synchronisation controller DM519 Concurrent Programming 1 Monitors

Mixture Models and EM Henrik I. Christensen Robotics & Intelligent Machines @ GT Georgia

CLUSTERING Based on Foundations of Statistical NLP, C. Manning & H. Sch utze, MIT