Nesime Mejbah Justin Tae Jun Stan Alam Gottschlich Lee Zdonik - PowerPoint PPT Presentation

Nesime Mejbah Justin Tae Jun Stan Alam Gottschlich Lee Zdonik Tatbul 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada

Motivation: Time Series Anomaly Detection  Anomaly: Patterns that do not conform to expected behavior.  Anomalies can have critical impact: loss of life, property damage, monetary loss, ...  Applications of anomaly detection (AD) are numerous and diverse. Autonomous Driving Cancer Detection Six levels of autonomy:  L0: No automation  L1: Driver assistance Anomalies  L2: Partial automation often occur  L3: Conditional automation L3+ autonomy over a  L4: High automation requires robust period of time.  L5: Full automation AD systems. Source: Society of Automotive Engineers (SAE), Source: National Highway and Traffic Safety Administration (NHTSA) http://www.vaccinogeninc.com/oncovax/science-due-diligence/overview-part-1 2

Motivation: Range-based Anomalies  Time series anomalies are range based , i.e., they occur over a period of time. Atrial Premature Contraction anomaly in human ECG  There are domain-specific application preferences . – Cancer detection, Real-time systems: – Early response; Avoid false negatives! – Robotic defense systems: – Delayed response; Avoid false positives! – Emergency braking in self-driving cars: Source: Chandola et al., “Anomaly Detection: A Survey”, – Neither too early nor too late; Avoid false negatives! ACM Computing Surveys, 41(3), 2009. 3

Problem: How to Measure Accuracy? Point-based Anomalies Range-based Anomalies T rue P ositives F alse F alse N egatives P ositives 𝑄𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 = ? 𝑆𝑓𝑑𝑏𝑚𝑚 = ? 𝑄𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 = 𝑈𝑄 ÷ (𝑈𝑄 + 𝐺𝑄)  Must express partial detection 𝑆𝑓𝑑𝑏𝑚𝑚 = 𝑈𝑄 ÷ 𝑈𝑄 + 𝐺𝑂  Must support flexible time bias 4

State of the Art  Classical Precision and Recall 𝑄𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 × 𝑆𝑓𝑑𝑏𝑚𝑚 𝛾 = (1 + 𝛾 2 ) × 𝐺 (𝛾 2 × 𝑄𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜) + 𝑆𝑓𝑑𝑏𝑚𝑚 – Point-based anomalies β : relative importance of Recall to Precision – Precision penalizes FP, Recall penalizes FN β = 1 : evenly weighted (harmonic mean) β = 2 : weights Recall higher (i.e., no FN!) – F β -Score to combine and weight them β = 0.5 : weights Precision higher (i.e., no FP!)  Numenta Anomaly Benchmark (NAB) [2] – Point-based anomalies – Focuses specifically on early detection use cases – Difficult to use in practice (irregularities, ambiguities, magic numbers) [3]  Activity recognition metrics – No support for flexible time bias [2] Lavin and Ahmad, “Evaluating Real -Time Anomaly Detection Algorithms – The Numenta Anomaly Benchmark”, IEEE ICMLA, 2015. 5 [3] Singh and Olinsky , “ Demistifying Numenta Anomaly Benchmark”, IEEE IJCNN, 2017.

Precision and Recall for Time Series Customizable parameters  We extend classical Precision and Recall to measure ranges. Range-based Recall  Our model is: – expressive – flexible – extensible Range-based Precision 6

Customization Examples Overlap Size ω() Positional Bias δ() Cancer Detection: Robotic Defense: Emergency Braking:  Set δ () = Front-end , β = 2  Set δ() = Back-end , β = 0.5  Set δ() = Middle , β = 1.5 Our model subsumes the classical point-based model , when:  all ranges are represented as unit-size ranges, and  α=0 , γ()=1 , ω() is as above, and δ() = Flat 7

Selected Experimental Results Comparison to Classical model Comparison to Numenta model Multiple Anomaly Detectors (LSTM-AD) (LSTM-AD) (NYC-Taxi) Our model Our model can Our model is more effective in  subsumes the classical model  mimic the Numenta model  evaluating multiple detectors  is sensitive to positional bias  catch additional intricacies  capturing subtleties in data Please see our paper for details of this experimental study and additional results. 8

Key Takeaways  This work extends the classical Precision and Recall model to time series data.  We provide tunable parameters to capture domain-specific application preferences.  Experiments with diverse datasets and anomaly detectors prove the benefits of our approach.  Future work includes: – designing new training strategies for range-based anomaly detection – exploring use in other time series classification tasks and applications 9

More Information Watch our short video: https://www.youtube.com/watch?v=K5f-dUBiQP4 Read our paper: https://arxiv.org/abs/1803.03639/ Download our tool: https://github.com/IntelLabs/TSAD-Evaluator/ Visit our poster session at NeurIPS’18: Today at 5:00 - 7:00 PM in Room 210 & 230 AB #116 Thanks to Intel and NSF for funding this research. 10

Nesime Mejbah Justin Tae Jun Stan Alam Gottschlich Lee Zdonik - PowerPoint PPT Presentation

Nesime Mejbah Justin Tae Jun Stan Alam Gottschlich Lee Zdonik Tatbul 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada Motivation: Time Series Anomaly Detection Anomaly: Patterns that do not

SKY NETWORK TELEVISION ANNUAL RESULTS 2005 Jun-05 Jun-04 Wholesale Jun-03 Jun-02 Jun-01

Alice Springs Annual Water Production and Rainfall Jun 06 Jun 04 Jun 02 Jun 00 Jun

Challenges and Opportunities Nesime Tatbul Talk Outline Integrated data stream processing

An Introduction to Stan and RStan Introduction I (MW) am not a developer of Stan , only a very

Invyswell: A HyTM for Haswell RTM Irina Calciu, Justin Gottschlich, Tatiana Shpeisman, Gilles

NUMA-Friendly Stack (using Delegation and Elimination) Irina Calciu Justin Gottschlich Maurice

Machine Programming Justin Gottschlich, Intel Labs December 12 th , 2018 TVM Conference,

NAVY NAVY NAVY NAVY Justin G. Miller Justin G. Miller Justin G. Miller Justin G. Miller ENS

Photobioreactor system case-study Tom a s Stan ek Tom a s Stan ek

of Nanofiltration Membranes GeoEnergy 2018 Zamir.Alam@suez.com Matt.Boczkowski@suez.com Z. Alam,

Presented by: Ajwad Alam Prepared By: Ajwad Alam, Jaman Sharif, Makame Mahmud & Raqib Al

Data Inges*on for the Connected World John Meehan, Cansu Aslantas, Stan Zdonik (Brown

Genesis: A Hardware Acceleration Framework for Genomic Data Analysis Tae Jun Ham , David

Generative Well-intentioned Networks Justin Cosentino ( justin@cosentino.io ) Jun Zhu (

Intelligent Compaction Intelligent Stan Rakowski Stan Rakowski Technical Services Manager

UNDERSTANDI NG UNDERSTANDI NG ELECTI ONS ELECTI ONS I N PAKI STAN I N PAKI STAN Dr. Ijaz

Project leads: Dr Elvan U. Akyuz & Declan Phelan Project team: City & Hackney Assertive

Powerline noise elimination MATLAB tutorial series (Part 2.2) Pouyan Ebrahimbabaie Laboratory

FaultTracer: A Change Impact and Regression Fault Analysis Tool for Evolving Java Programs

Using the cARdiac ECG Augmented Reality Application Acknowledgements: School of Medicine,

MIMI Study Minimalist Immediate Mechanical Interven4on

Higher Proper ads Philip Hackney University of Louisiana at Lafayette 3rd Conference on ope - ad

The missing link in dynamic software analysis Symposium on Software Performance This research was

Mathematical modeling from ion channel to ECG h l t ECG an Introduction Mark Potse model

Nesime Mejbah Justin Tae Jun Stan Alam Gottschlich Lee Zdonik - PowerPoint PPT Presentation

Nesime Mejbah Justin Tae Jun Stan Alam Gottschlich Lee Zdonik Tatbul 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada Motivation: Time Series Anomaly Detection Anomaly: Patterns that do not

SKY NETWORK TELEVISION ANNUAL RESULTS 2005 Jun-05 Jun-04 Wholesale Jun-03 Jun-02 Jun-01

Alice Springs Annual Water Production and Rainfall Jun 06 Jun 04 Jun 02 Jun 00 Jun

Challenges and Opportunities Nesime Tatbul Talk Outline Integrated data stream processing

An Introduction to Stan and RStan Introduction I (MW) am not a developer of Stan , only a very

Invyswell: A HyTM for Haswell RTM Irina Calciu, Justin Gottschlich, Tatiana Shpeisman, Gilles

NUMA-Friendly Stack (using Delegation and Elimination) Irina Calciu Justin Gottschlich Maurice

Machine Programming Justin Gottschlich, Intel Labs December 12 th , 2018 TVM Conference,

NAVY NAVY NAVY NAVY Justin G. Miller Justin G. Miller Justin G. Miller Justin G. Miller ENS

Photobioreactor system case-study Tom a s Stan ek Tom a s Stan ek

of Nanofiltration Membranes GeoEnergy 2018 Zamir.Alam@suez.com Matt.Boczkowski@suez.com Z. Alam,

Presented by: Ajwad Alam Prepared By: Ajwad Alam, Jaman Sharif, Makame Mahmud &amp; Raqib Al

Data Inges*on for the Connected World John Meehan, Cansu Aslantas, Stan Zdonik (Brown

Genesis: A Hardware Acceleration Framework for Genomic Data Analysis Tae Jun Ham , David

Generative Well-intentioned Networks Justin Cosentino ( justin@cosentino.io ) Jun Zhu (

Intelligent Compaction Intelligent Stan Rakowski Stan Rakowski Technical Services Manager

UNDERSTANDI NG UNDERSTANDI NG ELECTI ONS ELECTI ONS I N PAKI STAN I N PAKI STAN Dr. Ijaz

Project leads: Dr Elvan U. Akyuz &amp; Declan Phelan Project team: City &amp; Hackney Assertive

Powerline noise elimination MATLAB tutorial series (Part 2.2) Pouyan Ebrahimbabaie Laboratory

FaultTracer: A Change Impact and Regression Fault Analysis Tool for Evolving Java Programs

Using the cARdiac ECG Augmented Reality Application Acknowledgements: School of Medicine,

MIMI Study Minimalist Immediate Mechanical Interven4on

Higher Proper ads Philip Hackney University of Louisiana at Lafayette 3rd Conference on ope - ad

The missing link in dynamic software analysis Symposium on Software Performance This research was

Mathematical modeling from ion channel to ECG h l t ECG an Introduction Mark Potse model

Presented by: Ajwad Alam Prepared By: Ajwad Alam, Jaman Sharif, Makame Mahmud & Raqib Al

Project leads: Dr Elvan U. Akyuz & Declan Phelan Project team: City & Hackney Assertive