ai and predictive analytics in data center environments
play

AI and Predictive Analytics in Data-Center Environments Supervised - PowerPoint PPT Presentation

AI and Predictive Analytics in Data-Center Environments Supervised Learning Methods Josep Ll. Berral @BSC Intel Academic Education Mindshare Initiative for AI Introduction If we have data and it is labeled, we can learn their relation and


  1. AI and Predictive Analytics in Data-Center Environments Supervised Learning Methods Josep Ll. Berral @BSC Intel Academic Education Mindshare Initiative for AI

  2. Introduction “If we have data and it is labeled, we can learn their relation and predict future labels”

  3. Supervised Learning • Supervised Learning • Training data is already labeled • Want to predict new unlabeled data

  4. Supervised Learning Observe Set of Examples <features> + <label> Label Later... New Set of Automatic Examples Labeling Observe Model <features> <labels>

  5. Supervised Learning • Labeling data: • By hand • By known methods • By posterior metrics • From known data Observe Label dataset “cat” “cat” “cat” in out “cat” “cat” “dog” “dog” “dog” “cat” “dog”

  6. Typical Flow What is “dog” Model this? Right! [Reinforce]

  7. Typical Flow What is “dog” Model this? Right! [Reinforce] What is “dog” Model this? No! It is a “cat” [Adapt]

  8. Typical Flow What is “dog” Model this? Right! [Reinforce] What is “dog” Model this? No! It is a “cat” [Adapt] What is “cat” Model this? Right! [Reinforce]

  9. Typical Flow What is “dog” Model this? Right! [Reinforce] Iterate What is until the “dog” Model this? model is accurate No! It is a “cat” [Adapt] What is “cat” Model this? Right! [Reinforce]

  10. Good Procedures • Keep some data “unseen”, to avoid overfitting / memorizing Algorithm Model Training Set

  11. Good Procedures • Keep some data “unseen”, to avoid overfitting / memorizing Tune the Algorithm Algorithm Model Training Set No Is it Predict Model Evaluate good? Validation Set

  12. Good Procedures • Keep some data “unseen”, to avoid overfitting / memorizing Tune the Algorithm Algorithm Model Training Set No Is it Predict Model Evaluate good? Yes Validation Set Final Predict Model Evaluation Test Set

  13. Supervised Learning Algorithms & Methods!

  14. Algorithms & Methods • Classification • The outputs are “classes” “cat” E.g.: “dog” • Regression • The outputs are “quantities” Max Speed E.g.: Car Properties

  15. Some Methods • Regression algorithms • Linear & Polynomial Regression, Gaussian Processes, ... “Attempt to find a function/set -of-functions that match with the example points” • The learning process minimizes the regression error

  16. Some Methods • Trees and Forests • Decision Trees, Regression Trees, Random Forests “Attempt to find a set of recursive partition that minimize the classification or regression error” “A” “C” “B” “A” “A” “B” “C” “B” “B” “C” “A” “C”

  17. Some Methods • k – Nearest Neighbors “ Compare new samples with some memorized ones, and classify/predict as the ‘k’ nearest ones” ?

  18. Some Methods • Bayesian Methods • Naïve Bayes, Bayesian Networks, ... “Compute probabilities of classes, events and relations, then apply Bayes theorem” P(A|B) = P(A) · P(B|A) / P(B) P(Class|Example) = P(Class) · P(Example|Class) / P(Example) = P(Class & Example) / P(Example)

  19. Some Methods • Support Vector Machines “Find the function that best divides classes, with a minimal tolerance for errors”

  20. Summary • Supervised Learning: • Models learn from labeled data, and “human - driven” tuning • Methods for Regression & Classification • Lots of Algorithms to be applied • Each with its characteristics • Strong and weak points • Different consumption of resources

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend