Fundamentals of Machine Learning Instructor: Ekpe Okorafor 1. - - PowerPoint PPT Presentation

fundamentals of machine
SMART_READER_LITE
LIVE PREVIEW

Fundamentals of Machine Learning Instructor: Ekpe Okorafor 1. - - PowerPoint PPT Presentation

Fundamentals of Machine Learning Instructor: Ekpe Okorafor 1. Accenture Big Data Academy 2. Computer Science African University of Science & Technology Ekpe Okorafor, PhD Affiliations: Accenture Digital Big Data Academy


slide-1
SLIDE 1

Fundamentals of Machine Learning

Instructor: Ekpe Okorafor

1. Accenture – Big Data Academy 2. Computer Science African University of Science & Technology

slide-2
SLIDE 2

Affiliations:

  • Accenture Digital – Big Data Academy

 Principal & Faculty, Applied Intelligence

  • African University of Science & Technology

 Visiting Professor, Computer Science / Data Science

  • Dallas, USA

Ekpe Okorafor, PhD

Email: ekpe.okorafor@gmail.com; eokorafor@aust.edu.ng Twitter: @EkpeOkorafor; @Radicube

  • Big Data, Predictive & Adaptive Analytics
  • Artificial Intelligence, Machine Learning
  • Performance Modelling and Analysis
  • Information Assurance and Cybersecurity.
  • High Performance Computing & Network Architectures
  • Distributed Storage & Processing
  • Massively Parallel Processing & Programming
  • Fault-tolerant Systems

Research Interests:

slide-3
SLIDE 3

Objectives

Objectives

  • What Machine Learning is
  • When to Leverage Machine learning
  • Machine Learning algorithms
  • Machine Learning methodology

3

slide-4
SLIDE 4

What is Machine Learning

4

Machines are taking over!

slide-5
SLIDE 5

But Seriously, What is Machine Learning?

5

“Machine Learning is the science of getting computers to act without being explicitly programmed.” – Andrew Ng (Coursera) “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at task in T, as measured by P, improves with experience E.” – Tom M. Mitchell (1997)

slide-6
SLIDE 6

What are AI and ML?

6

  • Artificial Intelligence (AI) is a branch or Computer Science that uses algorithms and techniques to

mimic human intelligence

  • Machine Learning (ML) is one of several AI techniques for sophisticated cognitive tasks

Computer Science

Mathematical foundations Algorithms and data structures

Artificial Intelligence

Communication and security Computer Architecture Databases …… Symbolic AL (e.g. Expert Systems) Probabilistic AI (e.g. Search & optimization)

Machine Learning

Decision trees Bayesian inference Deep learning Reinforced learning Support vector machines Neural networks Random forest ……

slide-7
SLIDE 7

Machine Learning

7

  • Machine Learning is a particularly interesting technique because it represents a paradigm shift

within AI Traditional AI techniques

Data Logic

Machine Learning

Data Output Output Logic

  • Static – hard-coded set of

steps and scenarios

  • Rule Based – expert

knowledge

  • No generalization – handling

special cases is difficult

  • Dynamic – evolves with data,

finds new patterns

  • Data driven – discovers

knowledge

  • Generalization – adapts to

new situations and special cases

slide-8
SLIDE 8

Machine Learning - Example

8

Symbolic AI “Let us sit down with the world’s best chess player, Ekpe Okorafor, and put his knowledge into a computer program” Mathematical/Statistical AI Machine Learning Approach

  • Example - Excelling at playing the game of chess

“Let us simulate all the different possible moves and the associated outcomes at each single step and go with the most likely to win” “Let us show millions of examples or real life and simulated games (won and lost) to the program, and let it learn from experience”

slide-9
SLIDE 9

Machine Learning – When to use

9

Tasks programmers can’t describe

  • Machine learning is particularly good at solving 2 types of problems

where other AI techniques fail

Complex multidimensional problems that can’t be solved by numerical reasoning

Hand writing Cognitive Reasoning Weather Forecasting Network Intrusion Health Care Outcomes Movie Recommendation

slide-10
SLIDE 10

Machine Learning – Breaking it down

10

Supervised and Unsupervised Learning

  • Supervised learning - we already know the answers we want (found in

past or completed data).

  • Unsupervised learning - we want to find unknown structures or trends.

Input Data

Information + Answers

Result

Optimum Model

  • Relationships
  • Patterns
  • Dependencies
  • Hidden structures

Machine Learning

slide-11
SLIDE 11

Supervised Learning

11

slide-12
SLIDE 12

Unsupervised Learning

12

slide-13
SLIDE 13

Supervised and Unsupervised Learning

13

REGRESSION: Estimate continuous values (Real-valued output)

Supervised Learning:

Predicting values. Known targets. User inputs correct answers to learn from. Machine uses the information to guess new answers.

CLASSIFICATION: Identify a unique class (Discrete values, Boolean, Categories) CLUSTER ANALYSIS: Group into sets

Unsupervised Learning:

Search for structure in data. Unknown targets. User inputs data with undefined answers. Machine finds useful information hidden in data

DENSITY ESTIMATION: Approximate distribution DENSITY REDUCTION: Select relevant variables

slide-14
SLIDE 14

Supervised and Unsupervised Learning

14

Supervised Learning: Unsupervised Learning:

Regression

  • Linear Regression
  • Ordinary Least Squares Regression
  • LOESS (Local Regression)
  • Neural Networks

Classification

  • Decision Trees
  • K-Nearest Neighbors
  • Support Vector Machine
  • Logistic Regression
  • Naïve Bayes
  • Random Forests

Cluster Analysis

  • K-Means Clustering
  • Hierarchical Clustering

Dimension Reduction

  • Principal Component Analysis (PCA)
  • Linear Discriminant Analysis (LDA)
slide-15
SLIDE 15

What About Reinforcement Learning?

15

10 Mins

slide-16
SLIDE 16

Machine Learning Application – Recommender Systems

16

  • Recommender systems deal with making recommendations based upon

previously collected data and leveraging ML techniques. Content Based (Features) Modified Linear Regression Non-content Based (No Features) Collaborative Filtering Matrix Factorization

slide-17
SLIDE 17

Train & Test Methodology

17

  • ML techniques use a train + test system (commonly known as cross-

validation) before using findings in real situations.

TRAINING:

Learn data properties

1. The machine makes conclusions by learning from the data 2. It improves its model until optimal Performance is reached 3. Using a Cost / Loss Function to measure

  • Accuracy. It repeats iterations until a

minimum Is reached.

TESTING:

Test the properties

1. Apply the conclusions to new data and compare results to know answers 2. The model does not change. It us just tested to measure how good the machine did after training 3. Useful to detect overfitting. If good enough, it is ready to be used

APPLICATION:

Use the properties

  • In a real situation, the answers are not known
  • Apply the model conclusions to predict the

answers from the inputs. Use the answers in whatever necessary

slide-18
SLIDE 18

Additional Resources

18

  • ML course at Coursera: https://www.coursera.org/learn/machine-learning/
  • Toolbox scikit-learn: http://scikit-learn.org/stable/user_guide.html
  • Caret Package: http://topepo.github.io/caret/index.html
  • Python and R codes: http://www.analyticsvidhya.com/blog/2015/09/full-

cheatsheet-machine-learning-algorithms/

  • Introductory Primer to Machine Learning: http://www.toptal.com/machine-

learning/machine-learning-theory-an-introductory-primer

slide-19
SLIDE 19

Summary

19

  • Machine Learning (ML) is one of several AI techniques for sophisticated

cognitive tasks

  • Machine Learning is a particularly interesting technique because it

represents a paradigm shift within AI

  • Machine learning is particularly good at solving 2 types of problems

where other AI techniques fail

  • Tasks programmers can’t describe
  • Complex multidimensional problems that can’t be solved by

numerical reasoning

  • Machine Learning employs supervised and unsupervised learning

approaches

  • ML techniques use a train + test system (commonly known as cross-

validation) before using findings in real situations.