School of EECS Washington State University CptS 570 - Machine - - PowerPoint PPT Presentation

school of eecs
SMART_READER_LITE
LIVE PREVIEW

School of EECS Washington State University CptS 570 - Machine - - PowerPoint PPT Presentation

CptS 570 Machine Learning School of EECS Washington State University CptS 570 - Machine Learning 1 Course overview What is machine learning? Why do machine learning? Applications Approaches Resources CptS 570 -


slide-1
SLIDE 1

CptS 570 – Machine Learning School of EECS Washington State University

CptS 570 - Machine Learning 1

slide-2
SLIDE 2

 Course overview  What is machine learning?  Why do machine learning?  Applications  Approaches  Resources

CptS 570 - Machine Learning 2

slide-3
SLIDE 3

 Objectives

  • Knowledge of machine learning (ML) foundations,

paradigms and algorithms

  • Techniques for evaluating ML algorithms
  • Practical experience using ML algorithms
  • Current ML issues

 Website

  • www.eecs.wsu.edu/~holder/courses/CptS570.html

CptS 570 - Machine Learning 3

slide-4
SLIDE 4

 Assignments

  • Readings
  • Six (6) homeworks (40%)
  • Two (2) exams (20%)
  • Project (20%)
  • Presentation (10%)
  • Critiques and class participation (10%)

CptS 570 - Machine Learning 4

slide-5
SLIDE 5

 Textbook

  • Ethem Alpaydin (2010). Introduction to Machine

Learning, Second Edition. MIT Press.

  • www.cmpe.boun.edu.tr/~ethem/i2ml2e

CptS 570 - Machine Learning 5

slide-6
SLIDE 6

 What is learning?  What is machine learning?

CptS 570 - Machine Learning 6

slide-7
SLIDE 7

 Webster

  • Gain knowledge or understanding of or

skill in by study, instruction or experience

  • Memorize
  • Synonym: Discovery

 Obtain knowledge of for the first time  May imply acquiring knowledge with little effort or conscious intention (as by simply being told) or it may imply study and practice

  • Knowledge

 Knowing something with familiarity gained through experience or association  Facts or ideas acquired by study, investigation, observation,

  • r experience

CptS 570 - Machine Learning 7

Sleep learning?

(www.links999.net)

slide-8
SLIDE 8

 Herbert Simon (1970)

  • Any process by which a system improves its

performance.

 Tom Mitchell (1990)

  • A computer program that improves its performance

at some task through experience.

 Ethem Alpaydin (2010)

  • Programming computers to optimize a performance

criterion using example data or past experience.

CptS 570 - Machine Learning 8

slide-9
SLIDE 9

 How is knowledge represented?  How is experience represented?  What is the performance measure?  Knowledge acquisition vs. skill acquisition  Is deduction learning?

CptS 570 - Machine Learning 9

slide-10
SLIDE 10

 Automated knowledge engineering

  • Expertise is scarce
  • Codification of expertise is difficult
  • Expertise frequently consists of a set of test cases
  • Data from measurements, but no information or

knowledge

 Only one computer has to learn, then copy  Discover new knowledge  Understand human learning  Systems need to adapt to unknown, dynamic

environments

CptS 570 - Machine Learning 10

slide-11
SLIDE 11

 Patient cases  [medical knowledge] 

automated (better?) diagnosis

 Autonomous driving  Speech recognition  Recommendations (Amazon, Netflix)  Prediction (business, financial, environment,

health, energy, …)

 Fraud/intrusion detection

CptS 570 - Machine Learning 11

slide-12
SLIDE 12

 Statistics  Pattern recognition  Signal processing  Control  Artificial intelligence  Data mining  Neuroscience  Cognitive science  Psychology

CptS 570 - Machine Learning 12

slide-13
SLIDE 13

 Supervised Learning

  • Classification
  • Regression

 Unsupervised Learning

  • Clustering

 Reinforcement Learning

CptS 570 - Machine Learning 13

slide-14
SLIDE 14

Income D e b t Default Good Status

14 CptS 570 - Machine Learning

slide-15
SLIDE 15

Income D e b t Default Good Status t If Income < t Then Default

15 CptS 570 - Machine Learning

slide-16
SLIDE 16

Income D e b t Default Good Status No Loan Loan if Debt < a*Income + b then Loan else No Loan

16 CptS 570 - Machine Learning

slide-17
SLIDE 17

Income D e b t Categories 1) Debt exceeds Income 2) High Debt, High Income 3) Low Debt Cluster 3 Cluster 1 Cluster 2

17 CptS 570 - Machine Learning

slide-18
SLIDE 18

Debt<50

Income Income

NO YES YES NO NO YES

yes no no <50 50 50- 100 100

>100 >100

<50 50 50- 100 100

>100 >100

18 CptS 570 - Machine Learning

Income D e b t

No Loan Loan

slide-19
SLIDE 19

Income D e b t

No Loan Loan Debt Loan No No Loan 0.123 23 0.203 03 0.117 17 0.545 45 Input Layer Hidden Layer Outpu put Layer Income me

19 CptS 570 - Machine Learning

slide-20
SLIDE 20

 Evaluation

  • Which learning approach is better

 Theoretical bounds

  • What is and is not learnable

 Scalability

  • Learning from massive datasets

CptS 570 - Machine Learning 20

slide-21
SLIDE 21

 Software

  • Weka (www.cs.waikato.ac.nz/~ml/weka)
  • Machine learning open-source software (mloss.org)

 Data

  • UCI ML Repository (archive.ics.uci.edu/ml)
  • UCI KDD Repository (kdd.ics.uci.edu)
  • Challenges: KDD-Cup, Netflix, …

CptS 570 - Machine Learning 21

slide-22
SLIDE 22

 Conferences

  • International Conference on Machine Learning

(ICML)

  • Knowledge Discovery and Data Mining (KDD)
  • IEEE Conference on Data Mining (ICDM)
  • SIAM Data Mining Conference (SDM)
  • Association for the Advancement of AI (AAAI)
  • International Joint Conference on AI (IJCAI)
  • Many more …

CptS 570 - Machine Learning 22

slide-23
SLIDE 23

 Journals

  • Machine Learning Journal
  • Journal of Machine Learning Research
  • Data Mining and Knowledge Discovery
  • Many more …

 WWW

  • www.kdnuggets.com (subscribe!)

CptS 570 - Machine Learning 23

slide-24
SLIDE 24

 Machine learning is a computational process

that improves performance based on experience.

 Numerous successful methods  Maturing theory  Open and active research area

CptS 570 - Machine Learning 24