Lecture 1: Course outline and logistics What is Machine Learning - - PowerPoint PPT Presentation

lecture 1
SMART_READER_LITE
LIVE PREVIEW

Lecture 1: Course outline and logistics What is Machine Learning - - PowerPoint PPT Presentation

Lecture 1: Course outline and logistics What is Machine Learning Aykut Erdem February 2016 Hacettepe University Todays Schedule Course outline and logistics An overview of Machine Learning 2 Course outline and logistics


slide-1
SLIDE 1

Aykut Erdem

February 2016 Hacettepe University

Lecture 1:

−Course outline and logistics −What is Machine Learning

slide-2
SLIDE 2

Today’s Schedule

  • Course outline and logistics
  • An overview of Machine Learning

2

slide-3
SLIDE 3

Course outline and logistics

slide-4
SLIDE 4

Logistics

  • Instructor: 



 Aykut ERDEM (aykut@cs.hacettepe.edu.tr)

  • Teaching Assistant: 



 Aysun Kocak (aysunkocak@cs.hacettepe.edu.tr)
 


Burcak Asal (basal@cs.hacettepe.edu.tr)

  • Lectures: Tue 10:00 - 10:50_D10


Thu 09:00 - 10:50_D9

  • Tutorials: Fri 09:00 - 10:50_D8

4

slide-5
SLIDE 5

About this course

  • This is a undergraduate-level introductory course in machine

learning (ML)

⎯ A broad overview of many concepts and algorithms in ML.

  • Requirements

⎯ Basic algorithms, data structures. ⎯ Basic probability and statistics. ⎯ Basic linear algebra and calculus ⎯ Good programming skills


  • BBM 409 Introduction to Machine Learning Practicum

(New)

⎯ Students will gain skills to apply the concepts to real

world problems.

5

vector/matrix manipulations, partial derivatives common distributions, Bayes rule, mean/median/model

slide-6
SLIDE 6

Communication

  • The course webpage will be updated regularly

throughout the semester with lecture notes, programming and reading assignments and important deadlines. 
 http://web.cs.hacettepe.edu.tr/~aykut/classes/ spring2016/bbm406/

  • We will be using Piazza for course related discussions

and announcements. Please enroll the class on Piazza by following the link
 http://piazza.com/class#spring2016/bbm406

6

slide-7
SLIDE 7

Reference Books

  • Artificial Intelligence: A Modern Approach (3rd Edition), Russell

and Norvig. Prentice Hall, 2009

  • Bayesian Reasoning and Machine Learning, Barber, Cambridge

University Press, 2012. (online version available)

  • Introduction to Machine Learning (2nd Edition), Alpaydin, MIT

Press , 2010

  • Pattern Recognition and Machine Learning, Bishop, Springer,

2006

  • Machine Learning: A Probabilistic Perspective, Murphy, MIT

Press, 2012

7

slide-8
SLIDE 8

Grading Policy

  • Grading for BBM 406 will be based on

⎯ a course project (done in pairs) (25%),

⎯ a midterm exam (30%), ⎯ a final exam (40%), and ⎯ class participation (5%)

  • In BBM 409, the grading will be based on

⎯ a set of quizzes (20%), and

⎯ 3 assignments (done individually)

8

slide-9
SLIDE 9

Assignments

  • 3 assignments, first one worth 20%, last two worth 30%

each

  • Theoretical: Pencil-and-paper derivations
  • Programming: Implementing Python code to solve a

given real-world problem

  • A quick Python tutorial in this week’s tutorial session.

9

slide-10
SLIDE 10

10

slide-11
SLIDE 11

Course Project

  • Done individually, or in teams of two students.
  • Choose your own topic and explore ways to

solve the problem

  • Proposal: 1 page (Mar 8) (10%)


Progress Report: 4-5 pages (Apr 19) (25%)
 Poster Presentation: (last week of classes) (20%)
 Final Report: (due at the beginning of poster session) (45%)

11

slide-12
SLIDE 12

Collaboration Policy

  • All work on assignments have to be done individually. The

course project, however, can be done in pairs.

  • You are encouraged to discuss with your classmates about

the given assignments, but these discussions should be carried out in an abstract way.

  • In short, turning in someone else’s work, in whole or in

part, as your own will be considered as a violation of academic integrity.

  • Please note that the former condition also holds for the

material found on the web as everything on the web has been written by someone else.

12

http://www.plagiarism.org/plagiarism-101/prevention/

slide-13
SLIDE 13

Course Outline

  • Week1

Overview of Machine Learning, Nearest Neighbor Classifier

  • Week2

Linear Regression, Least Squares

  • Week3

Machine Learning Methodology

  • Week4

Statistical Estimation: MLE, MAP , Naïve Bayes Classifier


  • Week5

Linear Classification Models: Logistic Regression, Linear Discriminant Functions, Perceptron

  • Week6

Neural Networks

  • Week7

Midterm Exam

13

Assg1 out Assg1 due, Assg2 out Course project proposal due Assg2 due Assg3 out

slide-14
SLIDE 14

Course Outline (cont’d.)

  • Week8 Deep Learning
  • Week9 Support Vector Machines (SVMs)
  • Week10

Multi-class SVM

  • Week11

Decision Tree Learning

  • Week12

Ensemble Methods: Bagging, Random Forests, Boosting

  • Week13

Clustering

  • Week14

Principle Component Analysis, Autoencoders

14

Project progress report due Assg3 due

slide-15
SLIDE 15

Machine Learning: 
 An Overview

slide-16
SLIDE 16

Quotes

  • “If you were a current computer science student what area

would you start studying heavily?” –Answer: Machine Learning. –“The ultimate is computers that learn” –Bill Gates, Reddit AMA

  • “Machine learning is the next Internet”

–Tony Tether, Director, DARPA

  • “Machine learning is today’s discontinuity”

–Jerry Yang, CEO, Yahoo

16

slide by David Sontag

slide-17
SLIDE 17

Google Trends

17

Machine learning Deep learning

slide-18
SLIDE 18

2015 Edition

slide-19
SLIDE 19

2016 Edition

slide-20
SLIDE 20

Learning

20 Richard Feynman

slide by Bernhard Schölkopf

slide-21
SLIDE 21

Two definitions of learning

(1) Learning is the acquisition of knowledge 
 about the world. 
 Kupfermann (1985)
 (2) Learning is an adaptive change in behavior 
 caused by experience. 
 Shepherd

(1988)

21

slide by Bernhard Schölkopf

slide-22
SLIDE 22

Bernhard Schölkopf

8

  • !

x x x x x

x y

x x x

Leibniz, Weyl, Chaitin

x

y = a * x y = Σi ai k(x,xi) + b

Empirical Inference

  • Drawing conclusions from empirical data

(observations, measurements)

  • Example1: Scientific inference

22

slide by Bernhard Schölkopf

slide-23
SLIDE 23

Empirical Inference

  • Example2: Perception

23

slide by Bernhard Schölkopf

slide-24
SLIDE 24

slide by Bernhard Schölkopf

slide-25
SLIDE 25

slide by Bernhard Schölkopf

slide-26
SLIDE 26

slide by Bernhard Schölkopf

slide-27
SLIDE 27

slide by Bernhard Schölkopf

slide-28
SLIDE 28

slide by Bernhard Schölkopf

slide-29
SLIDE 29

slide by Bernhard Schölkopf

slide-30
SLIDE 30

slide by Bernhard Schölkopf

slide-31
SLIDE 31

slide by Bernhard Schölkopf

slide-32
SLIDE 32

slide by Bernhard Schölkopf

slide-33
SLIDE 33

slide by Bernhard Schölkopf

slide-34
SLIDE 34

slide by Bernhard Schölkopf

slide-35
SLIDE 35

slide by Bernhard Schölkopf

slide-36
SLIDE 36

slide by Bernhard Schölkopf

slide-37
SLIDE 37

slide by Bernhard Schölkopf

slide-38
SLIDE 38

slide by Bernhard Schölkopf

slide-39
SLIDE 39

slide by Bernhard Schölkopf

slide-40
SLIDE 40

slide by Bernhard Schölkopf

slide-41
SLIDE 41

slide by Bernhard Schölkopf

slide-42
SLIDE 42

slide by Bernhard Schölkopf

slide-43
SLIDE 43

slide by Bernhard Schölkopf

slide-44
SLIDE 44

Empirical Inference

  • Example2: Perception

44

"The brain is nothing but a sta0s0cal decision organ" 


  • H. Barlow

slide by Bernhard Schölkopf

slide-45
SLIDE 45

X

slide by Bernhard Schölkopf

slide-46
SLIDE 46

X

slide by Bernhard Schölkopf

slide-47
SLIDE 47
slide-48
SLIDE 48

slide by Bernhard Schölkopf

slide-49
SLIDE 49

What is machine learning?

slide-50
SLIDE 50

Example: Netflix Challenge

  • Goal: Predict how a viewer will rate a movie
  • 10% improvement = 1 million dollars

50

slide by Yaser Abu-Mostapha

slide-51
SLIDE 51

Example: Netflix Challenge

  • Goal: Predict how a viewer will rate a movie
  • 10% improvement = 1 million dollars
  • Essence of Machine Learning:
  • A pattern exists
  • We cannot pin it down mathematically
  • We have data on it

51

slide by Yaser Abu-Mostapha

slide-52
SLIDE 52

Watch out AlphaGo vs. Lee Sedol in March!

52

slide-53
SLIDE 53

Comparison

  • Traditional Programming
  • Machine Learning

53

Computer Data Program Output Computer Data Output Program

slide by Pedro Domingos, Tom Mitchel, Tom Dietterich

slide-54
SLIDE 54

What is Machine Learning?

  • [Arthur Samuel, 1959]
  • Field of study that gives computers
  • the ability to learn without being explicitly programmed
  • [Kevin Murphy] algorithms that
  • automatically detect patterns in data
  • use the uncovered patterns to predict future data or
  • ther outcomes of interest
  • [Tom Mitchell] algorithms that
  • improve their performance (P)
  • at some task (T)
  • with experience (E)

54

slide by Dhruv Batra

slide-55
SLIDE 55

What is Machine Learning?

  • If you are a Scientist
  • If you are an Engineer / Entrepreneur
  • Get lots of data
  • Machine Learning
  • ???
  • Profit!

55

Data Understanding Machine 
 Learning

slide by Dhruv Batra

slide-56
SLIDE 56

Why Study Machine Learning?


Engineering Better Computing Systems

  • Develop systems
  • too difficult/expensive to construct manually
  • because they require specific detailed skills/knowledge
  • knowledge engineering bottleneck
  • Develop systems
  • that adapt and customize themselves to individual users.
  • Personalized news or mail filter
  • Personalized tutoring
  • Discover new knowledge from large databases
  • Medical text mining (e.g. migraines to calcium channel

blockers to magnesium)

  • data mining

56

slide by Dhruv Batra

slide-57
SLIDE 57

Why Study Machine Learning?


Cognitive Science

  • Computational studies of learning may help

us understand learning in humans

  • and other biological organisms.
  • Hebbian neural learning
  • “Neurons that fire together, wire together.”

57

slide by Dhruv Batra

slide-58
SLIDE 58

Why Study Machine Learning?


The Time is Ripe

  • Algorithms
  • Many basic effective and efficient algorithms

available.

  • Data
  • Large amounts of on-line data available.
  • Computing
  • Large amounts of computational resources

available.

58

slide by Ray Mooney

slide-59
SLIDE 59

Where does ML fit in?

59

slide by Fei Sha

slide-60
SLIDE 60

A Brief History of AI

60

slide by Dhruv Batra

slide-61
SLIDE 61

61

adopted from Dhruv Batra

slide-62
SLIDE 62

AI Predictions: Experts

62

Image Credit: http://intelligence.org/files/PredictingAI.pdf slide by Dhruv Batra

slide-63
SLIDE 63

AI Predictions: Non-Experts

63

Image Credit: http://intelligence.org/files/PredictingAI.pdf slide by Dhruv Batra

slide-64
SLIDE 64

AI Predictions: Failed

64

Image Credit: http://intelligence.org/files/PredictingAI.pdf slide by Dhruv Batra

slide-65
SLIDE 65

Why is AI hard?

65

Image Credit: http://karpathy.github.io/2012/10/22/state-of-computer-vision/

slide by Dhruv Batra

slide-66
SLIDE 66

What humans see

66

slide by Larry Zitnick

slide-67
SLIDE 67

What computers see

67

243

239 240 225 206 185 188 218 211 206 216 225 242 239 218 110 67 31 34 152 213 206 208 221 243 242 123 58 94 82 132 77 108 208 208 215 235 217 115 212 243 236 247 139 91 209 208 211 233 208 131 222 219 226 196 114 74 208 213 214 232 217 131 116 77 150 69 56 52 201 228 223 232 232 182 186 184 179 159 123 93 232 235 235 232 236 201 154 216 133 129 81 175 252 241 240 235 238 230 128 172 138 65 63 234 249 241 245 237 236 247 143 59 78 10 94 255 248 247 251 234 237 245 193 55 33 115 144 213 255 253 251 248 245 161 128 149 109 138 65 47 156 239 255 190 107 39 102 94 73 114 58 17 7 51 137 23 32 33 148 168 203 179 43 27 17 12 8 17 26 12 160 255 255 109 22 26 19 35 24

slide by Larry Zitnick

slide-68
SLIDE 68

“I saw her duck”

68

Image Credit: Liang Huang

slide by Liang Huang

slide-69
SLIDE 69

“I saw her duck”

69

Image Credit: Liang Huang

slide by Liang Huang

slide-70
SLIDE 70

“I saw her duck”

70

Image Credit: Liang Huang

slide by Liang Huang

slide-71
SLIDE 71

We’ve come a long way… IBM Watson

  • What is Jeopardy?
  • http://youtu.be/Xqb66bdsQlw?t=53s
  • Challenge:
  • http://youtu.be/_429UIzN1JM
  • Watson Demo:
  • http://youtu.be/WFR3lOm_xhE?t=22s
  • Explanation
  • http://youtu.be/d_yXV22O6n4?t=4s
  • Future: Automated operator, doctor assistant,

finance

71

  • IBM Watson wins on Jeopardy (February 2011)
  • Watson provides cancer treatment options to

doctors in seconds (February 2013)

slide by Liang Huang

slide-72
SLIDE 72

Why are things working today?

  • More compute

power

  • More data
  • Better algorithms/

models

72

Figure Credit: Banko & Brill, 2011

Accuracy

Better

Amount of Training Data

slide by Dhruv Batra