Aykut Erdem
October 2016 Hacettepe University
Lecture 1:
−Course outline and logistics −What is Machine Learning
Lecture 1: Course outline and logistics What is Machine Learning - - PowerPoint PPT Presentation
Lecture 1: Course outline and logistics What is Machine Learning Aykut Erdem October 2016 Hacettepe University Todays Schedule Course outline and logistics An overview of Machine Learning 2 Course outline and logistics
Aykut Erdem
October 2016 Hacettepe University
−Course outline and logistics −What is Machine Learning
2
Aykut ERDEM (aykut@cs.hacettepe.edu.tr)
Aysun Kocak (aysunkocak@cs.hacettepe.edu.tr)
Burcak Asal (basal@cs.hacettepe.edu.tr)
Thu 11:00 - 11:50_D8
4
machine learning (ML)
⎯ A broad overview of many concepts and algorithms in ML.
⎯ Basic algorithms, data structures. ⎯ Basic probability and statistics. ⎯ Basic linear algebra and calculus ⎯ Good programming skills
Practicum
⎯ Students will gain skills to apply the concepts to real
world problems.
5
vector/matrix manipulations, partial derivatives common distributions, Bayes rule, mean/median/model
throughout the semester with lecture notes, programming and reading assignments and important deadlines. http://web.cs.hacettepe.edu.tr/~aykut/classes/ fall2016/bbm406/
and announcements. Please enroll the class on Piazza by following the link http://piazza.com/class#fall2016/bbm406
6
and Norvig. Prentice Hall, 2009
University Press, 2012. (online version available)
Press , 2010
2006
Press, 2012
7
⎯ a course project (done in pairs) (25%),
⎯ a midterm exam (30%), ⎯ a final exam (40%), and ⎯ class participation (5%)
⎯ a set of quizzes (20%), and
⎯ 3 assignments (done individually)
8
each
given real-world problem
9
10
solve the problem
Progress Report: 4-5 pages (Dec 12) (25%) Poster Presentation: (last week of classes) (20%) Final Report: (due at the beginning of poster session) (45%)
11
course project, however, can be done in pairs.
the given assignments, but these discussions should be carried out in an abstract way.
part, as your own will be considered as a violation of academic integrity.
material found on the web as everything on the web has been written by someone else.
12
http://www.plagiarism.org/plagiarism-101/prevention/
Overview of Machine Learning, Nearest Neighbor Classifier
Linear Regression, Least Squares
Machine Learning Methodology
Statistical Estimation: MLE, MAP , Naïve Bayes Classifier
Linear Classification Models: Logistic Regression, Linear Discriminant Functions, Perceptron
Neural Networks
Midterm Exam
13
Assg1 out Assg1 due, Assg2 out Course project proposal due Assg2 due Assg3 out
Deep Learning
Support Vector Machines (SVMs)
Multi-class SVM
Decision Tree Learning
Ensemble Methods: Bagging, Random Forests, Boosting
Clustering
Principle Component Analysis, Autoencoders
14
Project progress report due Assg3 due
would you start studying heavily?” –Answer: Machine Learning. –“The ultimate is computers that learn” –Bill Gates, Reddit AMA
–Tony Tether, Director, DARPA
–Jerry Yang, CEO, Yahoo
16
slide by David Sontag
17
Machine learning Deep learning
2015 Edition
2016 Edition
20
slide by Bernhard Schölkopf
(observations, measurements)
21
slide by Bernhard Schölkopf
(observations, measurements)
22
slide by Bernhard Schölkopf
y
× × × × ×
x
(observations, measurements)
23
slide by Bernhard Schölkopf
y
× × × × ×
y = a * x x
(observations, measurements)
24
slide by Bernhard Schölkopf
x y
× × × × ×
y = a * x
Leibniz, Weyl, Chaitin
× × × ×
(observations, measurements)
25
slide by Bernhard Schölkopf
x y
× × × × ×
y = ∑i ai k(x, xi)+b
Leibniz, Weyl, Chaitin
× × × ×
26
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
47
"The brain is nothing but a statistical decision organ"
slide by Bernhard Schölkopf
48
X
slide by Bernhard Schölkopf
X
slide by Bernhard Schölkopf
slide by Bernhard Schölkopf
reflected light = illumination * reflectance
53
slide by Bernhard Schölkopf
54
slide by Bernhard Schölkopf
— consider many factors simultaneously to find regularity — nonlinear; nonstationary, etc. — e.g. no mechanistic models for the data — processing requires computers and automatic inference methods
56
slide by Yaser Abu-Mostapha
57
slide by Yaser Abu-Mostapha
58
59
60
Computer Data Program Output Computer Data Output Program
slide by Pedro Domingos, Tom Mitchel, Tom Dietterich
61
slide by Dhruv Batra
62
Data Understanding Machine Learning
slide by Dhruv Batra
Engineering Better Computing Systems
blockers to magnesium)
63
slide by Dhruv Batra
Cognitive Science
64
slide by Dhruv Batra
The Time is Ripe
available.
available.
65
slide by Ray Mooney
66
slide by Fei Sha
67
slide by Dhruv Batra
68
adopted from Dhruv Batra
69
Image Credit: http://intelligence.org/files/PredictingAI.pdf slide by Dhruv Batra
70
Image Credit: http://intelligence.org/files/PredictingAI.pdf slide by Dhruv Batra
71
Image Credit: http://intelligence.org/files/PredictingAI.pdf slide by Dhruv Batra
72
Image Credit: http://karpathy.github.io/2012/10/22/state-of-computer-vision/
slide by Dhruv Batra
73
slide by Larry Zitnick
74
243 239 240 225 206 185 188 218 211 206 216 225 242 239 218 110 67 31 34 152 213 206 208 221 243 242 123 58 94 82 132 77 108 208 208 215 235 217 115 212 243 236 247 139 91 209 208 211 233 208 131 222 219 226 196 114 74 208 213 214 232 217 131 116 77 150 69 56 52 201 228 223 232 232 182 186 184 179 159 123 93 232 235 235 232 236 201 154 216 133 129 81 175 252 241 240 235 238 230 128 172 138 65 63 234 249 241 245 237 236 247 143 59 78 10 94 255 248 247 251 234 237 245 193 55 33 115 144 213 255 253 251 248 245 161 128 149 109 138 65 47 156 239 255 190 107 39 102 94 73 114 58 17 7 51 137 23 32 33 148 168 203 179 43 27 17 12 8 17 26 12 160 255 255 109 22 26 19 35 24slide by Larry Zitnick
75
Image Credit: Liang Huang
slide by Liang Huang
76
Image Credit: Liang Huang
slide by Liang Huang
77
Image Credit: Liang Huang
slide by Liang Huang
finance
78
doctors in seconds (February 2013)
slide by Liang Huang
power
models
79
Figure Credit: Banko & Brill, 2011
Accuracy
Better
Amount of Training Data
slide by Dhruv Batra
81
slide by Alex Smola
82
Amazon books Don’t mix preferences on Netflix!
slide by Alex Smola
83
Avatar learns from your behavior
Black & White Lionsgate Studios
slide by Alex Smola
84
https://www.youtube.com/watch?v=lleRKHsJBJ0
slide by Alex Smola
85
ham spam
slide by Alex Smola
86
segment image recognize handwriting
slide by Alex Smola
87
slide by Alex Smola
88
why these ads?
slide by Alex Smola
89
slide by Alex Smola
90
Given an audio waveform, robustly extract & recognize any spoken words
91
I need to hide a body noun, verb, preposition, …
92
Sudhakar et al., Multi-view Face Detection Using Deep Convolutional Neural Networks, 2015
93
Yang et al., From Facial Parts Responses to Face Detection: A Deep Learning Approach, ICCV 2015
94
slide by Eric Sudderth
95
trees skyscraper sky bell dome temple buildings sky
slide by Eric Sudderth
96
data
Learning
knowledge prior knowledge
slide by Stuart Russell
97
data
Learning
knowledge prior knowledge
slide by Stuart Russell
IF x THEN DO y
98
slide by Mehryar Mohri
99
general learning algorithms to
– deal with large-scale problems. – make accurate predictions (unseen examples). – handle a variety of different learning problems.
– what can be learned? Under what conditions? – what learning guarantees can be given? – what is the algorithmic complexity?
slide by Mehryar Mohri
represented as a vector, associated to an example (e.g., height and weight for gender prediction).
an object (e.g., positive or negative in binary classification); in regression real value.
algorithm (often labeled data).
100
slide by Mehryar Mohri
101
slide by Mehryar Mohri
slide by Alex Smola
Given x find y in {-1, 1}
Given x find y in {1, ... k}
Given x find y in R (or R
d
)
Given sequence x1 ... xl find y1 ... yl
Given x find a point in the hierarchy of y (e.g. a tree)
Given xt and yt-1 ... y1 find yt
103
l(y, f(x))
slide by Alex Smola
104
slide by Alex Smola
105
slide by Alex Smola
106
linear nonlinear
slide by Alex Smola
107
given sequence gene finding speech recognition activity segmentation named entities
slide by Alex Smola
108
webpages genes
slide by Alex Smola
109
tomorrow’s stock price
slide by Alex Smola
slide by Alex Smola
Find a set of prototypes representing the data
Find a subspace representing the data
Find a latent causal sequence for observations
Find (small) set of factors for observation
Find the odd one out
111
slide by Alex Smola
112
slide by Alex Smola
113
Variance component model to account for sample structure in genome-wide association studies, Nature Genetics 2010
slide by Alex Smola
114
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project, Nature 2007
slide by Alex Smola
115
slide by Alex Smola
116
find them automatically
slide by Alex Smola
117
typical atypical
slide by Alex Smola
118