Machine Learning
George Konidaris gdk@cs.duke.edu
Machine Learning George Konidaris gdk@cs.duke.edu Spring 2016 - - PowerPoint PPT Presentation
Machine Learning George Konidaris gdk@cs.duke.edu Spring 2016 Machine Learning Subfield of AI concerned with learning from data . Broadly, using: Experience To Improve Performance On Some Task (Tom Mitchell, 1997)
George Konidaris gdk@cs.duke.edu
Subfield of AI concerned with learning from data.
vs
vs
Developing effective learning methods has proved difficult. Why bother?
Adaptive behavior
circumstances.
Depends on feedback available:
Input: X = {x1, …, xn} Y = {y1, …, yn}
Given x: y?
Input: X = {x1, …, xn}
structure of the data.
How can they vary?
Learning counterpart of planning.
π
R =
∞
γtrt π : S → A
Formal definition:
X = {x1, …, xn} Y = {y1, …, yn}
Decision function
f : X → Y X
i
err(f(xi), yi)
If the set of labels Y is discrete:
Y is real-valued:
Class of functions F, from which to find f.
if condition then class1 else class2
Minimize error measured on what?
Do not measure error on the data you train on!
Let’s assume:
How to make one?
X = {x1, …, xn} Y = {y1, …, yn}
take a max.
A B C L T F T 1 T T F 1 T F F 1 F T F 2 F T T 2 F T F 2 F F T 1 F F F 1
A B C L T F T 1 T T F 1 T F F 1 F T F 2 F T T 2 F T F 2 F F T 1 F F F 1
A B C L T F T 1 T T F 1 T F F 1 F T F 2 F T T 2 F T F 2 F F T 1 F F F 1
A B C L T F T 1 T T F 1 T F F 1 F T F 2 F T T 2 F T F 2 F F T 1 F F F 1
A B C L T F T 1 T T F 1 T F F 1 F T F 2 F T T 2 F T F 2 F F T 1 F F F 1
Key question:
label in a dataset?
I(A) = −f1 log2 f1 − f2 log2 f2 Gain(B) = I(A) − X
i
fiI(Bi)
What if the inputs are real-valued?
a > 3.1
b < 0.6?
What is the hypothesis class for a decision tree?