MACHINE LEARNING
Liviu Ciortuz Department of CS, University of Ia¸ si, Romˆ ania
0.
MACHINE LEARNING Liviu Ciortuz Department of CS, University of Ia - - PowerPoint PPT Presentation
0. MACHINE LEARNING Liviu Ciortuz Department of CS, University of Ia si, Rom ania 1. What is Machine Learning? ML studies algorithms that improve with experience. learn from Tom Mitchells Definition of the [
0.
Tom Mitchell’s Definition of the [general] learning problem: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance
1.
2.
3.
Intelligence Artificial (concept learning) Algorithms Mathematics Statistics (model fitting) Machine Learning Learning Statistical Pattern Recognition Mining Data Engineering Database Systems (Knowledge Discovery in Databases)
4.
utze, ch. 2)
[and the relationship with Logistic Regression]
utze, ch. 14)
5.
Tentative Plan
utze, ch. 2)
(see Estimating Probabilities, additional chapter to T. Mitchell’s book, 2016)
utze, ch. 9)
6.
¸ii de ˆ ınv˘ at ¸are automat˘ a”
ad˘ ar˘ au. Ia¸ si, Romania, 2020 www.info.uaic.ro/∼ciortuz/ML.ex-book/book.pdf
Tom Mitchell. McGraw-Hill, 1997
Trevor Hastie, Robert Tibshirani, Jerome Friedman. Springer, 2nd ed. 2009
Kevin Murphy, MIT Press, 2012
Christopher Bishop. Springer, 2006
Christopher Manning, Hinrich Sch¨
7.
“We are drawning in information but starved for knowledge.”
John Naisbitt, “Megatrends” book, 1982
8.
concept c ⊆ X, or c : X → {0, 1} example (labeled instance): x, c(x); positive examples, neg. examples
hypotheses representation language hypotheses set H hypotheses consistent with the concept c: h(x) = c(x), ∀ example x, c(x) version space
supervised learning (classification), unsupervised learning (clustering)
training error, test error accuracy, precision, recall
n-fold cross-validation, leave-one-out cross-validation
9.
10.
Consider
after training on data Dc.
The inductive bias of L is any minimal set of assertions B such that (∀xi ∈ X)[(B ∨ Dc ∨ xi) ⊢ L(xi, Dc)] for any target concept c and corresponding training examples Dc. (A ⊢ B means A logically entails B)
11.
Inductive systems can be modelled by equivalent deductive systems
12.
tp − true positives fp − false positives tn − true negatives fn − false negatives accuracy: Acc = tp + tn tp + tn + fp + fn precision: P = tp tp + fp recall (or: sensitivity): R = tp tp + fn F-measure: F = 2 P × R P+R specificity: Sp = tn tn + fp follout: = fp tn + fp Mathew’s Correlation Coefficient: MCC = tp × tn − fp × fn
13.
Eager: generalize before seeing query
works, . . .
Lazy: wait for query before generalizing
soning
Does it matter? If they use the same hypothesis space H, lazy learners can represent more complex functions. E.g., a lazy Backpropagation algorithm can learn a NN which is dif- ferent for each query point, compared to the eager version of Back- propagation.
14.
si, Romania, 1985 PhD in CS from Universit´ e de Lille, France, 1996
Bac˘ au, Romania (1985-1987)
Germany (DFKI, Saarbr¨ ucken, 1997-2001), UK (Univ. of York and Univ. of Aberystwyth, 2001-2003), France (INRIA, Rennes, 2012-2013)
15.
Reinhard Wilhelm, quoted by Cristian Calude, in The Human Face of Computing, Imperial College Press, 2016
16.
17.
Cedric Villani, winner of the Fields prize, 2010
xxx
18.
19.
https://profs.info.uaic.ro/∼ancai/ML/
alinescu (. . . Probabilities and Statistics) https://profs.info.uaic.ro/∼adrian.zalinescu/ML.html
www.seminarul.ml
¸tefan Pant ¸iru (MSc)
¸tefan Matcovici (MSc)
20.
Minim: 2p Minim: 2p Minim: 1.25p Minim: 1.25p Prezenta la seminar: obligatorie! Penalizare: 0.2p pentru fiecare absenta de la a doua incolo! Pentru promovare: Nota >= 4.5 T2 S2 S1 T1
Nota = (8 + S1 + S2 + T1 + T2) / 4 <=> S1 + S2 + T1 + T2 >= 10 Test: 6p Test: 6p Seminar: 10p Seminar: 10p Prezenta la curs: recomandata!
21.
Regulile de organizare a cursului de ˆ Inv˘ at ¸are Automat˘ a (engl., Machine Learning, ML),
ın fi¸ sa disciplinei http://profs.info.uaic.ro/∼ciortuz/fisa-disciplinei.RO.pdf
a: vezi slide #6
apt˘ amˆ an˘ a (curs + seminar): http://profs.info.uaic.ro/∼ciortuz/what-you-should-know.pdf
¸a la curs: recomandat˘ a!
¸a la seminar: obligatorie! Pentru fiecare absent ¸˘ a la seminar, ˆ ıncepˆ and de la a doua absent ¸˘ a ˆ ıncolo, se aplic˘ a o penalizare/depunctare de 0.1 puncte din S1, respectiv din S2. (Vezi formula de notare.) Regulile se aplic˘ a inclusiv student ¸ilor reˆ ınmatriculat ¸i.
apt˘ amˆ anal se t ¸inem un seminar suplimentar, destinat pentru acei student ¸i care sunt foarte interesat ¸i de acest domeniu. (Vedet ¸i sect ¸iunile “Advanced issues” din documentul http://profs.info.uaic.ro/∼ciortuz/what-you-should-know.pdf.) 22.
Regula 1: Pentru seminarii, nu se admit mut˘
ari ale student ¸ilor de la o grup˘ a la alta, decˆ at ˆ ın cadrul grupelor care au acela¸ si asistent / profesor responsabil de seminar.
Regula 2: Nu se fac echival˘
ari de punctaje pentru student ¸ii care nu au promovat cursul ˆ ın anii precedent ¸i.
Regula 3: Profesorul responsabil pentru acest curs, Liviu Ciortuz,
NU va r˘ aspunde la email-uri care pun ˆ ıntreb˘ ari pentru care raspunsul a fost deja dat – fie ˆ ın aceste slide-uri, – fie pe site-ul Piazza dedicat acestui curs: https://piazza.com/info.uaic.ro/fall2020/fiiml2020f/home, (care este ment ¸ionat ¸ si pe pagina profesorului: https://profs.info.uaic.ro/∼ciortuz/) – fie la curs.
Recomandare important˘ a (1) La fiecare curs ¸
si seminar, student ¸ii vor avea culegerea de Exercit ¸ii de ˆ ınv˘ at ¸are automat˘ a (de L. Ciortuz et al) — v˘ a recomand˘ am s˘ a imprimat ¸i capitolele Clasificare bayesian˘ a, ˆ Inv˘ at ¸are bazat˘ a pe memorare, Arbori de decizie ¸ si Clusterizare — ¸ si eventual slide-urile pe care le-am indicat ˆ ın slide-ul precedent.
Recomandare important˘ a (2) Consultat
¸i s˘ apt˘ amˆ anal documentul http://profs.info.uaic.ro/∼ciortuz/what-you-should-know.pdf 23.
REGULI generale pentru cursul de ˆ Inv˘ at ¸are automat˘ a (cont.) de la licent ¸˘ a
ın aceast˘ a ordine ¸ si, de preferat, COLOR): http://profs.info.uaic.ro/∼ciortuz/SLIDES/foundations.pdf https://profs.info.uaic.ro/∼ciortuz/ML.ex-book/SLIDES/ML.ex-book.SLIDES.ProbStat.pdf https://profs.info.uaic.ro/∼ciortuz/ML.ex-book/SLIDES/ML.ex-book.SLIDES.DT.pdf https://profs.info.uaic.ro/∼ciortuz/ML.ex-book/SLIDES/ML.ex-book.SLIDES.Bayes.pdf https://profs.info.uaic.ro/∼ciortuz/ML.ex-book/SLIDES/ML.ex-book.SLIDES.IBL.pdf https://profs.info.uaic.ro/∼ciortuz/ML.ex-book/SLIDES/ML.ex-book.SLIDES.Cluster.pdf (Atent ¸ie: acest set de slide-uri poate fi actualizat pe parcursul semestrului!)
http://profs.info.uaic.ro/∼ciortuz/SLIDES/ml0.pdf http://profs.info.uaic.ro/∼ciortuz/SLIDES/ml3.pdf http://profs.info.uaic.ro/∼ciortuz/SLIDES/ml6.pdf http://profs.info.uaic.ro/∼ciortuz/SLIDES/ml8.pdf http://profs.info.uaic.ro/∼ciortuz/SLIDES/cluster.pdf
24.