COMP 364: Computer Tools for Life Sciences
Intro to machine learning with scikit-learn Christopher J.F. Cameron and Carlos G. Oliver
1 / 1
COMP 364: Computer Tools for Life Sciences Intro to machine learning - - PowerPoint PPT Presentation
COMP 364: Computer Tools for Life Sciences Intro to machine learning with scikit-learn Christopher J.F. Cameron and Carlos G. Oliver 1 / 1 Key course information Assignment #4 available now due Monday, November 27th at 11:59:59 pm
1 / 1
◮ remaining concepts will be taught today and on Monday
◮ https://horizon.mcgill.ca/pban1/twbkwbis.P_
2 / 1
3 / 1
1
2 3
4
5
6
7
8
9
10
11
12
4 / 1
5 / 1
6 / 1
1
2 3
4
5
6
7
8
7 / 1
1
2
3
4
5
◮ i.e., the passenger would need to already be dead to have a
8 / 1
1
2
3
4
5
6
7
9 / 1
1
2
3
4
5
6
7
8
9
10
11
10 / 1
◮ ‘women and children first’
◮ were on average younger than male passengers ◮ paid more for their tickets ◮ were more likely to travel with families
11 / 1
◮ some passengers do not have a complete set of features ◮ ML algorithms have difficulty with missing data
◮ computers have an easier time interpreting numbers
◮ why would we want to limit the amount of features? ◮ overfitting 12 / 1
◮ increased error for testing data during evaluation
13 / 1
◮ rather than the intended labels
14 / 1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
15 / 1
1
2
3
4
5
1
16 / 1
1
2
3
4
5
6
7
8
9
10
11
12
13
17 / 1
18 / 1
1
2 3
4
5
6
7
8
9
10
11
12
13
19 / 1
20 / 1
1
2
21 / 1
22 / 1
1
2 3
4
5
6
7
23 / 1
24 / 1
◮ support vector machines (SVM)
◮ making predictions using a learned model
◮ true/false positive (TP/FP) rates ◮ error measures ◮ receiver operating characteristic (ROC) curves? 25 / 1