comp 135 introduction to machine learning
play

COMP 135 Introduction to Machine Learning First day of class - PowerPoint PPT Presentation

COMP 135 Introduction to Machine Learning First day of class Spring 2019 https://www.cs.tufts.edu/comp/135/2019s/ Many slides attributable to: Emily Fox (UW), Finale Doshi-Velez (Harvard), Erik Sudderth (UCI), & Liping Liu (Tufts) 1


  1. COMP 135 Introduction to Machine Learning First day of class Spring 2019 https://www.cs.tufts.edu/comp/135/2019s/ Many slides attributable to: Emily Fox (UW), Finale Doshi-Velez (Harvard), Erik Sudderth (UCI), & Liping Liu (Tufts) 1

  2. Why Machine Learning? Mike Hughes - Tufts COMP 135 - Spring 2019 2

  3. Image Credit: Emily Fox Mike Hughes - Tufts COMP 135 - Spring 2019 3

  4. Artificial Intelligence (AI) • Study of “intelligent systems”, with many parts: logic, planning, search, probabilistic reasoning, learning from experience, interacting with other agents, etc. Mike Hughes - Tufts COMP 135 - Spring 2019 4

  5. Machine Learning (ML) • Study of algorithms that learn from experience/data to perform a task • Task output: a prediction or a decision Mike Hughes - Tufts COMP 135 - Spring 2019 5

  6. The Machine Learning Process Past data Performance Task / Goal (or “experience”) measure input data prediction / decision (now) (now) Mike Hughes - Tufts COMP 135 - Spring 2019 6

  7. Why take this course? Mike Hughes - Tufts COMP 135 - Spring 2019 7

  8. Why take this course? Our goal is to prepare students to effectively apply machine learning methods to problems that might arise in "the real world" -- in industry, medicine, education, and beyond. Gain skills and understanding for a future as: • Developer using ML “out-of-the-box” • ML methods researcher Mike Hughes - Tufts COMP 135 - Spring 2019 8

  9. Why take this course? Students will be able to: - Think systematically - Compare/contrast each method’s strengths & limitations - Deploy rapidly - Hands-on experience with open-source libraries - Evaluate carefully - Design experiments with task-appropriate performance metrics and strong baselines - Report uncertainty in performance Mike Hughes - Tufts COMP 135 - Spring 2019 9

  10. What skills will we learn? Mike Hughes - Tufts COMP 135 - Spring 2019 10

  11. What will we learn? Supervised Learning Data, Label Pairs Performance { x n , y n } N measure Task n =1 Unsupervised Learning data label x y Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 11

  12. Task: Regression y is a continuous variable Supervised e.g. sales in $$ Learning regression y Unsupervised Learning Reinforcement x Learning Mike Hughes - Tufts COMP 135 - Spring 2019 12

  13. Regression Example: Uber Supervised Learning regression Unsupervised Learning Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 13

  14. Regression Example: Uber Mike Hughes - Tufts COMP 135 - Spring 2019 14

  15. Regression Example: Uber Mike Hughes - Tufts COMP 135 - Spring 2019 15

  16. Task: Classification y is a discrete variable Supervised (red or blue) Learning classification x 2 Unsupervised Learning Reinforcement Learning x 1 Mike Hughes - Tufts COMP 135 - Spring 2019 16

  17. Classification Example: Swype Mike Hughes - Tufts COMP 135 - Spring 2019 17

  18. What will we learn? Supervised Learning Data, Label Pairs Performance { x n , y n } N measure Task n =1 Unsupervised Learning data label x y Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 18

  19. What will we learn? Supervised Learning Data Examples Performance { x n } N measure Task n =1 Unsupervised Learning summary data of x x Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 19

  20. Task: Clustering Supervised Learning x 2 Unsupervised Learning clustering x 1 Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 20

  21. Clustering Example: News Mike Hughes - Tufts COMP 135 - Spring 2019 21

  22. Task: Embedding Supervised Learning x 2 Unsupervised Learning embedding Reinforcement x 1 Learning Mike Hughes - Tufts COMP 135 - Spring 2019 22

  23. Example: Genes vs. geography Mike Hughes - Tufts COMP 135 - Spring 2019 23

  24. Task: Recommendation Supervised Learning recommendation Unsupervised Learning Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 24

  25. Recommendation Example Mike Hughes - Tufts COMP 135 - Spring 2019 25

  26. What will we learn? Histories of states, Supervised actions, rewards Learning Performance { s t , a t , r t , s t +1 , . . . } N n =1 measure Task Unsupervised Learning Recent Next history action a Reinforcement Learning Mike Hughes - Tufts COMP 135 - Spring 2019 26

  27. RL example: Pancake robot Mike Hughes - Tufts COMP 135 - Spring 2019 27

  28. What will we learn this semester? If I want more? Supervised ~ 10 weeks Take COMP 136 Learning 6 homeworks 3 projects Take COMP 150 – Deep Learning Unsupervised ~ 2 weeks Take COMP 136 1.5 homeworks Learning Reinforcement ~ 1 week Take COMP 150 – RL (Prof. Jivko Sinapov) 0.5 homeworks Learning Mike Hughes - Tufts COMP 135 - Spring 2019 28

  29. What we won’t cover • Active learning • Transfer learning • Semi-supervised learning • Learning theory • ….. lots more Mike Hughes - Tufts COMP 135 - Spring 2019 29

  30. Who is teaching? • Instructor: Prof. Mike Hughes • TA Staff • Mike Pietras • Rui Chen • Minh Nguyen • Duc Nguyen • Wayne Tang Mike Hughes - Tufts COMP 135 - Spring 2019 30

  31. Problem: When will ICU patient need intervention ? Ghassemi, Wu, Hughes , et al. AMIA CRI 2017 Interventions: mechanical ventilation • blood pressure drugs • Early prediction helps: prepare patient plan staffing try less aggressive options early

  32. Cohort from MIMIC-III dataset mimic.physionet.org 36,050 patients (Johnson et al. Sci. Data 2016) • from Beth-Israel Deaconess in Boston 2001-2012 • kept all adults with record within 6-360 hours

  33. Observed data 7 nurse-validated vital signs (hourly) heart rate, blood pressure, temp., SpO2, … 11 lab measurements (much less than hourly) hematocrit, lactate, … each channel standardized to mean=0, var=1 with carry-and-hold for missing data

  34. Task: predict need in advance +2 hrs ahead Mike Hughes - Tufts COMP 135 - Spring 2019 34

  35. Vasopressor prediction : 1 hr ahead Area-under-ROC static demographics (age, race, etc) “embedding” using time-series model dynamic patient vitals at time t Mike Hughes - Tufts COMP 135 - Spring 2019 35

  36. Key Stakeholder Questions • How should we fill in missing values? • How to deal with imbalance data? • most patients never get drug X • How to deal with imbalanced mistake costs? • remove ventilator too early ends life, too late costs $$ • How uncertain are predictions? • How will this generalize to a new hospital? We can start to answer many of these in COMP 135 Mike Hughes - Tufts COMP 135 - Spring 2019 36

  37. Logistics • Course website (take a tour) • https://www.cs.tufts.edu/comp/135/2019s/ • Discussions on Piazza • Lectures every Mon & Wed • Recitation Sessions with TAs every Mon • Deliverables: 2 exams, 8 homeworks, 3 projects • Collaboration Policy Mike Hughes - Tufts COMP 135 - Spring 2019 37

  38. Let’s Get Started! • Setup your Python environment ASAP • First recitation is Mon 1/21 • Get help with your Python env. • Learn basics of arrays, plotting, etc • HW0 due NEXT WEEK (Wed 1/23 at 11:59pm) Mike Hughes - Tufts COMP 135 - Spring 2019 38

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend