can i add this class
play

Can I add this class? Lulu Liang (ll882) is handling the waiting - PowerPoint PPT Presentation

Can I add this class? Lulu Liang (ll882) is handling the waiting list. We expect all majors and minors to be able to enroll. INFO 2950: Intro to Data Science Prof. David Mimno Thank you for your interest, but... This class is required for


  1. Can I add this class? Lulu Liang (ll882) is handling the waiting list. We expect all majors and minors to be able to enroll.

  2. INFO 2950: Intro to Data Science Prof. David Mimno

  3. Thank you for your interest, but... This class is required for InfoSci majors and minors. If you do not need it, please consider other options.

  4. Where to fjnd things ● Course website: http://mimno.infosci.cornell.edu/info2950 ● Question answering: https://campuswire.com/c/G7E579AA4 (code 3402) ● Assignments: CMS (enrollment will sync every 24 hrs)

  5. Textbooks VanderPlas, Python Data Science Handbook James, Witten, Hastie, Tibshirani, An introduction to statistical learning Both are free, links from course website

  6. The wheat is stored... The information is stored... The data is stored...

  7. Statistics (20th century version) Experiments are designed Computation is hard Data is expensive Goal is causation Wikipedia, Fisher; Gosset

  8. Data Science (21st century) Observations are gathered opportunistically Computation is cheap Data is abundant Goal is prediction linksys.com

  9. Drew Conway's Venn diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram

  10. Data science pattern 1. Map real-world entities to a computational representation 2. Perform mathematical operations on those representations 3. Interpret results of those operations

  11. Data science pattern 1. Map real-world entities to a computational representation 2. Perform mathematical operations on those representations 3. Interpret results of those operations 4. [go to step 1]

  12. Math questions What representations are good for supporting mathematical operations? How can we create accurate mathematical models of real-world events? How can we convince ourselves and others that this isn't just randomness?

  13. The math is the easy part ● Is the data reliable and complete? ● Are we answering the right question? ● How can we balance between what is useful and what is easily available? ● Will anyone believe that we have the right answer? Should they? Wikipedia "Town hall meeting"

  14. Live experiment! Find a study group https://forms.gle/NCZ6CSMB6qiiasfUA

  15. Where to fjnd things ● Course website: http://mimno.infosci.cornell.edu/info2950 ● Question answering: https://campuswire.com/c/G7E579AA4 (code 3402) ● Assignments: CMS (enrollment will sync every 24 hrs)

  16. Weekly pattern Monday Tuesday Wednesday Thursday Friday Mimno offjce Presentation Presentation Lab sessions: hours, of new of new practice and 1:30-3:30 material material; discuss Gates 205 Homework due 11:59pm

  17. For Friday: Install Python 3 ● Anaconda is the easiest, most reliable installation: https://anaconda.com/download ● NO PYTHON 2. To check: type print "hello" with no ○ (parentheses). You should get an error. We will work in notebooks, scripts, and the command line ( >>> )

  18. RIP Python 2 Wikipedia, "Headstone"

  19. How to do well in this class Show up Don't just read, test yourself Start early Snacks! Healthy sleep

  20. Can I add this class? Lulu Liang (ll882) is handling the waiting list. We expect all majors and minors to be able to enroll.

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend