Computational Structures in Data Science
Lecture #1: Welcome to CS88!
UC Berkeley EECS Lecturer Michael Ball http://cs88.org August 26, 2020
Lecture #1: Welcome to CS88! UC Berkeley EECS Lecturer Michael - - PowerPoint PPT Presentation
Computational Structures in Data Science Lecture #1: Welcome to CS88! UC Berkeley EECS Lecturer Michael Ball August 26, 2020 http://cs88.org In The News For Quick Coronavirus Testing, Israel Turns to Clever Algorithm The New York Times
UC Berkeley EECS Lecturer Michael Ball http://cs88.org August 26, 2020
2
8/26/2020 UCB CS88 Fa20 L1
For Quick Coronavirus Testing, Israel Turns to Clever Algorithm The New York Times https://www.nytimes.com/202 0/08/21/health/fast- coronavirus-testing- israel.html Pooled testing is more efficient, but requires a lot of duplicate testing when positive results are found. This approach splits a sample into multiple pools, which are tested together → Fewer “retests” are done. Based on “error correcting codes”, a subject in computer science!
3
UCB CS88 Fa20 L1
– ball@Berkeley.edu – You’re best off by using Ed! J – 625 Soda Hall / Berkeley.zoom.us / my apartment – http://michaelball.co – I don’t update this much… » It was great procrastination when I was a CS student. – Office hours: tentatively Tuesday early afternoon. – A few minutes after class
– Intro CS Research » Tools, curriculum – Training TAs – Building Educational Software (Gradescope) – Tools for web accessibility
4
8/26/2020 UCB CS88 Fa20 L1
5
8/26/2020 UCB CS88 Fa20 L1
– Maps – Ants vs Some Bees
– Same as cs61a
6
8/26/2020 UCB CS88 Fa20 L1
7
UCB CS88 Fa20 L1
8
UCB CS88 Fa20 L1
9
UCB CS88 Fa20 L1
– Start early! ”Checkpoint” assignments
– Use up to 3 on any assignment – We apply the in the order that’s most beneficial! » i.e. use them on projects if you need! – Can be used for homework, labs, projects, but not project checkpoints.
10
UCB CS88 Fa20 L1
Data Science growing organically everywhere
! ! ! !
Feb!15,!2013!
AMP!Lab!
Ion!Stoica,!CS! Michael!Franklin,!CS! Adam!Arkin,! Bioengineering! Emmanuel!Saez,!Economics!
Reconstruc=ng!the!movies! in!your!mind!
Bin!Yu,!Sta=s=cs! Jack!Gallant,!Neuroscience!
Earthquake Strong Shaking in11seconds
Richard!Allen!! Earth&!Plan.! Science! Geospa=al!Lab! Fernando!Perez,!! Brain!Imaging!Center! iPython!tools!and!community! Charles!Marshall! Rosie!Gillespie! Integra=ve!Biology! Digi=zed!Museum! !
Nearly every field of discovery is transitioning from “data poor” to “data rich”
Astronomy:*LSST* Physics:*LHC* Oceanography:*OOI* Sociology:*The*Web* Biology:*Sequencing* Economics:*POS* terminals* Neuroscience:*EEG,*fMRI*
6*
11
UCB CS88 Fa20 L1
5/24/18 21st Century
8
Increasingly US jobs require data science and analytics skills. Can we meet the demand? The current shortage of skills in the national job pool demonstrates that business-as- usual strategies won’t satisfy the growing need. If we are to unlock the promise and potential of data and all the technologies that depend
have to transform. By 2021, 69% of employers expect candidates with DSA skills to get preference for jobs in their organizations. Only 23% of college and university leaders say their graduates will have those skills.
13
UCB CS88 Fa20 L1
1/25/16 UCB CS88 Sp16 L1
11
Connected Industrial Revolution Age of Enlighte nment World
15
5/24/18 21st Century
16
5/24/18 21st Century
17
5/24/18 21st Century
18
1969
2.0 B 1/26/11
1974
RFC 675 TCP/IP
WWW ARPANet Internet HTTP 0.9
1990 2010
Eng21
3.0 B 11/15
8/26/2020 UCB CS88 Fa20 L1
– Authoring computational documents – Tables – Within Python3 and “SciPy”
19
8/26/2020 UCB CS88 Fa20 L1
– Hands-on experience => Foundational Concept – How would you create what you use in c8 ?
– What is involved in interpreting the code you write ? – Deeper CS Concepts: Recursion, Objects, Classes, Higher-
– Managing complexity in creating larger software systems through composition
20
8/26/2020 UCB CS88 Fa20 L1
21
Intro Programming & Tools CS Concepts and Techniques Interpretation CS61A Intro Programming Statistics Concepts in a Computational Approach Thinking w/ Data DATA8 CS Concepts and Techniques CS88 & Tools Working w/ Data Units
8/26/2020 UCB CS88 Fa20 L1
22
1/25/16 UCB CS88 Sp16 L1
CS minor CS major
***
19
Mathematics Data 8: Foundations of Data Science Data 100: Principles & Techniques of Data Science Computing College Breadth & Electives Probability Computational & Inferential Depth Domain Emphasis Human Contexts & Ethics Domain Emphasis Electives Foundational Lower Division Individualized Upper Division 30 units Modeling, Learning & Decision Making
24
8/26/2020 UCB CS88 Fa20 L1
25
8/26/2020 UCB CS88 Fa20 L1
26
8/26/2020 UCB CS88 Fa20 L1
28
UCB CS88 Fa20 L1
29
UCB CS88 Fa20 L1
– Rewards good behavior – Effort
» E.g., Office hours, doing every single lab, hw, reading Ed posts
– Participation
» E.g., Raising hand in lec or discussion, asking questions
– Altruism
» E.g., helping other students in lab, answering questions on Ed
30
01/28/19 UCB CS88 Sp19 L1
– https://www.gradescope.com/courses/157733/assignments/621918/submissions/n ew
– https://us.edstem.org/courses/2362/discussion/111922
– Fill out the intro survey
– Signup Genius form for lab times 39
8/26/2020 UCB CS88 Fa20 L1