Advanced ML in Google Cloud (2)
Abhay Agarwal (MS Design ‘19)
CS341: Project in Mining Massive Datasets
Advanced ML in Google Cloud (2) Abhay Agarwal (MS Design 19) Agenda - - PowerPoint PPT Presentation
CS341: Project in Mining Massive Datasets Advanced ML in Google Cloud (2) Abhay Agarwal (MS Design 19) Agenda Productizing analytics Data wrangling Data fundamentals Data studio vs datalab vs colab
Abhay Agarwal (MS Design ‘19)
CS341: Project in Mining Massive Datasets
production?
roughly sampled from real examples -- is it ready for production?
real examples, and my algorithm tests hypotheses that match the use cases -- is it ready for production?
6
7
Freshness Quality Structure Cost Quantity
8
9
Depth Brea dth
World Bank Development Indicators
10
Structured Unstructured Semi-structured
11
Moz.com
customer records
prejudice/stereotype
12
13
14
The Economist, August 20, 2016
But what about perpetuating bias against minorities?
15
16
to pull data
scripting, versioned scripts and models
long-running scripting