Preparing for Collaborative Data Driven Projects December 9, 2016 - - PowerPoint PPT Presentation

preparing for collaborative data driven projects
SMART_READER_LITE
LIVE PREVIEW

Preparing for Collaborative Data Driven Projects December 9, 2016 - - PowerPoint PPT Presentation

Preparing for Collaborative Data Driven Projects December 9, 2016 Lauren Hareem Erika Haynes Naveed Salomon Center for Data Science and Public Policy dsapp.uchicago.edu @datascifellows 1 University of Chicago Agenda 5 Minutes


slide-1
SLIDE 1

Preparing for Collaborative Data Driven Projects

December 9, 2016

1 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows Lauren Haynes Erika Salomon Hareem Naveed

slide-2
SLIDE 2

Agenda

  • 5 Minutes Introductions
  • 20 Minutes Intro to Project Scoping (DSaPP)
  • 20 Minutes Intro to Data Maturity (DSaPP)
  • 20 minutes Intro to Data Governance (McClean) - How

to get legal agreements, stakeholder buy in

  • 20 Minutes Q&A
slide-3
SLIDE 3

38 projects

slide-4
SLIDE 4
slide-5
SLIDE 5

“We are used to using data to justify funding decisions. Now we can use data to improve what we do.”

5 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-6
SLIDE 6

6 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-7
SLIDE 7

This May Sound Daunting ...

7 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-8
SLIDE 8

But Most Common Machine Learning Tasks...

Regression Using trends to predict outcomes Clustering Finding existing groups or categories Classification Labeling and sorting into groups

Dimensionality Reduction

Dimension Reduction Finding important predictors

8 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-9
SLIDE 9

… You Actually Learned In Kindergarten

Regression Using trends to predict outcomes Clustering

Finding existing groups or categories

Dimension Reduction Finding important predictors Classification

Labeling and sorting into groups

9 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-10
SLIDE 10

Most Projects Fall in a Few Categories

  • Early warning & intervention
  • Efficient resource allocation & targeted action
  • Effective advocacy & fundraising
  • Data-driven policy recommendation & evaluation

10 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-11
SLIDE 11

Data are People

11 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-12
SLIDE 12

“Predictive analytics is emerging as a game-changer. Instead of looking backward to analyze “what happened?” predictive analytics help executives answer “What’s next?” and “What should we do about it?”

Forbes Magazine Why Predictive Analytics Is A Game-Changer 12 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-13
SLIDE 13

Project Scoping Workshop

dsapp.uchicago.edu 13 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-14
SLIDE 14

“We are used to using data to justify funding decisions. Now we can use data to improve what we do.”

14 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-15
SLIDE 15

Scoping a Good Project is Easier Said than Done

15 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-16
SLIDE 16

What Makes a Good Project

  • A solvable problem.
  • A challenging problem.
  • An important problem with social impact.
  • A motivated, capable, and committed partner.

a. Domain/business resources b. Data understanding resources c. Commitment to implementation

  • Appropriate, relevant, available data.

16 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-17
SLIDE 17

17 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-18
SLIDE 18

Relevance and Sufficiency

GOAL

Relevant but Insufficient Relevant and Sufficient Irrelevant and Insufficient

18 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-19
SLIDE 19

19 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-20
SLIDE 20

Data Analytics: Problem Formulation

  • Identify Goals
  • Identify Actions you can take to achieve those goals

– Break down actions into fine-grained questions/subactions

  • Identify Data Sources Resources you need and have
  • Identify Analysis/Modeling that needs to be done

20 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-21
SLIDE 21

Workshop

  • Scope an analytics-driven project for a problem your organization is facing

– Identify goal(s) – Actions (persuasion for example) – Data sources (data you have, data you need to collect, relationships to get that data) – Models

  • Who? (to target for each action)
  • What? (to say to them)
  • How? (to use different communication channels)

21 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-22
SLIDE 22

Workshop: Identify Actions that can be taken to achieve the goal

  • What programs do I have access to? What would they do differently if they

had more information/knew where their interventions were most likely to be effective

  • Be sure to name out which individuals are taking the action

22 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-23
SLIDE 23

Workshop: Data Sources

  • Data sources (data you have, data you need to collect, relationships to get that

data)

23 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-24
SLIDE 24

Workshop : Changing Actions

  • Who? (to target for each action)
  • What? (to say to them)
  • How? (to use different communication channels)

24 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-25
SLIDE 25

25 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

Partner Data Maturity Framework

slide-26
SLIDE 26

Data Maturity Framework

26 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-27
SLIDE 27

27 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-28
SLIDE 28

How to Arrive at Data Nirvana

  • Commit to data-driven decisionmaking.

○ Data as a first class citizen ○ Ready for tough love from data ○ Willing to take data risks

  • Understand what data supports the mission and how.

○ Has the data it needs or can obtain it ○ Technical and organizational capacity ○ Outcome linked to action

28 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-29
SLIDE 29

Data Readiness

29 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-30
SLIDE 30

Relevance and Sufficiency

GOAL

Relevant but Insufficient Relevant and Sufficient Irrelevant and Insufficient

30 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-31
SLIDE 31

Data Capture

population with data population of interest

31 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-32
SLIDE 32

Data Storage / Format

.csv .json .xml

API

High cost of use Low cost of use

32 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-33
SLIDE 33

Data Quality

33 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-34
SLIDE 34

Integration

34 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-35
SLIDE 35

Accessibility

35 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-36
SLIDE 36

Documentation

DATA

36 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-37
SLIDE 37

Organizational Readiness

37 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-38
SLIDE 38

Leadership Buy In

38 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-39
SLIDE 39

Individual Buy In

39 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-40
SLIDE 40

Stakeholder Buy In

40 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-41
SLIDE 41

Thank You! http://dssg.uchicago.edu/ http://dsapp.uchicago.edu/ DSSG 2017 Project Partner Applications Due 1/31/17!

Contact Us lnhaynes@uchicago.edu

41 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-42
SLIDE 42

Case Studies

42 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-43
SLIDE 43

43 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-44
SLIDE 44

44 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-45
SLIDE 45

Impaired Attention Lack of Motor Skills Learning Disability Memory Problems Hearing Loss Lower IQ

LEAD

45 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-46
SLIDE 46

Constituents Elected Representatives

slide-47
SLIDE 47

47 Center for Data Science and Public Policy University of Chicago dsapp.uchicago.edu @datascifellows

slide-48
SLIDE 48

Personal features and location

slide-49
SLIDE 49