what is a computational biologist doing at the New York Times? - - PowerPoint PPT Presentation

what is a computational biologist doing at the new york
SMART_READER_LITE
LIVE PREVIEW

what is a computational biologist doing at the New York Times? - - PowerPoint PPT Presentation

what is a computational biologist doing at the New York Times? (and what can academia do for a 163-year old company?) chris.wiggins@columbia.edu chris.wiggins@nytimes.com chris.wiggins@hackNY.org @chrishwiggins context/background


slide-1
SLIDE 1

what is a computational biologist doing at the New York Times?

  • (and what can academia do for a

163-year old company?)

chris.wiggins@columbia.edu chris.wiggins@nytimes.com chris.wiggins@hackNY.org @chrishwiggins

slide-2
SLIDE 2

context/background

slide-3
SLIDE 3

context/background

(before ‘the talk’)

slide-4
SLIDE 4

biology: 1892 vs. 1995 biology changed for good.

slide-5
SLIDE 5

genetics: 1837 vs. 2012 from “segments” to algorithms

slide-6
SLIDE 6

genetics: 1837 vs. 2012 from intuition to prediction

slide-7
SLIDE 7

data science: web scale

slide-8
SLIDE 8

example: 163 yr old

slide-9
SLIDE 9

bit.ly/nyt-interactive-2013

slide-10
SLIDE 10

R+D: nytlabs.com

slide-11
SLIDE 11

developer.nytimes.com: 2008

slide-12
SLIDE 12
slide-13
SLIDE 13
slide-14
SLIDE 14
slide-15
SLIDE 15
slide-16
SLIDE 16

example: millions of views per hour

slide-17
SLIDE 17

from “segments” to algorithms

insert figure here

slide-18
SLIDE 18

from intuition to prediction

insert figure here

slide-19
SLIDE 19

data science: the web

slide-20
SLIDE 20

data science: the web is your “online presence”

slide-21
SLIDE 21

data science: the web is a microscope

slide-22
SLIDE 22

data science: the web is an experimental tool

slide-23
SLIDE 23

data science: the web is an optimization tool

slide-24
SLIDE 24

</header>

slide-25
SLIDE 25

</header>

i.e., <body>

slide-26
SLIDE 26

common requirements in data science:

slide-27
SLIDE 27

common requirements in data science:

  • 1. practices
  • 2. skills
  • 3. culture
slide-28
SLIDE 28

common requirements in data science:

  • 1. practices
  • 2. skills
  • 3. culture
slide-29
SLIDE 29

common requirements in data science:

  • 1. practices
  • 2. skills
  • 3. culture
slide-30
SLIDE 30

data science: practice

slide-31
SLIDE 31

data science: practice

  • reframe domain questions

as machine learning tasks

slide-32
SLIDE 32

data science: practice

  • better wrong than "nice"
slide-33
SLIDE 33

data science: practice

  • be relevant
slide-34
SLIDE 34

data science: practice

  • be relevant
slide-35
SLIDE 35

data science: practice

  • be relevant
slide-36
SLIDE 36

data science: practice

  • hypotheses are not data jeopardy
slide-37
SLIDE 37

data science: practice

  • befriend experimentalists
slide-38
SLIDE 38

data science: practice

  • befriend experimentalists
slide-39
SLIDE 39

data science: practice

  • befriend experimentalists
slide-40
SLIDE 40

data science: skills

slide-41
SLIDE 41

data science: skills

  • find quantifiables
slide-42
SLIDE 42

data science: skills

  • find quantifiables (choose carefully)
slide-43
SLIDE 43

data science: skills

  • straw man first
slide-44
SLIDE 44

data science: skills

  • straw man first
slide-45
SLIDE 45

data science: skills

  • small wins before feature engineering
slide-46
SLIDE 46

data science: skills

  • data engineering before data science
slide-47
SLIDE 47

data science: culture

slide-48
SLIDE 48

data science: culture

  • be communicative
slide-49
SLIDE 49

data science: culture

  • be communicative

(promote rhetorical literacy)

slide-50
SLIDE 50

data science: culture

  • be communicative

(promote rhetorical literacy)

  • related: strive to build models

which are both predictive and interpretable

slide-51
SLIDE 51

data science: culture

  • be skeptical

(promote critical literacy)

slide-52
SLIDE 52

data science: culture

  • be empowering
slide-53
SLIDE 53

data science: culture

  • be transparent
slide-54
SLIDE 54

data science: culture

  • promote literacy:

functional critical rhetorical

  • (cf. Selber, Multiliteracies for a Digital Age. 2004)
slide-55
SLIDE 55

data science: culture

  • promote literacies:
  • 1. functional
  • 2. critical
  • 3. rhetorical
  • (cf. Selber, Multiliteracies for a Digital Age. 2004)
slide-56
SLIDE 56

data science: culture

  • promote literacies:
  • 1. functional
  • 2. critical
  • 3. rhetorical
  • (cf. Selber, Multiliteracies for a Digital Age. 2004)
slide-57
SLIDE 57

data science: culture

  • promote literacies:
  • 1. functional
  • 2. critical
  • 3. rhetorical
  • (cf. Selber, Multiliteracies for a Digital Age. 2004)
slide-58
SLIDE 58

data science: culture

  • promote literacies:
  • 1. functional
  • 2. critical
  • 3. rhetorical
  • (cf. Selber, Multiliteracies for a Digital Age. 2004)
slide-59
SLIDE 59

</body>

i.e., <footer>

slide-60
SLIDE 60

summary:

slide-61
SLIDE 61

summary: pay attention to:

  • 1. practices
  • 2. skills
  • 3. culture
slide-62
SLIDE 62

practices:

  • 1. reframe questions as ML
  • 2. better wrong than "nice"
  • 3. be relevant
  • 4. aim for hypothesis vs data jeapordy
  • 5. befriend experimentalists
slide-63
SLIDE 63

skills:

  • 1. find quantifiables
  • 2. straw man first
  • 3. small wins before feature engineering
  • 4. data engineering before data science
slide-64
SLIDE 64

culture:

  • 1. be communicative
  • 2. be skeptical
  • 3. be empowering
  • 4. be transparent
  • 5. promote literacies
slide-65
SLIDE 65

find out more!

  • 1. postdoc/student opportunities:

chris.wiggins@columbia.edu

  • 2. always hiring:

chris.wiggins@nytimes.com

  • 3. let’s talk:
  • @chrishwiggins
  • gist.github.com/chrishwiggins/
slide-66
SLIDE 66

what is a computational biologist doing at the New York Times?

  • (and what can academia do for a

163-year old company?)

chris.wiggins@columbia.edu chris.wiggins@nytimes.com chris.wiggins@hackNY.org @chrishwiggins