SLIDE 1 what is a computational biologist doing at the New York Times?
- (and what can academia do for a
163-year old company?)
chris.wiggins@columbia.edu chris.wiggins@nytimes.com chris.wiggins@hackNY.org @chrishwiggins
SLIDE 2
context/background
SLIDE 3
context/background
(before ‘the talk’)
SLIDE 4
biology: 1892 vs. 1995 biology changed for good.
SLIDE 5
genetics: 1837 vs. 2012 from “segments” to algorithms
SLIDE 6
genetics: 1837 vs. 2012 from intuition to prediction
SLIDE 7
data science: web scale
SLIDE 8
example: 163 yr old
SLIDE 9
bit.ly/nyt-interactive-2013
SLIDE 10
R+D: nytlabs.com
SLIDE 11
developer.nytimes.com: 2008
SLIDE 12
SLIDE 13
SLIDE 14
SLIDE 15
SLIDE 16
example: millions of views per hour
SLIDE 17
from “segments” to algorithms
insert figure here
SLIDE 18
from intuition to prediction
insert figure here
SLIDE 19
data science: the web
SLIDE 20
data science: the web is your “online presence”
SLIDE 21
data science: the web is a microscope
SLIDE 22
data science: the web is an experimental tool
SLIDE 23
data science: the web is an optimization tool
SLIDE 24
</header>
SLIDE 25
</header>
i.e., <body>
SLIDE 26
common requirements in data science:
SLIDE 27 common requirements in data science:
- 1. practices
- 2. skills
- 3. culture
SLIDE 28 common requirements in data science:
- 1. practices
- 2. skills
- 3. culture
SLIDE 29 common requirements in data science:
- 1. practices
- 2. skills
- 3. culture
SLIDE 30
data science: practice
SLIDE 31 data science: practice
as machine learning tasks
SLIDE 32 data science: practice
SLIDE 33 data science: practice
SLIDE 34 data science: practice
SLIDE 35 data science: practice
SLIDE 36 data science: practice
- hypotheses are not data jeopardy
SLIDE 37 data science: practice
- befriend experimentalists
SLIDE 38 data science: practice
- befriend experimentalists
SLIDE 39 data science: practice
- befriend experimentalists
SLIDE 40
data science: skills
SLIDE 41 data science: skills
SLIDE 42 data science: skills
- find quantifiables (choose carefully)
SLIDE 43 data science: skills
SLIDE 44 data science: skills
SLIDE 45 data science: skills
- small wins before feature engineering
SLIDE 46 data science: skills
- data engineering before data science
SLIDE 47
data science: culture
SLIDE 48 data science: culture
SLIDE 49 data science: culture
(promote rhetorical literacy)
SLIDE 50 data science: culture
(promote rhetorical literacy)
- related: strive to build models
which are both predictive and interpretable
SLIDE 51 data science: culture
(promote critical literacy)
SLIDE 52 data science: culture
SLIDE 53 data science: culture
SLIDE 54 data science: culture
functional critical rhetorical
- (cf. Selber, Multiliteracies for a Digital Age. 2004)
SLIDE 55 data science: culture
- promote literacies:
- 1. functional
- 2. critical
- 3. rhetorical
- (cf. Selber, Multiliteracies for a Digital Age. 2004)
SLIDE 56 data science: culture
- promote literacies:
- 1. functional
- 2. critical
- 3. rhetorical
- (cf. Selber, Multiliteracies for a Digital Age. 2004)
SLIDE 57 data science: culture
- promote literacies:
- 1. functional
- 2. critical
- 3. rhetorical
- (cf. Selber, Multiliteracies for a Digital Age. 2004)
SLIDE 58 data science: culture
- promote literacies:
- 1. functional
- 2. critical
- 3. rhetorical
- (cf. Selber, Multiliteracies for a Digital Age. 2004)
SLIDE 59
</body>
i.e., <footer>
SLIDE 60
summary:
SLIDE 61 summary: pay attention to:
- 1. practices
- 2. skills
- 3. culture
SLIDE 62 practices:
- 1. reframe questions as ML
- 2. better wrong than "nice"
- 3. be relevant
- 4. aim for hypothesis vs data jeapordy
- 5. befriend experimentalists
SLIDE 63 skills:
- 1. find quantifiables
- 2. straw man first
- 3. small wins before feature engineering
- 4. data engineering before data science
SLIDE 64 culture:
- 1. be communicative
- 2. be skeptical
- 3. be empowering
- 4. be transparent
- 5. promote literacies
SLIDE 65 find out more!
- 1. postdoc/student opportunities:
chris.wiggins@columbia.edu
chris.wiggins@nytimes.com
- 3. let’s talk:
- @chrishwiggins
- gist.github.com/chrishwiggins/
SLIDE 66 what is a computational biologist doing at the New York Times?
- (and what can academia do for a
163-year old company?)
chris.wiggins@columbia.edu chris.wiggins@nytimes.com chris.wiggins@hackNY.org @chrishwiggins